Important features:
Signal peptide:
amino acids 1-22

N-myristoylation sites:
amino acids 103-109, 163-169

cAMP- and cGMP-dependent protein kinase phosphorylation site:
amino acids 53-57

(57) Abstract: The present invention is directed to novel polypeptides and to nucleic acid molecules encoding those polypeptides.
Also provided herein are vectors and host cells comprising those nucleic acid sequences, chimeric polypeptide molecules comprising
the polypeptides of the present invention fused to heterologous polypeptide sequences, antibodies which bind to the polypeptides of
the present invention and to methods for producing the polypeptides of the present invention.
SECRETED AND TRANSMEMBRANE POLYPEPTIDES AND NUCLEIC ACIDS ENCODING THE SAME

FIELD OF THE INVENTION
The present invention relates generally to the identification and isolation of novel DNA and to the
recombinant production of novel polypeptides.

BACKGROUND OF THE INVENTION
Extracellular proteins play important roles in, among other things, the formation, differentiation and
maintenance of multicellular organisms. The fate of many individual cells, e.g., proliferation, migration,
differentiation, or interaction with other cells, is typically governed by information received from other cells
and/or the immediate environment. This information is often transmitted by secreted polypeptides (for instance,
mitogenic factors, survival factors, cytotoxic factors, differentiation factors, neuropeptides, and hormones) which
are, in turn, received and interpreted by diverse cell receptors or membrane-bound proteins. These secreted
polypeptides or signaling molecules normally pass through the cellular secretory pathway to reach their site of
action in the extracellular environment.

Secreted proteins have various industrial applications, including as pharmaceuticals, diagnostics,
biosensors and bioreactors. Most protein drugs available at present, such as thrombolytic agents, interferons,
interleukins, erythropoietins, colony stimulating factors, and various other cytokines, are secretory proteins.
Their receptors, which are membrane proteins, also have potential as therapeutic or diagnostic agents. Efforts
are being undertaken by both industry and academia to identify new, native secreted proteins. Many efforts are
focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel
secreted proteins. Examples of screening methods and techniques are described in the literature [see, for
example, Klein et al., Proc. Natl. Acad. Sci. 93:7108-7113 (1996); U.S. Patent No. 5,536,637].

Membrane-bound proteins and receptors can play important roles in, among other things, the formation,
differentiation and maintenance of multicellular organisms. The fate of many individual cells, e.g., proliferation,
migration, differentiation, or interaction with other cells, is typically governed by information received from
other cells and/or the immediate environment. This information is often transmitted by secreted polypeptides
(for instance, mitogenic factors, survival factors, cytotoxic factors, differentiation factors, neuropeptides, and
hormones) which are, in turn, received and interpreted by diverse cell receptors or membrane-bound proteins.
Such membrane-bound proteins and cell receptors include, but are not limited to, cytokine receptors, receptor
kinases, receptor phosphatases, receptors involved in cell-cell interactions, and cellular adhesin molecules like
selectins and integrins. For instance, transduction of signals that regulate cell growth and differentiation is
regulated in part by phosphorylation of various cellular proteins. Protein tyrosine kinases, enzymes that catalyze
that process, can also act as growth factor receptors. Examples include fibroblast growth factor receptor and
nerve growth factor receptor.

Membrane-bound proteins and receptor molecules have various industrial applications, including as
pharmaceutical and diagnostic agents. Receptor immunoadhesins, for instance, can be employed as therapeutic
agents to block receptor-ligand interactions. The membrane-bound proteins can also be employed for screening
of potential peptide or small molecule inhibitors of the relevant receptor/ligand interaction.

Efforts are being undertaken by both industry and academia to identify new, native receptor or
membrane-bound proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries 
to identify the coding sequences for novel receptor or membrane-bound proteins. 

SUMMARY OF THE INVENTION
In one embodiment, the invention provides an isolated nucleic acid molecule comprising a nucleotide 
sequence that encodes a PRO polypeptide. 

In one aspect, the isolated nucleic acid molecule comprises a nucleotide sequence having at least about 
80% nucleic acid sequence identity, alternatively at least about 81% nucleic acid sequence identity, alternatively
at least about 82% nucleic acid sequence identity, alternatively at least about 83% nucleic acid sequence identity,
alternatively at least about 84% nucleic acid sequence identity, alternatively at least about 85% nucleic acid 
sequence identity, alternatively at least about 86% nucleic acid sequence identity, alternatively at least about 87% 
nucleic acid sequence identity, alternatively at least about 88% nucleic acid sequence identity, alternatively at 
least about 89% nucleic acid sequence identity, alternatively at least about 90% nucleic acid sequence identity, 
alternatively at least about 91% nucleic acid sequence identity, alternatively at least about 92% nucleic acid 
sequence identity, alternatively at least about 93% nucleic acid sequence identity, alternatively at least about 94% 
nucleic acid sequence identity, alternatively at least about 95% nucleic acid sequence identity, alternatively at 
least about 96% nucleic acid sequence identity, alternatively at least about 97% nucleic acid sequence identity, 
alternatively at least about 98% nucleic acid sequence identity and alternatively at least about 99% nucleic acid 
sequence identity to (a) a DNA molecule encoding a PRO polypeptide having a full-length amino acid sequence 
as disclosed herein, an amino acid sequence lacking the signal peptide as disclosed herein, an extracellular
domain of a transmembrane protein, with or without the signal peptide, as disclosed herein or any other
specifically defined fragment of the full-length amino acid sequence as disclosed herein, or (b) the complement
of the DNA molecule of (a). 
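The percentage thresholds recited above can be illustrated with a minimal sketch. It assumes a simplified definition of identity (identical, non-gap columns divided by the total number of columns in a pairwise alignment that has already been computed); the controlling definition is the one given elsewhere in the specification, and the function names and toy sequences below are hypothetical. The word "about" in "at least about 80%" is not modeled.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of two pre-aligned sequences of equal length.

    Assumption (not taken from the specification): identity is the number
    of columns holding the same non-gap residue, divided by the total
    number of alignment columns, times 100.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != gap
    )
    return 100.0 * matches / len(aligned_a)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 80.0) -> bool:
    """True if the aligned pair reaches the given percent-identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Toy sequences: nine of ten aligned positions are identical.
    print(percent_identity("ACGTACGTAC", "ACGTACGTTC"))
    print(meets_threshold("ACGTACGTAC", "ACGTACGTTC", threshold=80.0))
```

Each "alternatively at least about N%" clause corresponds to calling `meets_threshold` with a different `threshold` value between 80 and 99.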

In other aspects, the isolated nucleic acid molecule comprises a nucleotide sequence having at least about 
80% nucleic acid sequence identity, alternatively at least about 81% nucleic acid sequence identity, alternatively
at least about 82% nucleic acid sequence identity, alternatively at least about 83% nucleic acid sequence identity,
alternatively at least about 84% nucleic acid sequence identity, alternatively at least about 85% nucleic acid 
sequence identity, alternatively at least about 86% nucleic acid sequence identity, alternatively at least about 87% 
nucleic acid sequence identity, alternatively at least about 88% nucleic acid sequence identity, alternatively at 
least about 89% nucleic acid sequence identity, alternatively at least about 90% nucleic acid sequence identity, 
alternatively at least about 91% nucleic acid sequence identity, alternatively at least about 92% nucleic acid
sequence identity, alternatively at least about 93% nucleic acid sequence identity, alternatively at least about 94%



WO 01/40466 PCT/US00/32678

nucleic acid sequence identity, alternatively at least about 95% nucleic acid sequence identity, alternatively at 
least about 96% nucleic acid sequence identity, alternatively at least about 97% nucleic acid sequence identity, 
alternatively at least about 98% nucleic acid sequence identity and alternatively at least about 99% nucleic acid 
sequence identity to (a) a DNA molecule comprising the coding sequence of a full-length PRO polypeptide cDNA
as disclosed herein, the coding sequence of a PRO polypeptide lacking the signal peptide as disclosed herein, 
the coding sequence of an extracellular domain of a transmembrane PRO polypeptide, with or without the signal 
peptide, as disclosed herein or the coding sequence of any other specifically defined fragment of the full-length 
amino acid sequence as disclosed herein, or (b) the complement of the DNA molecule of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising a nucleotide 
sequence having at least about 80% nucleic acid sequence identity, alternatively at least about 81% nucleic acid
sequence identity, alternatively at least about 82% nucleic acid sequence identity, alternatively at least about 83%
nucleic acid sequence identity, alternatively at least about 84% nucleic acid sequence identity, alternatively at 
least about 85% nucleic acid sequence identity, alternatively at least about 86% nucleic acid sequence identity, 
alternatively at least about 87% nucleic acid sequence identity, alternatively at least about 88% nucleic acid 
sequence identity, alternatively at least about 89% nucleic acid sequence identity, alternatively at least about 90% 
nucleic acid sequence identity, alternatively at least about 91% nucleic acid sequence identity, alternatively at
least about 92% nucleic acid sequence identity, alternatively at least about 93% nucleic acid sequence identity,
alternatively at least about 94% nucleic acid sequence identity, alternatively at least about 95% nucleic acid 
sequence identity, alternatively at least about 96% nucleic acid sequence identity, alternatively at least about 97% 
nucleic acid sequence identity, alternatively at least about 98% nucleic acid sequence identity and alternatively
at least about 99% nucleic acid sequence identity to (a) a DNA molecule that encodes the same mature
polypeptide encoded by any of the human protein cDNAs deposited with the ATCC as disclosed herein, or (b) 
the complement of the DNA molecule of (a). 
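Each of the aspects above covers both a DNA molecule (a) and its complement (b). As a minimal sketch, assuming "complement" means the Watson-Crick complementary strand, the complementary strand read in the conventional 5' to 3' direction is the reverse complement. The function names and the toy sequence are hypothetical, and only unambiguous bases (A, C, G, T) are handled.

```python
# Watson-Crick pairing for unambiguous DNA bases; case is preserved.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def complement(seq: str) -> str:
    """Base-by-base complement, in the same left-to-right order as the input."""
    return seq.translate(_COMPLEMENT)


def reverse_complement(seq: str) -> str:
    """Complementary strand written 5' to 3'."""
    return complement(seq)[::-1]


if __name__ == "__main__":
    strand = "ATGC"  # toy sequence, not taken from the specification
    print(complement(strand))
    print(reverse_complement(strand))
```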

In another aspect, the invention provides an isolated nucleic acid molecule comprising a nucleotide
sequence encoding a PRO polypeptide which is either transmembrane domain-deleted or transmembrane
domain-inactivated, or is complementary to such encoding nucleotide sequence, wherein the transmembrane domain(s)
of such polypeptide are disclosed herein. Therefore, soluble extracellular domains of the herein described PRO
polypeptides are contemplated.

Another embodiment is directed to fragments of a PRO polypeptide coding sequence, or the complement 
thereof, that may find use as, for example, hybridization probes, for encoding fragments of a PRO polypeptide 
that may optionally encode a polypeptide comprising a binding site for an anti-PRO antibody or as antisense
oligonucleotide probes. Such nucleic acid fragments are usually at least about 10 nucleotides in length, 
alternatively at least about 15 nucleotides in length, alternatively at least about 20 nucleotides in length, 
alternatively at least about 30 nucleotides in length, alternatively at least about 40 nucleotides in length, 
alternatively at least about 50 nucleotides in length, alternatively at least about 60 nucleotides in length,
alternatively at least about 70 nucleotides in length, alternatively at least about 80 nucleotides in length,
alternatively at least about 90 nucleotides in length, alternatively at least about 100 nucleotides in length,
alternatively at least about 110 nucleotides in length, alternatively at least about 120 nucleotides in length,
alternatively at least about 130 nucleotides in length, alternatively at least about 140 nucleotides in length,
alternatively at least about 150 nucleotides in length, alternatively at least about 160 nucleotides in length, 
alternatively at least about 170 nucleotides in length, alternatively at least about 180 nucleotides in length, 
alternatively at least about 190 nucleotides in length, alternatively at least about 200 nucleotides in length, 
alternatively at least about 250 nucleotides in length, alternatively at least about 300 nucleotides in length, 
alternatively at least about 350 nucleotides in length, alternatively at least about 400 nucleotides in length,
alternatively at least about 450 nucleotides in length, alternatively at least about 500 nucleotides in length, 
alternatively at least about 600 nucleotides in length, alternatively at least about 700 nucleotides in length,
alternatively at least about 800 nucleotides in length, alternatively at least about 900 nucleotides in length and 
alternatively at least about 1000 nucleotides in length, wherein in this context the term "about" means the 
referenced nucleotide sequence length plus or minus 10% of that referenced length. It is noted that novel
fragments of a PRO polypeptide-encoding nucleotide sequence may be determined in a routine manner by
aligning the PRO polypeptide-encoding nucleotide sequence with other known nucleotide sequences using any
of a number of well known sequence alignment programs and determining which PRO polypeptide-encoding
nucleotide sequence fragment(s) are novel. All of such PRO polypeptide-encoding nucleotide sequences are
contemplated herein. Also contemplated are the PRO polypeptide fragments encoded by these nucleotide
molecule fragments, preferably those PRO polypeptide fragments that comprise a binding site for an anti-PRO 
antibody. 
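The definition of "about" given above (the referenced length plus or minus 10% of that length) is simple arithmetic and can be sketched as follows. The function names are hypothetical, and reading "at least about N nucleotides" as "at or above the lower end of the about-N range" is an assumption made for illustration only.

```python
from typing import Tuple


def about_length_range(referenced_length: int, tolerance_percent: int = 10) -> Tuple[float, float]:
    """Range covered by 'about N nucleotides': N minus 10% of N to N plus 10% of N."""
    delta = referenced_length * tolerance_percent / 100
    return (referenced_length - delta, referenced_length + delta)


def satisfies_at_least_about(fragment_length: int, referenced_length: int) -> bool:
    """Assumed reading: the fragment is no shorter than the lower end of the range."""
    lower, _upper = about_length_range(referenced_length)
    return fragment_length >= lower


if __name__ == "__main__":
    # 'about 100 nucleotides' spans 90 to 110 nucleotides.
    print(about_length_range(100))
    # Under the assumed reading, a 9-nucleotide fragment is 'at least about 10'.
    print(satisfies_at_least_about(9, 10))
```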

In another embodiment, the invention provides isolated PRO polypeptide encoded by any of the isolated
nucleic acid sequences hereinabove identified. 

In a certain aspect, the invention concerns an isolated PRO polypeptide, comprising an amino acid
sequence having at least about 80% amino acid sequence identity, alternatively at least about 81% amino acid
sequence identity, alternatively at least about 82% amino acid sequence identity, alternatively at least about 83%
amino acid sequence identity, alternatively at least about 84% amino acid sequence identity, alternatively at least
about 85% amino acid sequence identity, alternatively at least about 86% amino acid sequence identity, 
alternatively at least about 87% amino acid sequence identity, alternatively at least about 88% amino acid 
sequence identity, alternatively at least about 89% amino acid sequence identity, alternatively at least about 90% 
amino acid sequence identity, alternatively at least about 91% amino acid sequence identity, alternatively at least
about 92% amino acid sequence identity, alternatively at least about 93% amino acid sequence identity, 
alternatively at least about 94% amino acid sequence identity, alternatively at least about 95% amino acid 
sequence identity, alternatively at least about 96% amino acid sequence identity, alternatively at least about 97%
amino acid sequence identity, alternatively at least about 98% amino acid sequence identity and alternatively at
least about 99% amino acid sequence identity to a PRO polypeptide having a full-length amino acid sequence
as disclosed herein, an amino acid sequence lacking the signal peptide as disclosed herein, an extracellular
domain of a transmembrane protein, with or without the signal peptide, as disclosed herein or any other
specifically defined fragment of the full-length amino acid sequence as disclosed herein.

In a further aspect, the invention concerns an isolated PRO polypeptide comprising an amino acid
sequence having at least about 80% amino acid sequence identity, alternatively at least about 81% amino acid
sequence identity, alternatively at least about 82% amino acid sequence identity, alternatively at least about 83%
amino acid sequence identity, alternatively at least about 84% amino acid sequence identity, alternatively at least
about 85% amino acid sequence identity, alternatively at least about 86% amino acid sequence identity, 
alternatively at least about 87% amino acid sequence identity, alternatively at least about 88% amino acid 
sequence identity, alternatively at least about 89% amino acid sequence identity, alternatively at least about 90% 
amino acid sequence identity, alternatively at least about 91% amino acid sequence identity, alternatively at least
about 92% amino acid sequence identity, alternatively at least about 93% amino acid sequence identity, 
alternatively at least about 94% amino acid sequence identity, alternatively at least about 95% amino acid 
sequence identity, alternatively at least about 96% amino acid sequence identity, alternatively at least about 97% 
amino acid sequence identity, alternatively at least about 98% amino acid sequence identity and alternatively at 
least about 99% amino acid sequence identity to an amino acid sequence encoded by any of the human protein 
cDNAs deposited with the ATCC as disclosed herein. 

In a specific aspect, the invention provides an isolated PRO polypeptide without the N-terminal signal 
sequence and/or the initiating methionine and is encoded by a nucleotide sequence that encodes such an amino 
acid sequence as hereinbefore described. Processes for producing the same are also herein described, wherein
those processes comprise culturing a host cell comprising a vector which comprises the appropriate encoding
nucleic acid molecule under conditions suitable for expression of the PRO polypeptide and recovering the PRO 
polypeptide from the cell culture. 
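Removing the N-terminal signal sequence and/or the initiating methionine from a full-length amino acid sequence can be sketched as simple slicing. This is a minimal illustration only: it assumes the signal peptide occupies residues 1 through `signal_end` of the full-length sequence (a value that would come from the annotated features of a particular polypeptide), and the function names and toy sequence are hypothetical.

```python
def remove_signal_peptide(full_length: str, signal_end: int) -> str:
    """Drop residues 1..signal_end (1-based, inclusive) from the N-terminus."""
    if not 0 <= signal_end <= len(full_length):
        raise ValueError("signal_end must lie within the sequence")
    return full_length[signal_end:]


def remove_initiating_methionine(sequence: str) -> str:
    """Drop a leading methionine (one-letter code 'M') if one is present."""
    return sequence[1:] if sequence.startswith("M") else sequence


if __name__ == "__main__":
    toy = "MKLLVAAGQE"  # hypothetical sequence, not from the specification
    print(remove_signal_peptide(toy, signal_end=5))
    print(remove_initiating_methionine(toy))
```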

In another aspect, the invention provides an isolated PRO polypeptide which is either transmembrane
domain-deleted or transmembrane domain-inactivated. Processes for producing the same are also herein 
described, wherein those processes comprise culturing a host cell comprising a vector which comprises the 
appropriate encoding nucleic acid molecule under conditions suitable for expression of the PRO polypeptide and 
recovering the PRO polypeptide from the cell culture.

In yet another embodiment, the invention concerns agonists and antagonists of a native PRO polypeptide 
as defined herein. In a particular embodiment, the agonist or antagonist is an anti-PRO antibody or a small 



In a further embodiment, the invention concerns a method of identifying agonists or antagonists to a 
PRO polypeptide which comprises contacting the PRO polypeptide with a candidate molecule and monitoring a
biological activity mediated by said PRO polypeptide. Preferably, the PRO polypeptide is a native PRO
polypeptide. 

In a still further embodiment, the invention concerns a composition of matter comprising a PRO 
polypeptide, or an agonist or antagonist of a PRO polypeptide as herein described, or an anti-PRO antibody, in
combination with a carrier. Optionally, the carrier is a pharmaceutically acceptable carrier.

Another embodiment of the present invention is directed to the use of a PRO polypeptide, or an agonist 
or antagonist thereof as hereinbefore described, or an anti-PRO antibody, for the preparation of a medicament
useful in the treatment of a condition which is responsive to the PRO polypeptide, an agonist or antagonist 
thereof or an anti-PRO antibody. 
In other embodiments of the present invention, the invention provides vectors comprising DNA
encoding any of the herein described polypeptides. Host cells comprising any such vector are also provided. By
way of example, the host cells may be CHO cells, E. coli, or yeast. A process for producing any of the herein
described polypeptides is further provided and comprises culturing host cells under conditions suitable for
expression of the desired polypeptide and recovering the desired polypeptide from the cell culture.

In other embodiments, the invention provides chimeric molecules comprising any of the herein described
polypeptides fused to a heterologous polypeptide or amino acid sequence. Examples of such chimeric molecules
comprise any of the herein described polypeptides fused to an epitope tag sequence or a Fc region of an 
immunoglobulin. 

In another embodiment, the invention provides an antibody which binds, preferably specifically, to any
of the above or below described polypeptides. Optionally, the antibody is a monoclonal antibody, humanized
antibody, antibody fragment or single-chain antibody.

In yet other embodiments, the invention provides oligonucleotide probes which may be useful for
isolating genomic and cDNA nucleotide sequences, measuring or detecting expression of an associated gene or
as antisense probes, wherein those probes may be derived from any of the above or below described nucleotide
sequences. Preferred probe lengths are described above.

In yet other embodiments, the present invention is directed to methods of using the PRO polypeptides
of the present invention for a variety of uses based upon the functional biological assay data presented in the
Examples below. 
BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 shows a nucleotide sequence (SEQ ID NO:1) of a native sequence PRO177 cDNA, wherein
SEQ ID NO:1 is a clone designated herein as "DNA16438-1387".

Figure 2 shows the amino acid sequence (SEQ ID NO:2) derived from the coding sequence of SEQ ID
NO:1 shown in Figure 1.

Figure 3 shows a nucleotide sequence (SEQ ID NO:3) of a native sequence PRO3574 cDNA, wherein
SEQ ID NO:3 is a clone designated herein as "DNA19360-2552".

Figure 4 shows the amino acid sequence (SEQ ID NO:4) derived from the coding sequence of SEQ ID
NO:3 shown in Figure 3.

Figure 5 shows a nucleotide sequence (SEQ ID NO:5) of a native sequence PRO1280 cDNA, wherein
SEQ ID NO:5 is a clone designated herein as "DNA33455-1548".

Figure 6 shows the amino acid sequence (SEQ ID NO:6) derived from the coding sequence of SEQ ID
NO:5 shown in Figure 5.

Figure 7 shows a nucleotide sequence (SEQ ID NO:7) of a native sequence PRO4984 cDNA, wherein
SEQ ID NO:7 is a clone designated herein as "DNA37155-2651".

Figure 8 shows the amino acid sequence (SEQ ID NO:8) derived from the coding sequence of SEQ ID
NO:7 shown in Figure 7.
Figure 9 shows a nucleotide sequence (SEQ ID NO:9) of a native sequence PRO4988 cDNA, wherein
SEQ ID NO:9 is a clone designated herein as "DNA38269-2654".

Figure 10 shows the amino acid sequence (SEQ ID NO:10) derived from the coding sequence of SEQ
ID NO:9 shown in Figure 9.

Figure 11 shows a nucleotide sequence (SEQ ID NO:11) of a native sequence PRO305 cDNA, wherein
SEQ ID NO:11 is a clone designated herein as "DNA40619-1220".

Figure 12 shows the amino acid sequence (SEQ ID NO:12) derived from the coding sequence of SEQ
ID NO:11 shown in Figure 11.

Figure 13 shows a nucleotide sequence (SEQ ID NO:13) of a native sequence PRO1866 cDNA, wherein
SEQ ID NO:13 is a clone designated herein as "DNA44174-2513".

Figure 14 shows the amino acid sequence (SEQ ID NO:14) derived from the coding sequence of SEQ
ID NO:13 shown in Figure 13.

Figure 15 shows a nucleotide sequence (SEQ ID NO:15) of a native sequence PRO4996 cDNA, wherein
SEQ ID NO:15 is a clone designated herein as "DNA44675-2662".

Figure 16 shows the amino acid sequence (SEQ ID NO:16) derived from the coding sequence of SEQ
ID NO:15 shown in Figure 15.

Figure 17 shows a nucleotide sequence (SEQ ID NO:17) of a native sequence PRO4406 cDNA, wherein
SEQ ID NO:17 is a clone designated herein as "DNA45408-2615".

Figure 18 shows the amino acid sequence (SEQ ID NO:18) derived from the coding sequence of SEQ
ID NO:17 shown in Figure 17.
Figure 19 shows a nucleotide sequence (SEQ ID NO:19) of a native sequence PRO1120 cDNA, wherein
SEQ ID NO:19 is a clone designated herein as "DNA48606-1479".

Figure 20 shows the amino acid sequence (SEQ ID NO:20) derived from the coding sequence of SEQ
ID NO:19 shown in Figure 19.

Figure 21 shows a nucleotide sequence (SEQ ID NO:21) of a native sequence PRO4990 cDNA, wherein
SEQ ID NO:21 is a clone designated herein as "DNA52753-2656".

Figure 22 shows the amino acid sequence (SEQ ID NO:22) derived from the coding sequence of SEQ
ID NO:21 shown in Figure 21.

Figure 23 shows a nucleotide sequence (SEQ ID NO:23) of a native sequence PRO738 cDNA, wherein
SEQ ID NO:23 is a clone designated herein as "DNA53915-1258".

Figure 24 shows the amino acid sequence (SEQ ID NO:24) derived from the coding sequence of SEQ
ID NO:23 shown in Figure 23.

Figure 25 shows a nucleotide sequence (SEQ ID NO:25) of a native sequence PRO3577 cDNA, wherein
SEQ ID NO:25 is a clone designated herein as "DNA53991-2553".

Figure 26 shows the amino acid sequence (SEQ ID NO:26) derived from the coding sequence of SEQ
ID NO:25 shown in Figure 25.

Figure 27 shows a nucleotide sequence (SEQ ID NO:27) of a native sequence PRO1879 cDNA, wherein
SEQ ID NO:27 is a clone designated herein as "DNA54009-2517".
Figure 28 shows the amino acid sequence (SEQ ID NO:28) derived from the coding sequence of SEQ
ID NO:27 shown in Figure 27.

Figure 29 shows a nucleotide sequence (SEQ ID NO:29) of a native sequence PRO1471 cDNA, wherein
SEQ ID NO:29 is a clone designated herein as "DNA56055-1643".

Figure 30 shows the amino acid sequence (SEQ ID NO:30) derived from the coding sequence of SEQ
ID NO:29 shown in Figure 29.

Figure 31 shows a nucleotide sequence (SEQ ID NO:31) of a native sequence PRO1114 cDNA, wherein
SEQ ID NO:31 is a clone designated herein as "DNA57033-1403".

Figure 32 shows the amino acid sequence (SEQ ID NO:32) derived from the coding sequence of SEQ
ID NO:31 shown in Figure 31.

Figure 33 shows a nucleotide sequence (SEQ ID NO:33) of a native sequence PRO1076 cDNA, wherein
SEQ ID NO:33 is a clone designated herein as "DNA57252-1453".

Figure 34 shows the amino acid sequence (SEQ ID NO:34) derived from the coding sequence of SEQ
ID NO:33 shown in Figure 33.

Figure 35 shows a nucleotide sequence (SEQ ID NO:35) of a native sequence PRO1483 cDNA, wherein
SEQ ID NO:35 is a clone designated herein as "DNA58799-1652".

Figure 36 shows the amino acid sequence (SEQ ID NO:36) derived from the coding sequence of SEQ
ID NO:35 shown in Figure 35.

Figure 37 shows a nucleotide sequence (SEQ ID NO:37) of a native sequence PRO4985 cDNA, wherein
SEQ ID NO:37 is a clone designated herein as "DNA59770-2652".

Figure 38 shows the amino acid sequence (SEQ ID NO:38) derived from the coding sequence of SEQ
ID NO:37 shown in Figure 37.

Figure 39 shows a nucleotide sequence (SEQ ID NO:39) of a native sequence PRO5000 cDNA, wherein
SEQ ID NO:39 is a clone designated herein as "DNA59774-2665".

Figure 40 shows the amino acid sequence (SEQ ID NO:40) derived from the coding sequence of SEQ
ID NO:39 shown in Figure 39.

Figure 41 shows a nucleotide sequence (SEQ ID NO:41) of a native sequence PRO1881 cDNA, wherein
SEQ ID NO:41 is a clone designated herein as "DNA60281-2518".

Figure 42 shows the amino acid sequence (SEQ ID NO:42) derived from the coding sequence of SEQ
ID NO:41 shown in Figure 41.

Figure 43 shows a nucleotide sequence (SEQ ID NO:43) of a native sequence PRO4314 cDNA, wherein
SEQ ID NO:43 is a clone designated herein as "DNA60736-2559".

Figure 44 shows the amino acid sequence (SEQ ID NO:44) derived from the coding sequence of SEQ
ID NO:43 shown in Figure 43.

Figure 45 shows a nucleotide sequence (SEQ ID NO:45) of a native sequence PRO4987 cDNA, wherein
SEQ ID NO:45 is a clone designated herein as "DNA61875-2653".

Figure 46 shows the amino acid sequence (SEQ ID NO:46) derived from the coding sequence of SEQ
ID NO:45 shown in Figure 45.
Figure 47 shows a nucleotide sequence (SEQ ID NO:47) of a native sequence PRO4313 cDNA, wherein
SEQ ID NO:47 is a clone designated herein as "DNA62312-2558".

Figure 48 shows the amino acid sequence (SEQ ID NO:48) derived from the coding sequence of SEQ
ID NO:47 shown in Figure 47.

Figure 49 shows a nucleotide sequence (SEQ ID NO:49) of a native sequence PRO4799 cDNA, wherein
SEQ ID NO:49 is a clone designated herein as "DNA62849-1604".

Figure 50 shows the amino acid sequence (SEQ ID NO:50) derived from the coding sequence of SEQ
ID NO:49 shown in Figure 49.

Figure 51 shows a nucleotide sequence (SEQ ID NO:51) of a native sequence PRO4995 cDNA, wherein
SEQ ID NO:51 is a clone designated herein as "DNA66307-2661".

Figure 52 shows the amino acid sequence (SEQ ID NO:52) derived from the coding sequence of SEQ
ID NO:51 shown in Figure 51.

Figure 53 shows a nucleotide sequence (SEQ ID NO:53) of a native sequence PRO1341 cDNA, wherein
SEQ ID NO:53 is a clone designated herein as "DNA66677-2535".

Figure 54 shows the amino acid sequence (SEQ ID NO:54) derived from the coding sequence of SEQ
ID NO:53 shown in Figure 53.

Figure 55 shows a nucleotide sequence (SEQ ID NO:55) of a native sequence PRO1777 cDNA, wherein
SEQ ID NO:55 is a clone designated herein as "DNA71235-1706".

Figure 56 shows the amino acid sequence (SEQ ID NO:56) derived from the coding sequence of SEQ
ID NO:55 shown in Figure 55.

Figure 57 shows a nucleotide sequence (SEQ ID NO:57) of a native sequence PRO3580 cDNA, wherein
SEQ ID NO:57 is a clone designated herein as "DNA71289-2547".

Figure 58 shows the amino acid sequence (SEQ ID NO:58) derived from the coding sequence of SEQ
ID NO:57 shown in Figure 57.

Figure 59 shows a nucleotide sequence (SEQ ID NO:59) of a native sequence PRO1779 cDNA, wherein
SEQ ID NO:59 is a clone designated herein as "DNA73775-1707".

Figure 60 shows the amino acid sequence (SEQ ID NO:60) derived from the coding sequence of SEQ
ID NO:59 shown in Figure 59.

Figure 61 shows a nucleotide sequence (SEQ ID NO:61) of a native sequence PRO1754 cDNA, wherein
SEQ ID NO:61 is a clone designated herein as "DNA76385-1692".

Figure 62 shows the amino acid sequence (SEQ ID NO:62) derived from the coding sequence of SEQ
ID NO:61 shown in Figure 61.

Figure 63 shows a nucleotide sequence (SEQ ID NO:63) of a native sequence PRO1906 cDNA, wherein
SEQ ID NO:63 is a clone designated herein as "DNA76395-2527".

Figure 64 shows the amino acid sequence (SEQ ID NO:64) derived from the coding sequence of SEQ
ID NO:63 shown in Figure 63.

Figure 65 shows a nucleotide sequence (SEQ ID NO:65) of a native sequence PRO1870 cDNA, wherein
SEQ ID NO:65 is a clone designated herein as "DNA77622-2516".
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Figure 66 shows the amino acid sequence (SEQ ID NO:66) derived from the coding sequence of SEQ 
ID NO:65 shown in Figure 65. 

Figure67showsanucleotidesequence(SEQ ID NO:67)ofanative sequence PR04329cDN^ 
SEQ ID NO:67 is a done designated herein as "DNA77629-2573". 

Figure 68 shows the ammo acid sequence (SEQ ID NO:68) derived from the coding sequence of SEQ 
ID NO:67 shown in Figure 67. 

Figure 69 showsanudeotidesequence(SEQ ID NO:69)ofanaiive sequence PR04979 
SEQ ID NO:69 is a clone designated herein as "DNA77645-2648". 

Figure 70 shows the amino acid sequence (SEQ ID NO:70) derived from the coding sequence of SEQ 
ID NO:69 shown in Figure 69. 

Figure 71 shows a nucleotide sequence (SEQ ID N0:71) of a native sequence PR01885 cDNA, wherein 
SEQ ID N0:71 is a clone designated herein as "DNA79302-2521 

Figure 72 shows the amino acid sequence (SEQ ID NO:72) derived from the coding sequence of SEQ 
ID NO:71 shown in Figure 71. 

Figure 73 shows a nucleotide sequence (SEQ ID NO:73) of a native sequence PRO1882 cDNA, wherein
SEQ ID NO:73 is a clone designated herein as "DNA79865-2519".

Figure 74 shows the amino acid sequence (SEQ ID NO:74) derived from the coding sequence of SEQ
ID NO:73 shown in Figure 73.

Figure 75 shows a nucleotide sequence (SEQ ID NO:75) of a native sequence PRO4989 cDNA, wherein
SEQ ID NO:75 is a clone designated herein as "DNA80135-2655".

Figure 76 shows the amino acid sequence (SEQ ID NO:76) derived from the coding sequence of SEQ
ID NO:75 shown in Figure 75.

Figure 77 shows a nucleotide sequence (SEQ ID NO:77) of a native sequence PRO4323 cDNA, wherein
SEQ ID NO:77 is a clone designated herein as "DNA80794-2568".

Figure 78 shows the amino acid sequence (SEQ ID NO:78) derived from the coding sequence of SEQ 
ID NO:77 shown in Figure 77. 

Figure 79 shows a nucleotide sequence (SEQ ID NO:79) of a native sequence
SEQ ID NO:79 is a clone designated herein as "DNA80796-2523". 

Figure 80 shows the amino acid sequence (SEQ ID NO:80) derived from the coding sequence of SEQ 
ID NO:79 shown in Figure 79. 

Figure 81 shows a nucleotide sequence (SEQ ID NO:81) of a native sequence PRO4395 cDNA, wherein
SEQ ID NO:81 is a clone designated herein as "DNA80840-2605".

Figure 82 shows the amino acid sequence (SEQ ID NO:82) derived from the coding sequence of SEQ
ID NO:81 shown in Figure 81.

Figure 83 shows a nucleotide sequence (SEQ ID NO:83) of a native sequence PRO
SEQ ID NO:83 is a clone designated herein as "DNA80899-2501".

Figure 84 shows the amino acid sequence (SEQ ID NO:84) derived from the coding sequence of SEQ
ID NO:83 shown in Figure 83.

Figure 85 shows a nucleotide sequence (SEQ ID NO:85) of a native sequence PRO4338 cDNA, wherein
SEQ ID NO:85 is a clone designated herein as "DNA81228-2580".

Figure 86 shows the amino acid sequence (SEQ ID NO:86) derived from the coding sequence of SEQ
ID NO:85 shown in Figure 85.

Figure 87 shows a nucleotide sequence (SEQ ID NO:87) of a native sequence PRO4341 cDNA, wherein
SEQ ID NO:87 is a clone designated herein as "DNA81761-2583".

Figure 88 shows the amino acid sequence (SEQ ID NO:88) derived from the coding sequence of SEQ 
ID NO:87 shown in Figure 87. 

Figure 89 shows a nucleotide sequence (SEQ ID NO:89) of a native sequence PRO5990 cDNA, wherein
SEQ ID NO:89 is a clone designated herein as "DNA96042-2682".

Figure 90 shows the amino acid sequence (SEQ ID NO:90) derived from the coding sequence of SEQ
ID NO:89 shown in Figure 89.

Figure 91 shows a nucleotide sequence (SEQ ID NO:91) of a native sequence PRO3438 cDNA, wherein
SEQ ID NO:91 is a clone designated herein as "DNA82364-2538".

Figure 92 shows the amino acid sequence (SEQ ID NO:92) derived from the coding sequence of SEQ
ID NO:91 shown in Figure 91.

Figure 93 shows a nucleotide sequence (SEQ ID NO:93) of a native sequence PRO4321 cDNA, wherein
SEQ ID NO:93 is a clone designated herein as "DNA82424-2566".

Figure 94 shows the amino acid sequence (SEQ ID NO:94) derived from the coding sequence of SEQ
ID NO:93 shown in Figure 93.

Figure 95 shows a nucleotide sequence (SEQ ID NO:95) of a native sequence PRO4304 cDNA, wherein
SEQ ID NO:95 is a clone designated herein as "DNA82430-2557".

Figure 96 shows the amino acid sequence (SEQ ID NO:96) derived from the coding sequence of SEQ 
ID NO:95 shown in Figure 95. 

Figure 97 shows a nucleotide sequence (SEQ ID NO:97) of a native sequence PRO1801 cDNA, wherein
SEQ ID NO:97 is a clone designated herein as "DNA83500-2506".

Figure 98 shows the amino acid sequence (SEQ ID NO:98) derived from the coding sequence of SEQ
ID NO:97 shown in Figure 97.

Figure 99 shows a nucleotide sequence (SEQ ID NO:99) of a native sequence PRO4403 cDNA, wherein
SEQ ID NO:99 is a clone designated herein as "DNA83509-2612".

Figure 100 shows the amino acid sequence (SEQ ID NO:100) derived from the coding sequence of SEQ
ID NO:99 shown in Figure 99.

Figure 101 shows a nucleotide sequence (SEQ ID NO:101) of a native sequence PRO4324 cDNA,
wherein SEQ ID NO:101 is a clone designated herein as "DNA83560-2569".

Figure 102 shows the amino acid sequence (SEQ ID NO:102) derived from the coding sequence of SEQ
ID NO:101 shown in Figure 101.

Figure 103 shows a nucleotide sequence (SEQ ID NO:103) of a native sequence PRO4303 cDNA,
wherein SEQ ID NO:103 is a clone designated herein as "DNA84139-2555".

Figure 104 shows the amino acid sequence (SEQ ID NO:104) derived from the coding sequence of SEQ
ID NO:103 shown in Figure 103.

Figure 105 shows a nucleotide sequence (SEQ ID NO:105) of a native sequence PRO4305 cDNA,
wherein SEQ ID NO:105 is a clone designated herein as "DNA84141-2556".

Figure 106 shows the amino acid sequence (SEQ ID NO:106) derived from the coding sequence of SEQ
ID NO:105 shown in Figure 105.

Figure 107 shows a nucleotide sequence (SEQ ID NO:107) of a native sequence PRO4404 cDNA,
wherein SEQ ID NO:107 is a clone designated herein as "DNA84142-2613".

Figure 108 shows the amino acid sequence (SEQ ID NO:108) derived from the coding sequence of SEQ
ID NO:107 shown in Figure 107.

Figure 109 shows a nucleotide sequence (SEQ ID NO:109) of a native sequence PRO1884 cDNA,
wherein SEQ ID NO:109 is a clone designated herein as "DNA84318-2520".

Figure 110 shows the amino acid sequence (SEQ ID NO:110) derived from the coding sequence of SEQ
ID NO:109 shown in Figure 109.

Figure 111 shows a nucleotide sequence (SEQ ID NO:111) of a native sequence PRO4349 cDNA,
wherein SEQ ID NO:111 is a clone designated herein as "DNA84909-2590".

Figure 112 shows the amino acid sequence (SEQ ID NO:112) derived from the coding sequence of SEQ
ID NO:111 shown in Figure 111.

Figure 113 shows a nucleotide sequence (SEQ ID NO:113) of a native sequence PRO4401 cDNA, 
wherein SEQ ID NO:113 is a clone designated herein as "DNA84912-2610". 

Figure 114 shows the amino acid sequence (SEQ ID NO:114) derived from the coding sequence of SEQ
ID NO:113 shown in Figure 113.

Figure 115 shows a nucleotide sequence (SEQ ID NO:115) of a native sequence PRO1867 cDNA,
wherein SEQ ID NO:115 is a clone designated herein as "DNA84925-2514".

Figure 116 shows the amino acid sequence (SEQ ID NO:116) derived from the coding sequence of SEQ
ID NO:115 shown in Figure 115.

Figure 117 shows a nucleotide sequence (SEQ ID NO:117) of a native sequence PRO4319 cDNA,
wherein SEQ ID NO:117 is a clone designated herein as "DNA84928-2564".

Figure 118 shows the amino acid sequence (SEQ ID NO:118) derived from the coding sequence of SEQ
ID NO:117 shown in Figure 117.

Figure 119 shows a nucleotide sequence (SEQ ID NO:119) of a native sequence PRO4991 cDNA,
wherein SEQ ID NO:119 is a clone designated herein as "DNA84932-2657".

Figure 120 shows the amino acid sequence (SEQ ID NO:120) derived from the coding sequence of SEQ
ID NO:119 shown in Figure 119.

Figure 121 shows a nucleotide sequence (SEQ ID NO:121) of a native sequence PRO4398 cDNA,
wherein SEQ ID NO:121 is a clone designated herein as "DNA86592-2607".

Figure 122 shows the amino acid sequence (SEQ ID NO:122) derived from the coding sequence of SEQ
ID NO:121 shown in Figure 121.

Figure 123 shows a nucleotide sequence (SEQ ID NO:123) of a native sequence PRO4346 cDNA,
wherein SEQ ID NO:123 is a clone designated herein as "DNA86594-2587".

Figure 124 shows the amino acid sequence (SEQ ID NO:124) derived from the coding sequence of SEQ
ID NO:123 shown in Figure 123.

Figure 125 shows a nucleotide sequence (SEQ ID NO:125) of a native sequence PRO4350 cDNA,
wherein SEQ ID NO:125 is a clone designated herein as "DNA86647-2591".

Figure 126 shows the amino acid sequence (SEQ ID NO:126) derived from the coding sequence of SEQ
ID NO:125 shown in Figure 125.

Figure 127 shows a nucleotide sequence (SEQ ID NO:127) of a native sequence PRO4318 cDNA,
wherein SEQ ID NO:127 is a clone designated herein as "DNA87185-2563".

Figure 128 shows the amino acid sequence (SEQ ID NO:128) derived from the coding sequence of SEQ
ID NO:127 shown in Figure 127.

Figure 129 shows a nucleotide sequence (SEQ ID NO:129) of a native sequence PRO4340 cDNA,
wherein SEQ ID NO:129 is a clone designated herein as "DNA87656-2582".

Figure 130 shows the amino acid sequence (SEQ ID NO:130) derived from the coding sequence of SEQ
ID NO:129 shown in Figure 129.

Figure 131 shows a nucleotide sequence (SEQ ID NO:131) of a native sequence PRO4400 cDNA,
wherein SEQ ID NO:131 is a clone designated herein as "DNA87974-2609".

Figure 132 shows the amino acid sequence (SEQ ID NO:132) derived from the coding sequence of SEQ
ID NO:131 shown in Figure 131.

Figure 133 shows a nucleotide sequence (SEQ ID NO:133) of a native sequence PRO4320 cDNA,
wherein SEQ ID NO:133 is a clone designated herein as "DNA88001-2565".

Figure 134 shows the amino acid sequence (SEQ ID NO:134) derived from the coding sequence of SEQ
ID NO:133 shown in Figure 133.

Figure 135 shows a nucleotide sequence (SEQ ID NO:135) of a native sequence PRO4409 cDNA,
wherein SEQ ID NO:135 is a clone designated herein as "DNA88004-2575".

Figure 136 shows the amino acid sequence (SEQ ID NO:136) derived from the coding sequence of SEQ
ID NO:135 shown in Figure 135.

Figure 137 shows a nucleotide sequence (SEQ ID NO:137) of a native sequence PRO4399 cDNA,
wherein SEQ ID NO:137 is a clone designated herein as "DNA89220-2608".

Figure 138 shows the amino acid sequence (SEQ ID NO:138) derived from the coding sequence of SEQ
ID NO:137 shown in Figure 137.

Figure 139 shows a nucleotide sequence (SEQ ID NO:139) of a native sequence PRO4418 cDNA,
wherein SEQ ID NO:139 is a clone designated herein as "DNA89947-2618".

Figure 140 shows the amino acid sequence (SEQ ID NO:140) derived from the coding sequence of SEQ
ID NO:139 shown in Figure 139.

Figure 141 shows a nucleotide sequence (SEQ ID NO:141) of a native sequence PRO4330 cDNA,
wherein SEQ ID NO:141 is a clone designated herein as "DNA90842-2574".

Figure 142 shows the amino acid sequence (SEQ ID NO:142) derived from the coding sequence of SEQ
ID NO:141 shown in Figure 141.

Figure 143 shows a nucleotide sequence (SEQ ID NO:143) of a native sequence PRO4339 cDNA,
wherein SEQ ID NO:143 is a clone designated herein as "DNA91775-2581".

Figure 144 shows the amino acid sequence (SEQ ID NO:144) derived from the coding sequence of SEQ
ID NO:143 shown in Figure 143.

Figure 145 shows a nucleotide sequence (SEQ ID NO:145) of a native sequence PRO4326 cDNA,
wherein SEQ ID NO:145 is a clone designated herein as "DNA91779-2571".

Figure 146 shows the amino acid sequence (SEQ ID NO:146) derived from the coding sequence of SEQ
ID NO:145 shown in Figure 145.

Figure 147 shows a nucleotide sequence (SEQ ID NO:147) of a native sequence PRO6014 cDNA,
wherein SEQ ID NO:147 is a clone designated herein as "DNA92217-2697".

Figure 148 shows the amino acid sequence (SEQ ID NO:148) derived from the coding sequence of SEQ
ID NO:147 shown in Figure 147.

Figure 149 shows a nucleotide sequence (SEQ ID NO:149) of a native sequence PRO3446 cDNA,
wherein SEQ ID NO:149 is a clone designated herein as "DNA92219-2541".

Figure 150 shows the amino acid sequence (SEQ ID NO:150) derived from the coding sequence of SEQ
ID NO:149 shown in Figure 149.

Figure 151 shows a nucleotide sequence (SEQ ID NO:151) of a native sequence PRO4322 cDNA,
wherein SEQ ID NO:151 is a clone designated herein as "DNA92223-2567".

Figure 152 shows the amino acid sequence (SEQ ID NO:152) derived from the coding sequence of SEQ
ID NO:151 shown in Figure 151.

Figure 153 shows a nucleotide sequence (SEQ ID NO:153) of a native sequence PRO4381 cDNA,
wherein SEQ ID NO:153 is a clone designated herein as "DNA92225-2603".

Figure 154 shows the amino acid sequence (SEQ ID NO:154) derived from the coding sequence of SEQ
ID NO:153 shown in Figure 153.

Figure 155 shows a nucleotide sequence (SEQ ID NO:155) of a native sequence PRO4348 cDNA,
wherein SEQ ID NO:155 is a clone designated herein as "DNA92232-2589".

Figure 156 shows the amino acid sequence (SEQ ID NO:156) derived from the coding sequence of SEQ
ID NO:155 shown in Figure 155.

Figure 157 shows a nucleotide sequence (SEQ ID NO:157) of a native sequence PRO4371 cDNA,
wherein SEQ ID NO:157 is a clone designated herein as "DNA92233-2599".

Figure 158 shows the amino acid sequence (SEQ ID NO:158) derived from the coding sequence of SEQ
ID NO:157 shown in Figure 157.

Figure 159 shows a nucleotide sequence (SEQ ID NO:159) of a native sequence PRO3742 cDNA,
wherein SEQ ID NO:159 is a clone designated herein as "DNA92243-2549".

Figure 160 shows the amino acid sequence (SEQ ID NO:160) derived from the coding sequence of SEQ
ID NO:159 shown in Figure 159.

Figure 161 shows a nucleotide sequence (SEQ ID NO:161) of a native sequence PRO5773 cDNA,
wherein SEQ ID NO:161 is a clone designated herein as "DNA92253-2671".

Figure 162 shows the amino acid sequence (SEQ ID NO:162) derived from the coding sequence of SEQ
ID NO:161 shown in Figure 161.

Figure 163 shows a nucleotide sequence (SEQ ID NO:163) of a native sequence PRO5774 cDNA,
wherein SEQ ID NO:163 is a clone designated herein as "DNA92254-2672".

Figure 164 shows the amino acid sequence (SEQ ID NO:164) derived from the coding sequence of SEQ
ID NO:163 shown in Figure 163.

Figure 165 shows a nucleotide sequence (SEQ ID NO:165) of a native sequence PRO4343 cDNA,
wherein SEQ ID NO:165 is a clone designated herein as "DNA92255-2584".

Figure 166 shows the amino acid sequence (SEQ ID NO:166) derived from the coding sequence of SEQ
ID NO:165 shown in Figure 165.

Figure 167 shows a nucleotide sequence (SEQ ID NO:167) of a native sequence PRO4325 cDNA,
wherein SEQ ID NO:167 is a clone designated herein as "DNA92269-2570".

Figure 168 shows the amino acid sequence (SEQ ID NO:168) derived from the coding sequence of SEQ
ID NO:167 shown in Figure 167.

Figure 169 shows a nucleotide sequence (SEQ ID NO:169) of a native sequence PRO4347 cDNA,
wherein SEQ ID NO:169 is a clone designated herein as "DNA92288-2588".

Figure 170 shows the amino acid sequence (SEQ ID NO:170) derived from the coding sequence of SEQ
ID NO:169 shown in Figure 169.

Figure 171 shows a nucleotide sequence (SEQ ID NO:171) of a native sequence PRO3743 cDNA,
wherein SEQ ID NO:171 is a clone designated herein as "DNA92290-2550".

Figure 172 shows the amino acid sequence (SEQ ID NO:172) derived from the coding sequence of SEQ
ID NO:171 shown in Figure 171.

Figure 173 shows a nucleotide sequence (SEQ ID NO:173) of a native sequence PRO4426 cDNA,
wherein SEQ ID NO:173 is a clone designated herein as "DNA93012-2622".

Figure 174 shows the amino acid sequence (SEQ ID NO:174) derived from the coding sequence of SEQ
ID NO:173 shown in Figure 173.

Figure 175 shows a nucleotide sequence (SEQ ID NO:175) of a native sequence PRO4500 cDNA,
wherein SEQ ID NO:175 is a clone designated herein as "DNA93020-2642".

Figure 176 shows the amino acid sequence (SEQ ID NO:176) derived from the coding sequence of SEQ
ID NO:175 shown in Figure 175.

Figure 177 shows a nucleotide sequence (SEQ ID NO:177) of a native sequence PRO4389 cDNA,
wherein SEQ ID NO:177 is a clone designated herein as "DNA94830-2604".

Figure 178 shows the amino acid sequence (SEQ ID NO:178) derived from the coding sequence of SEQ
ID NO:177 shown in Figure 177.

Figure 179 shows a nucleotide sequence (SEQ ID NO:179) of a native sequence PRO4337 cDNA,
wherein SEQ ID NO:179 is a clone designated herein as "DNA94833-2579".

Figure 180 shows the amino acid sequence (SEQ ID NO:180) derived from the coding sequence of SEQ
ID NO:179 shown in Figure 179.

Figure 181 shows a nucleotide sequence (SEQ ID NO:181) of a native sequence PRO4992 cDNA,
wherein SEQ ID NO:181 is a clone designated herein as "DNA94838-2658".

Figure 182 shows the amino acid sequence (SEQ ID NO:182) derived from the coding sequence of SEQ
ID NO:181 shown in Figure 181.

Figure 183 shows a nucleotide sequence (SEQ ID NO:183) of a native sequence PRO5996 cDNA,
wherein SEQ ID NO:183 is a clone designated herein as "DNA94844-2686".

Figure 184 shows the amino acid sequence (SEQ ID NO:184) derived from the coding sequence of SEQ
ID NO:183 shown in Figure 183.

Figure 185 shows a nucleotide sequence (SEQ ID NO:185) of a native sequence PRO4345 cDNA,
wherein SEQ ID NO:185 is a clone designated herein as "DNA94854-2586".

Figure 186 shows the amino acid sequence (SEQ ID NO:186) derived from the coding sequence of SEQ
ID NO:185 shown in Figure 185.

Figure 187 shows a nucleotide sequence (SEQ ID NO:187) of a native sequence PRO4978 cDNA,
wherein SEQ ID NO:187 is a clone designated herein as "DNA95930".

Figure 188 shows the amino acid sequence (SEQ ID NO:188) derived from the coding sequence of SEQ
ID NO:187 shown in Figure 187.

Figure 189 shows a nucleotide sequence (SEQ ID NO:189) of a native sequence PRO5780 cDNA,
wherein SEQ ID NO:189 is a clone designated herein as "DNA96868-2677".

Figure 190 shows the amino acid sequence (SEQ ID NO:190) derived from the coding sequence of SEQ
ID NO:189 shown in Figure 189.

Figure 191 shows a nucleotide sequence (SEQ ID NO:191) of a native sequence PRO5992 cDNA,
wherein SEQ ID NO:191 is a clone designated herein as "DNA96871-2683".

Figure 192 shows the amino acid sequence (SEQ ID NO:192) derived from the coding sequence of SEQ
ID NO:191 shown in Figure 191.

Figure 193 shows a nucleotide sequence (SEQ ID NO:193) of a native sequence PRO4428 cDNA,
wherein SEQ ID NO:193 is a clone designated herein as "DNA96880-2624".

Figure 194 shows the amino acid sequence (SEQ ID NO:194) derived from the coding sequence of SEQ
ID NO:193 shown in Figure 193.

Figure 195 shows a nucleotide sequence (SEQ ID NO:195) of a native sequence PRO4994 cDNA,
wherein SEQ ID NO:195 is a clone designated herein as "DNA96986-2660".

Figure 196 shows the amino acid sequence (SEQ ID NO:196) derived from the coding sequence of SEQ
ID NO:195 shown in Figure 195.

Figure 197 shows a nucleotide sequence (SEQ ID NO:197) of a native sequence PRO5995 cDNA,
wherein SEQ ID NO:197 is a clone designated herein as "DNA96988-2685".

Figure 198 shows the amino acid sequence (SEQ ID NO:198) derived from the coding sequence of SEQ
ID NO:197 shown in Figure 197.

Figure 199 shows a nucleotide sequence (SEQ ID NO:199) of a native sequence PRO6094 cDNA,
wherein SEQ ID NO:199 is a clone designated herein as "DNA96995-2709".

Figure 200 shows the amino acid sequence (SEQ ID NO:200) derived from the coding sequence of SEQ
ID NO:199 shown in Figure 199.

Figure 201 shows a nucleotide sequence (SEQ ID NO:201) of a native sequence PRO4317 cDNA,
wherein SEQ ID NO:201 is a clone designated herein as "DNA97004-2562".

Figure 202 shows the amino acid sequence (SEQ ID NO:202) derived from the coding sequence of SEQ
ID NO:201 shown in Figure 201.

Figure 203 shows a nucleotide sequence (SEQ ID NO:203) of a native sequence PRO5997 cDNA,
wherein SEQ ID NO:203 is a clone designated herein as "DNA97005-2687".

Figure 204 shows the amino acid sequence (SEQ ID NO:204) derived from the coding sequence of SEQ
ID NO:203 shown in Figure 203.

Figure 205 shows a nucleotide sequence (SEQ ID NO:205) of a native sequence PRO5005 cDNA,
wherein SEQ ID NO:205 is a clone designated herein as "DNA97009-2668".

Figure 206 shows the amino acid sequence (SEQ ID NO:206) derived from the coding sequence of SEQ
ID NO:205 shown in Figure 205.

Figure 207 shows a nucleotide sequence (SEQ ID NO:207) of a native sequence PRO5004 cDNA,
wherein SEQ ID NO:207 is a clone designated herein as "DNA97013-2667".

Figure 208 shows the amino acid sequence (SEQ ID NO:208) derived from the coding sequence of SEQ
ID NO:207 shown in Figure 207.

Figure 209 shows a nucleotide sequence (SEQ ID NO:209) of a native sequence PRO6001 cDNA,
wherein SEQ ID NO:209 is a clone designated herein as "DNA98380-2690".

Figure 210 shows the amino acid sequence (SEQ ID NO:210) derived from the coding sequence of SEQ
ID NO:209 shown in Figure 209.

Figure 211 shows a nucleotide sequence (SEQ ID NO:211) of a native sequence PRO6013 cDNA,
wherein SEQ ID NO:211 is a clone designated herein as "DNA98561-2696".

Figure 212 shows the amino acid sequence (SEQ ID NO:212) derived from the coding sequence of SEQ
ID NO:211 shown in Figure 211.

Figure 213 shows a nucleotide sequence (SEQ ID NO:213) of a native sequence PRO4502 cDNA,
wherein SEQ ID NO:213 is a clone designated herein as "DNA98575-2644".

Figure 214 shows the amino acid sequence (SEQ ID NO:214) derived from the coding sequence of SEQ
ID NO:213 shown in Figure 213.

Figure 215 shows a nucleotide sequence (SEQ ID NO:215) of a native sequence PRO6007 cDNA,
wherein SEQ ID NO:215 is a clone designated herein as "DNA98593-2694".

Figure 216 shows the amino acid sequence (SEQ ID NO:216) derived from the coding sequence of SEQ
ID NO:215 shown in Figure 215.

Figure 217 shows a nucleotide sequence (SEQ ID NO:217) of a native sequence PRO6028 cDNA,
wherein SEQ ID NO:217 is a clone designated herein as "DNA98600-2703".

Figure 218 shows the amino acid sequence (SEQ ID NO:218) derived from the coding sequence of SEQ
ID NO:217 shown in Figure 217.

Figure 219 shows a nucleotide sequence (SEQ ID NO:219) of a native sequence PRO100 cDNA,
wherein SEQ ID NO:219 is a clone designated herein as "DNA99333".

Figure 220 shows the amino acid sequence (SEQ ID NO:220) derived from the coding sequence of SEQ
ID NO:219 shown in Figure 219.

Figure 221 shows a nucleotide sequence (SEQ ID NO:221) of a native sequence PRO4327 cDNA,
wherein SEQ ID NO:221 is a clone designated herein as "DNA99391-2572".

Figure 222 shows the amino acid sequence (SEQ ID NO:222) derived from the coding sequence of SEQ
ID NO:221 shown in Figure 221.

Figure 223 shows a nucleotide sequence (SEQ ID NO:223) of a native sequence PRO4315 cDNA,
wherein SEQ ID NO:223 is a clone designated herein as "DNA99393-2560".

Figure 224 shows the amino acid sequence (SEQ ID NO:224) derived from the coding sequence of SEQ
ID NO:223 shown in Figure 223.

Figure 225 shows a nucleotide sequence (SEQ ID NO:225) of a native sequence PRO5993 cDNA,
wherein SEQ ID NO:225 is a clone designated herein as "DNA100276-2684".

Figure 226 shows the amino acid sequence (SEQ ID NO:226) derived from the coding sequence of SEQ
ID NO:225 shown in Figure 225.

Figure 227 shows a nucleotide sequence (SEQ ID NO:227) of a native sequence PRO4503 cDNA,
wherein SEQ ID NO:227 is a clone designated herein as "DNA100312-2645".

Figure 228 shows the amino acid sequence (SEQ ID NO:228) derived from the coding sequence of SEQ
ID NO:227 shown in Figure 227.

Figure 229 shows a nucleotide sequence (SEQ ID NO:229) of a native sequence PRO4976 cDNA,
wherein SEQ ID NO:229 is a clone designated herein as "DNA100902-2646".

Figure 230 shows the amino acid sequence (SEQ ID NO:230) derived from the coding sequence of SEQ
ID NO:229 shown in Figure 229.

Figure 231 shows a nucleotide sequence (SEQ ID NO:231) of a native sequence PRO5798 cDNA,
wherein SEQ ID NO:231 is a clone designated herein as "DNA102899-2679".

Figure 232 shows the amino acid sequence (SEQ ID NO:232) derived from the coding sequence of SEQ
ID NO:231 shown in Figure 231.

Figure 233 shows a nucleotide sequence (SEQ ID NO:233) of a native sequence PRO6242 cDNA,
wherein SEQ ID NO:233 is a clone designated herein as "DNA104875-2720".

Figure 234 shows the amino acid sequence (SEQ ID NO:234) derived from the coding sequence of SEQ
ID NO:233 shown in Figure 233.

Figure 235 shows a nucleotide sequence (SEQ ID NO:235) of a native sequence PRO6095 cDNA,
wherein SEQ ID NO:235 is a clone designated herein as "DNA105680-2710".

Figure 236 shows the amino acid sequence (SEQ ID NO:236) derived from the coding sequence of SEQ
ID NO:235 shown in Figure 235.

Figure 237 shows a nucleotide sequence (SEQ ID NO:237) of a native sequence PRO6093 cDNA,
wherein SEQ ID NO:237 is a clone designated herein as "DNA105779-2708".

Figure 238 shows the amino acid sequence (SEQ ID NO:238) derived from the coding sequence of SEQ 
ID NO:237 shown in Figure 237. 

Figure 239 shows a nucleotide sequence (SEQ ID NO:239) of a native sequence PRO6012 cDNA,
wherein SEQ ID NO:239 is a clone designated herein as "DNA105794-2695".

Figure 240 shows the amino acid sequence (SEQ ID NO:240) derived from the coding sequence of SEQ 
ID NO:239 shown in Figure 239. 

Figure 241 shows a nucleotide sequence (SEQ ID NO:241) of a native sequence PRO6027 cDNA,
wherein SEQ ID NO:241 is a clone designated herein as "DNA105838-2702".

Figure 242 shows the amino acid sequence (SEQ ID NO:242) derived from the coding sequence of SEQ 
ID NO:241 shown in Figure 241. 

Figure 243 shows a nucleotide sequence (SEQ ID NO:243) of a native sequence PRO6181 cDNA,
wherein SEQ ID NO:243 is a clone designated herein as "DNA107698-2715".

Figure 244 shows the amino acid sequence (SEQ ID NO:244) derived from the coding sequence of SEQ 
ID NO:243 shown in Figure 243. 

Figure 245 shows a nucleotide sequence (SEQ ID NO:245) of a native sequence PRO6097 cDNA,
wherein SEQ ID NO:245 is a clone designated herein as "DNA107701-2711".

Figure 246 shows the amino acid sequence (SEQ ID NO:246) derived from the coding sequence of SEQ
ID NO:245 shown in Figure 245.

Figure 247 shows a nucleotide sequence (SEQ ID NO:247) of a native sequence PRO6090 cDNA,
wherein SEQ ID NO:247 is a clone designated herein as "DNA107781-2707".

Figure 248 shows the amino acid sequence (SEQ ID NO:248) derived from the coding sequence of SEQ 
ID NO:247 shown in Figure 247. 

Figure 249 shows a nucleotide sequence (SEQ ID NO:249) of a native sequence PRO7171 cDNA,
wherein SEQ ID NO:249 is a clone designated herein as "DNA108670-2744".

Figure 250 shows the amino acid sequence (SEQ ID NO:250) derived from the coding sequence of SEQ 
ID NO:249 shown in Figure 249. 

Figure 251 shows a nucleotide sequence (SEQ ID NO:251) of a native sequence PRO6258 cDNA,
wherein SEQ ID NO:251 is a clone designated herein as "DNA108688-2725".

Figure 252 shows the amino acid sequence (SEQ ID NO:252) derived from the coding sequence of SEQ
ID NO:251 shown in Figure 251.

Figure 253 shows a nucleotide sequence (SEQ ID NO:253) of a native sequence PRO9820 cDNA,
wherein SEQ ID NO:253 is a clone designated herein as "DNA108769-2765".

Figure 254 shows the amino acid sequence (SEQ ID NO:254) derived from the coding sequence of SEQ 
ID NO:253 shown in Figure 253. 

Figure 255 shows a nucleotide sequence (SEQ ID NO:255) of a native sequence PRO6243 cDNA,
wherein SEQ ID NO:255 is a clone designated herein as "DNA108935-2721".

Figure 256 shows the amino acid sequence (SEQ ID NO:256) derived from the coding sequence of SEQ
ID NO:255 shown in Figure 255. 

Figure 257 shows a nucleotide sequence (SEQ ID NO:257) of a native sequence PRO6182 cDNA,
wherein SEQ ID NO:257 is a clone designated herein as "DNA110700-2716".

Figure 258 shows the amino acid sequence (SEQ ID NO:258) derived from the coding sequence of SEQ
ID NO:257 shown in Figure 257.

Figure 259 shows a nucleotide sequence (SEQ ID NO:259) of a native sequence PRO6079 cDNA,
wherein SEQ ID NO:259 is a clone designated herein as "DNA111750-2706".

Figure 260 shows the amino acid sequence (SEQ ID NO:260) derived from the coding sequence of SEQ
ID NO:259 shown in Figure 259.

Figure 261 shows a nucleotide sequence (SEQ ID NO:261) of a native sequence PRO7434 cDNA,
wherein SEQ ID NO:261 is a clone designated herein as "DNA123430-2755".

Figure 262 shows the amino acid sequence (SEQ ID NO:262) derived from the coding sequence of SEQ
ID NO:261 shown in Figure 261.

Figure 263 shows a nucleotide sequence (SEQ ID NO:263) of a native sequence PRO9865 cDNA,
wherein SEQ ID NO:263 is a clone designated herein as "DNA125154-2785".

Figure 264 shows the amino acid sequence (SEQ ID NO:264) derived from the coding sequence of SEQ
ID NO:263 shown in Figure 263.

Figure 265 shows a nucleotide sequence (SEQ ID NO:265) of a native sequence PRO9828 cDNA,
wherein SEQ ID NO:265 is a clone designated herein as "DNA142238-2768".

Figure 266 shows the amino acid sequence (SEQ ID NO:266) derived from the coding sequence of SEQ
ID NO:265 shown in Figure 265.

Figure 267 shows a nucleotide sequence (SEQ ID NO:267) of a native sequence PRO196 cDNA,
wherein SEQ ID NO:267 is a clone designated herein as "DNA22779-1130".

Figure 268 shows the amino acid sequence (SEQ ID NO:268) derived from the coding sequence of SEQ
ID NO:267 shown in Figure 267.

Figure 269 shows a nucleotide sequence (SEQ ID NO:269) of a native sequence PRO197 cDNA,
wherein SEQ ID NO:269 is a clone designated herein as "DNA22780-1078".

Figure 270 shows the amino acid sequence (SEQ ID NO:270) derived from the coding sequence of SEQ
ID NO:269 shown in Figure 269.

Figure 271 shows a nucleotide sequence (SEQ ID NO:271) of a native sequence PRO195 cDNA,
wherein SEQ ID NO:271 is a clone designated herein as "DNA26847-1395".

Figure 272 shows the amino acid sequence (SEQ ID NO:272) derived from the coding sequence of SEQ
ID NO:271 shown in Figure 271.

Figure 273 shows a nucleotide sequence (SEQ ID NO:273) of a native sequence PRO187 cDNA,
wherein SEQ ID NO:273 is a clone designated herein as "DNA27864-1155".

Figure 274 shows the amino acid sequence (SEQ ID NO:274) derived from the coding sequence of SEQ
ID NO:273 shown in Figure 273.

Figure 275 shows a nucleotide sequence (SEQ ID NO:275) of a native sequence PRO182 cDNA,
wherein SEQ ID NO:275 is a clone designated herein as "DNA27865-1091".

Figure 276 shows the amino acid sequence (SEQ ID NO:276) derived from the coding sequence of SEQ 
ID NO:275 shown in Figure 275. 

Figure 277 shows a nucleotide sequence (SEQ ID NO:277) of a native sequence PRO188 cDNA,
wherein SEQ ID NO:277 is a clone designated herein as "DNA28497-1130".

Figure 278 shows the amino acid sequence (SEQ ID NO:278) derived from the coding sequence of SEQ 
ID NO:277 shown in Figure 277. 

Figure 279 shows a nucleotide sequence (SEQ ID NO:279) of a native sequence PRO183 cDNA,
wherein SEQ ID NO:279 is a clone designated herein as "DNA28498".

Figure 280 shows the amino acid sequence (SEQ ID NO:280) derived from the coding sequence of SEQ 
ID NO:279 shown in Figure 279. 

Figure 281 shows a nucleotide sequence (SEQ ID NO:281) of a native sequence PRO184 cDNA,
wherein SEQ ID NO:281 is a clone designated herein as "DNA28500".

Figure 282 shows the amino acid sequence (SEQ ID NO:282) derived from the coding sequence of SEQ 
ID NO:281 shown in Figure 281. 

Figure 283 shows a nucleotide sequence (SEQ ID NO:283) of a native sequence PRO185 cDNA,
wherein SEQ ID NO:283 is a clone designated herein as "DNA28503".

Figure 284 shows the amino acid sequence (SEQ ID NO:284) derived from the coding sequence of SEQ
ID NO:283 shown in Figure 283.

Figure 285 shows a nucleotide sequence (SEQ ID NO:285) of a native sequence PRO200 cDNA,
wherein SEQ ID NO:285 is a clone designated herein as "DNA29101-1122". 

Figure 286 shows the amino acid sequence (SEQ ID NO:286) derived from the coding sequence of SEQ 
ID NO:285 shown in Figure 285. 

Figure 287 shows a nucleotide sequence (SEQ ID NO:287) of a native sequence PRO202 cDNA,
wherein SEQ ID NO:287 is a clone designated herein as "DNA30869".

Figure 288 shows the amino acid sequence (SEQ ID NO:288) derived from the coding sequence of SEQ 
ID NO:287 shown in Figure 287. 

Figure 289 shows a nucleotide sequence (SEQ ID NO:289) of a native sequence PR0214 cDNA, 
wherein SEQ ID N0:289 is a clone designated herein as "DNA32286-1 191 

Figure 290 shows the amino acid sequence (SEQ ID NO:290) derived from the coding sequence of SEQ 
ID NO:289 shown in Figure 289 

Figure 291 shows a nucleotide sequence (SEQ ID NO:291) of a native sequence PR0215 cDNA, 
wherein SEQ ID NO:291 is a clone designated herein as '•DNA32288-1 132". 

Figure 292 shows the amino acid sequence (SEQ ID NO:292) derived from the coding sequence of SEQ 
ID NO:291 shown in Figure 291 . 

Figure 293 shows a nucleotide sequence (SEQ ID NO:293) of a native sequence PR0219 cDNA, 
y/hmm SEQ ID NO:293 is a clone designated herein as *'DNA32290-1164". 
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Figure 294 shows the amino acid sequence (SEQ ID NO:294) derived from the coding sequence of SEQ 
ID NO:293 shown in Figure 293. 

Figure 295 shows a nucleotide sequence (SEQ ID NO:295) of a native sequence PRO211 cDNA,
wherein SEQ ID NO:295 is a clone designated herein as "DNA32292-1131".

Figure 296 shows the amino acid sequence (SEQ ID NO:296) derived from the coding sequence of SEQ
ID NO:295 shown in Figure 295.

Figure 297 shows a nucleotide sequence (SEQ ID NO:297) of a native sequence PRO220 cDNA,
wherein SEQ ID NO:297 is a clone designated herein as "DNA32298-1132".

Figure 298 shows the amino acid sequence (SEQ ID NO:298) derived from the coding sequence of SEQ
ID NO:297 shown in Figure 297.

Figure 299 shows a nucleotide sequence (SEQ ID NO:299) of a native sequence PRO366 cDNA,
wherein SEQ ID NO:299 is a clone designated herein as "DNA33085-1110".

Figure 300 shows the amino acid sequence (SEQ ID NO:300) derived from the coding sequence of SEQ
ID NO:299 shown in Figure 299.

Figure 301 shows a nucleotide sequence (SEQ ID NO:301) of a native sequence PRO216 cDNA,
wherein SEQ ID NO:301 is a clone designated herein as "DNA33087-1158".

Figure 302 shows the amino acid sequence (SEQ ID NO:302) derived from the coding sequence of SEQ 
ID NO:301 shown in Figure 301. 

Figure 303 shows a nucleotide sequence (SEQ ID NO:303) of a native sequence PRO221 cDNA,
wherein SEQ ID NO:303 is a clone designated herein as "DNA33089-1132".

Figure 304 shows the amino acid sequence (SEQ ID NO:304) derived from the coding sequence of SEQ
ID NO:303 shown in Figure 303.

Figure 305 shows a nucleotide sequence (SEQ ID NO:305) of a native sequence PRO228 cDNA,
wherein SEQ ID NO:305 is a clone designated herein as "DNA33092-1202".

Figure 306 shows the amino acid sequence (SEQ ID NO:306) derived from the coding sequence of SEQ
ID NO:305 shown in Figure 305.

Figure 307 shows a nucleotide sequence (SEQ ID NO:307) of a native sequence PRO217 cDNA,
wherein SEQ ID NO:307 is a clone designated herein as "DNA33094-1131".

Figure 308 shows the amino acid sequence (SEQ ID NO:308) derived from the coding sequence of SEQ
ID NO:307 shown in Figure 307.

Figure 309 shows a nucleotide sequence (SEQ ID NO:309) of a native sequence PRO222 cDNA,
wherein SEQ ID NO:309 is a clone designated herein as "DNA33107-1135".

Figure 310 shows the amino acid sequence (SEQ ID NO:310) derived from the coding sequence of SEQ
ID NO:309 shown in Figure 309.

Figure 311 shows a nucleotide sequence (SEQ ID NO:311) of a native sequence PRO224 cDNA,
wherein SEQ ID NO:311 is a clone designated herein as "DNA33221-1133".

Figure 312 shows the amino acid sequence (SEQ ID NO:312) derived from the coding sequence of SEQ
ID NO:311 shown in Figure 311.

Figure 313 shows a nucleotide sequence (SEQ ID NO:313) of a native sequence PRO230 cDNA,
wherein SEQ ID NO:313 is a clone designated herein as "DNA33223-1136".

Figure 314 shows the amino acid sequence (SEQ ID NO:314) derived from the coding sequence of SEQ
ID NO:313 shown in Figure 313.

Figure 315 shows a nucleotide sequence (SEQ ID NO:315) of a native sequence PRO198 cDNA,
wherein SEQ ID NO:315 is a clone designated herein as "DNA33457-1078".

Figure 316 shows the amino acid sequence (SEQ ID NO:316) derived from the coding sequence of SEQ
ID NO:315 shown in Figure 315.

Figure 317 shows a nucleotide sequence (SEQ ID NO:317) of a native sequence PRO226 cDNA,
wherein SEQ ID NO:317 is a clone designated herein as "DNA33460-1166".

Figure 318 shows the amino acid sequence (SEQ ID NO:318) derived from the coding sequence of SEQ
ID NO:317 shown in Figure 317.

Figure 319 shows a nucleotide sequence (SEQ ID NO:319) of a native sequence PRO261 cDNA,
wherein SEQ ID NO:319 is a clone designated herein as "DNA33473-1176".

Figure 320 shows the amino acid sequence (SEQ ID NO:320) derived from the coding sequence of SEQ
ID NO:319 shown in Figure 319.

Figure 321 shows a nucleotide sequence (SEQ ID NO:321) of a native sequence PRO242 cDNA,
wherein SEQ ID NO:321 is a clone designated herein as "DNA33785-1143".

Figure 322 shows the amino acid sequence (SEQ ID NO:322) derived from the coding sequence of SEQ
ID NO:321 shown in Figure 321.

Figure 323 shows a nucleotide sequence (SEQ ID NO:323) of a native sequence PRO227 cDNA,
wherein SEQ ID NO:323 is a clone designated herein as "DNA33786-1132".

Figure 324 shows the amino acid sequence (SEQ ID NO:324) derived from the coding sequence of SEQ
ID NO:323 shown in Figure 323.

Figure 325 shows a nucleotide sequence (SEQ ID NO:325) of a native sequence PRO237 cDNA,
wherein SEQ ID NO:325 is a clone designated herein as "DNA34353-1428".

Figure 326 shows the amino acid sequence (SEQ ID NO:326) derived from the coding sequence of SEQ
ID NO:325 shown in Figure 325.

Figure 327 shows a nucleotide sequence (SEQ ID NO:327) of a native sequence PRO241 cDNA,
wherein SEQ ID NO:327 is a clone designated herein as "DNA34392-1170".

Figure 328 shows the amino acid sequence (SEQ ID NO:328) derived from the coding sequence of SEQ
ID NO:327 shown in Figure 327.

Figure 329 shows a nucleotide sequence (SEQ ID NO:329) of a native sequence PRO231 cDNA,
wherein SEQ ID NO:329 is a clone designated herein as "DNA34434-1139".

Figure 330 shows the amino acid sequence (SEQ ID NO:330) derived from the coding sequence of SEQ
ID NO:329 shown in Figure 329.

Figure 331 shows a nucleotide sequence (SEQ ID NO:331) of a native sequence PRO235 cDNA,
wherein SEQ ID NO:331 is a clone designated herein as "DNA35558-1167".

Figure 332 shows the amino acid sequence (SEQ ID NO:332) derived from the coding sequence of SEQ
ID NO:331 shown in Figure 331.

Figure 333 shows a nucleotide sequence (SEQ ID NO:333) of a native sequence PRO323 cDNA,
wherein SEQ ID NO:333 is a clone designated herein as "DNA35595-1228".

Figure 334 shows the amino acid sequence (SEQ ID NO:334) derived from the coding sequence of SEQ
ID NO:333 shown in Figure 333.

Figure 335 shows a nucleotide sequence (SEQ ID NO:335) of a native sequence PRO245 cDNA,
wherein SEQ ID NO:335 is a clone designated herein as "DNA35638-1216".

Figure 336 shows the amino acid sequence (SEQ ID NO:336) derived from the coding sequence of SEQ
ID NO:335 shown in Figure 335.

Figure 337 shows a nucleotide sequence (SEQ ID NO:337) of a native sequence PRO246 cDNA,
wherein SEQ ID NO:337 is a clone designated herein as "DNA35639-1172".

Figure 338 shows the amino acid sequence (SEQ ID NO:338) derived from the coding sequence of SEQ
ID NO:337 shown in Figure 337.

Figure 339 shows a nucleotide sequence (SEQ ID NO:339) of a native sequence PRO288 cDNA,
wherein SEQ ID NO:339 is a clone designated herein as "DNA35663-1129".

Figure 340 shows the amino acid sequence (SEQ ID NO:340) derived from the coding sequence of SEQ
ID NO:339 shown in Figure 339.

Figure 341 shows a nucleotide sequence (SEQ ID NO:341) of a native sequence PRO248 cDNA,
wherein SEQ ID NO:341 is a clone designated herein as "DNA35674-1142".

Figure 342 shows the amino acid sequence (SEQ ID NO:342) derived from the coding sequence of SEQ
ID NO:341 shown in Figure 341.

Figure 343 shows a nucleotide sequence (SEQ ID NO:343) of a native sequence PRO257 cDNA,
wherein SEQ ID NO:343 is a clone designated herein as "DNA35841-1173".

Figure 344 shows the amino acid sequence (SEQ ID NO:344) derived from the coding sequence of SEQ
ID NO:343 shown in Figure 343.

Figure 345 shows a nucleotide sequence (SEQ ID NO:345) of a native sequence PRO172 cDNA,
wherein SEQ ID NO:345 is a clone designated herein as "DNA35916-1161".

Figure 346 shows the amino acid sequence (SEQ ID NO:346) derived from the coding sequence of SEQ
ID NO:345 shown in Figure 345.

Figure 347 shows a nucleotide sequence (SEQ ID NO:347) of a native sequence PRO258 cDNA,
wherein SEQ ID NO:347 is a clone designated herein as "DNA35918-1174".

Figure 348 shows the amino acid sequence (SEQ ID NO:348) derived from the coding sequence of SEQ
ID NO:347 shown in Figure 347.

Figure 349 shows a nucleotide sequence (SEQ ID NO:349) of a native sequence PRO265 cDNA,
wherein SEQ ID NO:349 is a clone designated herein as "DNA36350-1158".

Figure 350 shows the amino acid sequence (SEQ ID NO:350) derived from the coding sequence of SEQ
ID NO:349 shown in Figure 349.

Figure 351 shows a nucleotide sequence (SEQ ID NO:351) of a native sequence PRO326 cDNA,
wherein SEQ ID NO:351 is a clone designated herein as "DNA37140-1234".

Figure 352 shows the amino acid sequence (SEQ ID NO:352) derived from the coding sequence of SEQ 
ID NO:351 shown in Figure 351. 

Figure 353 shows a nucleotide sequence (SEQ ID NO:353) of a native sequence PRO266 cDNA,
wherein SEQ ID NO:353 is a clone designated herein as "DNA37150-1178".

Figure 354 shows the amino acid sequence (SEQ ID NO:354) derived from the coding sequence of SEQ
ID NO:353 shown in Figure 353.

Figure 355 shows a nucleotide sequence (SEQ ID NO:355) of a native sequence PRO269 cDNA,
wherein SEQ ID NO:355 is a clone designated herein as "DNA38260-1180".

Figure 356 shows the amino acid sequence (SEQ ID NO:356) derived from the coding sequence of SEQ
ID NO:355 shown in Figure 355.

Figure 357 shows a nucleotide sequence (SEQ ID NO:357) of a native sequence PRO285 cDNA,
wherein SEQ ID NO:357 is a clone designated herein as "DNA40021-1154".

Figure 358 shows the amino acid sequence (SEQ ID NO:358) derived from the coding sequence of SEQ
ID NO:357 shown in Figure 357.

Figure 359 shows a nucleotide sequence (SEQ ID NO:359) of a native sequence PRO328 cDNA,
wherein SEQ ID NO:359 is a clone designated herein as "DNA40587-1231".

Figure 360 shows the amino acid sequence (SEQ ID NO:360) derived from the coding sequence of SEQ
ID NO:359 shown in Figure 359.

Figure 361 shows a nucleotide sequence (SEQ ID NO:361) of a native sequence PRO344 cDNA,
wherein SEQ ID NO:361 is a clone designated herein as "DNA40592-1242".

Figure 362 shows the amino acid sequence (SEQ ID NO:362) derived from the coding sequence of SEQ
ID NO:361 shown in Figure 361.

Figure 363 shows a nucleotide sequence (SEQ ID NO:363) of a native sequence PRO272 cDNA,
wherein SEQ ID NO:363 is a clone designated herein as "DNA40620-1183".

Figure 364 shows the amino acid sequence (SEQ ID NO:364) derived from the coding sequence of SEQ
ID NO:363 shown in Figure 363.

Figure 365 shows a nucleotide sequence (SEQ ID NO:365) of a native sequence PRO301 cDNA,
wherein SEQ ID NO:365 is a clone designated herein as "DNA40628-1216".

Figure 366 shows the amino acid sequence (SEQ ID NO:366) derived from the coding sequence of SEQ
ID NO:365 shown in Figure 365.

Figure 367 shows a nucleotide sequence (SEQ ID NO:367) of a native sequence PRO331 cDNA,
wherein SEQ ID NO:367 is a clone designated herein as "DNA40981-1234".

Figure 368 shows the amino acid sequence (SEQ ID NO:368) derived from the coding sequence of SEQ
ID NO:367 shown in Figure 367.

Figure 369 shows a nucleotide sequence (SEQ ID NO:369) of a native sequence PRO332 cDNA,
wherein SEQ ID NO:369 is a clone designated herein as "DNA40982-1235".

Figure 370 shows the amino acid sequence (SEQ ID NO:370) derived from the coding sequence of SEQ
ID NO:369 shown in Figure 369.

Figure 371 shows a nucleotide sequence (SEQ ID NO:371) of a native sequence PRO353 cDNA,
wherein SEQ ID NO:371 is a clone designated herein as "DNA41234-1242".

Figure 372 shows the amino acid sequence (SEQ ID NO:372) derived from the coding sequence of SEQ
ID NO:371 shown in Figure 371.

Figure 373 shows a nucleotide sequence (SEQ ID NO:373) of a native sequence PRO310 cDNA,
wherein SEQ ID NO:373 is a clone designated herein as "DNA43046-1225".

Figure 374 shows the amino acid sequence (SEQ ID NO:374) derived from the coding sequence of SEQ
ID NO:373 shown in Figure 373.

Figure 375 shows a nucleotide sequence (SEQ ID NO:375) of a native sequence PRO337 cDNA,
wherein SEQ ID NO:375 is a clone designated herein as "DNA43316-1237".

Figure 376 shows the amino acid sequence (SEQ ID NO:376) derived from the coding sequence of SEQ
ID NO:375 shown in Figure 375.

Figure 377 shows a nucleotide sequence (SEQ ID NO:377) of a native sequence PRO346 cDNA,
wherein SEQ ID NO:377 is a clone designated herein as "DNA44167-1243".

Figure 378 shows the amino acid sequence (SEQ ID NO:378) derived from the coding sequence of SEQ
ID NO:377 shown in Figure 377.

Figure 379 shows a nucleotide sequence (SEQ ID NO:379) of a native sequence PRO350 cDNA,
wherein SEQ ID NO:379 is a clone designated herein as "DNA44175-1314".

Figure 380 shows the amino acid sequence (SEQ ID NO:380) derived from the coding sequence of SEQ
ID NO:379 shown in Figure 379.

Figure 381 shows a nucleotide sequence (SEQ ID NO:381) of a native sequence PRO526 cDNA,
wherein SEQ ID NO:381 is a clone designated herein as "DNA44184-1319".

Figure 382 shows the amino acid sequence (SEQ ID NO:382) derived from the coding sequence of SEQ
ID NO:381 shown in Figure 381.

Figure 383 shows a nucleotide sequence (SEQ ID NO:383) of a native sequence PRO381 cDNA,
wherein SEQ ID NO:383 is a clone designated herein as "DNA44194-1317".

Figure 384 shows the amino acid sequence (SEQ ID NO:384) derived from the coding sequence of SEQ
ID NO:383 shown in Figure 383.

Figure 385 shows a nucleotide sequence (SEQ ID NO:385) of a native sequence PRO846 cDNA,
wherein SEQ ID NO:385 is a clone designated herein as "DNA44196-1353".

Figure 386 shows the amino acid sequence (SEQ ID NO:386) derived from the coding sequence of SEQ
ID NO:385 shown in Figure 385.

Figure 387 shows a nucleotide sequence (SEQ ID NO:387) of a native sequence PRO363 cDNA,
wherein SEQ ID NO:387 is a clone designated herein as "DNA45419-1252".

Figure 388 shows the amino acid sequence (SEQ ID NO:388) derived from the coding sequence of SEQ
ID NO:387 shown in Figure 387.

Figure 389 shows a nucleotide sequence (SEQ ID NO:389) of a native sequence PRO365 cDNA,
wherein SEQ ID NO:389 is a clone designated herein as "DNA46777-1253".

Figure 390 shows the amino acid sequence (SEQ ID NO:390) derived from the coding sequence of SEQ
ID NO:389 shown in Figure 389.

Figure 391 shows a nucleotide sequence (SEQ ID NO:391) of a native sequence PRO1310 cDNA,
wherein SEQ ID NO:391 is a clone designated herein as "DNA47394-1572".

Figure 392 shows the amino acid sequence (SEQ ID NO:392) derived from the coding sequence of SEQ
ID NO:391 shown in Figure 391.

Figure 393 shows a nucleotide sequence (SEQ ID NO:393) of a native sequence PRO731 cDNA,
wherein SEQ ID NO:393 is a clone designated herein as "DNA48331-1329".

Figure 394 shows the amino acid sequence (SEQ ID NO:394) derived from the coding sequence of SEQ
ID NO:393 shown in Figure 393.

Figure 395 shows a nucleotide sequence (SEQ ID NO:395) of a native sequence PRO322 cDNA,
wherein SEQ ID NO:395 is a clone designated herein as "DNA48336-1309".

Figure 396 shows the amino acid sequence (SEQ ID NO:396) derived from the coding sequence of SEQ
ID NO:395 shown in Figure 395.

Figure 397 shows a nucleotide sequence (SEQ ID NO:397) of a native sequence PRO536 cDNA,
wherein SEQ ID NO:397 is a clone designated herein as "DNA49142-1430".

Figure 398 shows the amino acid sequence (SEQ ID NO:398) derived from the coding sequence of SEQ
ID NO:397 shown in Figure 397.

Figure 399 shows a nucleotide sequence (SEQ ID NO:399) of a native sequence PRO719 cDNA,
wherein SEQ ID NO:399 is a clone designated herein as "DNA49646-1327".

Figure 400 shows the amino acid sequence (SEQ ID NO:400) derived from the coding sequence of SEQ
ID NO:399 shown in Figure 399.

Figure 401 shows a nucleotide sequence (SEQ ID NO:401) of a native sequence PRO619 cDNA,
wherein SEQ ID NO:401 is a clone designated herein as "DNA49821-1562".

Figure 402 shows the amino acid sequence (SEQ ID NO:402) derived from the coding sequence of SEQ
ID NO:401 shown in Figure 401.

Figure 403 shows a nucleotide sequence (SEQ ID NO:403) of a native sequence PRO771 cDNA,
wherein SEQ ID NO:403 is a clone designated herein as "DNA49829-1346".

Figure 404 shows the amino acid sequence (SEQ ID NO:404) derived from the coding sequence of SEQ
ID NO:403 shown in Figure 403.

Figure 405 shows a nucleotide sequence (SEQ ID NO:405) of a native sequence PRO1083 cDNA,
wherein SEQ ID NO:405 is a clone designated herein as "DNA50921-1458".

Figure 406 shows the amino acid sequence (SEQ ID NO:406) derived from the coding sequence of SEQ
ID NO:405 shown in Figure 405.

Figure 407 shows a nucleotide sequence (SEQ ID NO:407) of a native sequence PRO862 cDNA,
wherein SEQ ID NO:407 is a clone designated herein as "DNA52187-1354".

Figure 408 shows the amino acid sequence (SEQ ID NO:408) derived from the coding sequence of SEQ
ID NO:407 shown in Figure 407.

Figure 409 shows a nucleotide sequence (SEQ ID NO:409) of a native sequence PRO733 cDNA,
wherein SEQ ID NO:409 is a clone designated herein as "DNA52196-1348".

Figure 410 shows the amino acid sequence (SEQ ID NO:410) derived from the coding sequence of SEQ
ID NO:409 shown in Figure 409.

Figure 411 shows a nucleotide sequence (SEQ ID NO:411) of a native sequence PRO1188 cDNA,
wherein SEQ ID NO:411 is a clone designated herein as "DNA52598-1518".

Figure 412 shows the amino acid sequence (SEQ ID NO:412) derived from the coding sequence of SEQ
ID NO:411 shown in Figure 411.

Figure 413 shows a nucleotide sequence (SEQ ID NO:413) of a native sequence PRO770 cDNA,
wherein SEQ ID NO:413 is a clone designated herein as "DNA54228-1366".

Figure 414 shows the amino acid sequence (SEQ ID NO:414) derived from the coding sequence of SEQ
ID NO:413 shown in Figure 413.

Figure 415 shows a nucleotide sequence (SEQ ID NO:415) of a native sequence PRO1080 cDNA,
wherein SEQ ID NO:415 is a clone designated herein as "DNA56047-1456".

Figure 416 shows the amino acid sequence (SEQ ID NO:416) derived from the coding sequence of SEQ
ID NO:415 shown in Figure 415.

Figure 417 shows a nucleotide sequence (SEQ ID NO:417) of a native sequence PRO1017 cDNA,
wherein SEQ ID NO:417 is a clone designated herein as "DNA56112-1379".

Figure 418 shows the amino acid sequence (SEQ ID NO:418) derived from the coding sequence of SEQ
ID NO:417 shown in Figure 417.

Figure 419 shows a nucleotide sequence (SEQ ID NO:419) of a native sequence PRO1016 cDNA,
wherein SEQ ID NO:419 is a clone designated herein as "DNA56113-1378".

Figure 420 shows the amino acid sequence (SEQ ID NO:420) derived from the coding sequence of SEQ
ID NO:419 shown in Figure 419.

Figure 421 shows a nucleotide sequence (SEQ ID NO:421) of a native sequence PRO792 cDNA,
wherein SEQ ID NO:421 is a clone designated herein as "DNA56352-1358".

Figure 422 shows the amino acid sequence (SEQ ID NO:422) derived from the coding sequence of SEQ
ID NO:421 shown in Figure 421.

Figure 423 shows a nucleotide sequence (SEQ ID NO:423) of a native sequence PRO938 cDNA,
wherein SEQ ID NO:423 is a clone designated herein as "DNA56433-1406".

Figure 424 shows the amino acid sequence (SEQ ID NO:424) derived from the coding sequence of SEQ
ID NO:423 shown in Figure 423.

Figure 425 shows a nucleotide sequence (SEQ ID NO:425) of a native sequence PRO1012 cDNA,
wherein SEQ ID NO:425 is a clone designated herein as "DNA56439-1376".

Figure 426 shows the amino acid sequence (SEQ ID NO:426) derived from the coding sequence of SEQ
ID NO:425 shown in Figure 425.



WO 01/40466 PCT/US00/32678

Figure 427 shows a nucleotide sequence (SEQ ID NO:427) of a native sequence PRO1008 cDNA, 
wherein SEQ ID NO:427 is a clone designated herein as "DNA57530-1375". 

Figure 428 shows the amino acid sequence (SEQ ID NO:428) derived from the coding sequence of SEQ 
ID NO:427 shown in Figure 427. 

Figure 429 shows a nucleotide sequence (SEQ ID NO:429) of a native sequence PRO1075 cDNA,
wherein SEQ ID NO:429 is a clone designated herein as "DNA57689-1385".

Figure 430 shows the amino acid sequence (SEQ ID NO:430) derived from the coding sequence of SEQ
ID NO:429 shown in Figure 429.

Figure 431 shows a nucleotide sequence (SEQ ID NO:431) of a native sequence PRO1007 cDNA,
wherein SEQ ID NO:431 is a clone designated herein as "DNA57690-1374".

Figure 432 shows the amino acid sequence (SEQ ID NO:432) derived from the coding sequence of SEQ
ID NO:431 shown in Figure 431.

Figure 433 shows a nucleotide sequence (SEQ ID NO:433) of a native sequence PRO1056 cDNA,
wherein SEQ ID NO:433 is a clone designated herein as "DNA57693-1424".

Figure 434 shows the amino acid sequence (SEQ ID NO:434) derived from the coding sequence of SEQ
ID NO:433 shown in Figure 433.

Figure 435 shows a nucleotide sequence (SEQ ID NO:435) of a native sequence PRO791 cDNA,
wherein SEQ ID NO:435 is a clone designated herein as "DNA57838-1337".

Figure 436 shows the amino acid sequence (SEQ ID NO:436) derived from the coding sequence of SEQ
ID NO:435 shown in Figure 435.

Figure 437 shows a nucleotide sequence (SEQ ID NO:437) of a native sequence PRO1111 cDNA,
wherein SEQ ID NO:437 is a clone designated herein as "DNA58721-1475".

Figure 438 shows the amino acid sequence (SEQ ID NO:438) derived from the coding sequence of SEQ
ID NO:437 shown in Figure 437.

Figure 439 shows a nucleotide sequence (SEQ ID NO:439) of a native sequence PRO812 cDNA,
wherein SEQ ID NO:439 is a clone designated herein as "DNA59205-1421".

Figure 440 shows the amino acid sequence (SEQ ID NO:440) derived from the coding sequence of SEQ
ID NO:439 shown in Figure 439.

Figure 441 shows a nucleotide sequence (SEQ ID NO:441) of a native sequence PRO1066 cDNA,
wherein SEQ ID NO:441 is a clone designated herein as "DNA59215-1425".

Figure 442 shows the amino acid sequence (SEQ ID NO:442) derived from the coding sequence of SEQ
ID NO:441 shown in Figure 441.

Figure 443 shows a nucleotide sequence (SEQ ID NO:443) of a native sequence PRO1185 cDNA,
wherein SEQ ID NO:443 is a clone designated herein as "DNA59220-1514".

Figure 444 shows the amino acid sequence (SEQ ID NO:444) derived from the coding sequence of SEQ
ID NO:443 shown in Figure 443.

Figure 445 shows a nucleotide sequence (SEQ ID NO:445) of a native sequence PRO1031 cDNA,
wherein SEQ ID NO:445 is a clone designated herein as "DNA59294-1381".

Figure 446 shows the amino acid sequence (SEQ ID NO:446) derived from the coding sequence of SEQ
ID NO:445 shown in Figure 445.

Figure 447 shows a nucleotide sequence (SEQ ID NO:447) of a native sequence PRO1360 cDNA, 
wherein SEQ ID NO:447 is a clone designated herein as "DNA59488-1603". 

Figure 448 shows the amino acid sequence (SEQ ID NO:448) derived from the coding sequence of SEQ 
ID NO:447 shown in Figure 447. 

Figure 449 shows a nucleotide sequence (SEQ ID NO:449) of a native sequence PRO1309 cDNA,
wherein SEQ ID NO:449 is a clone designated herein as "DNA59588-1571".

Figure 450 shows the amino acid sequence (SEQ ID NO:450) derived from the coding sequence of SEQ
ID NO:449 shown in Figure 449.

Figure 451 shows a nucleotide sequence (SEQ ID NO:451) of a native sequence PRO1107 cDNA,
wherein SEQ ID NO:451 is a clone designated herein as "DNA59606-1471".

Figure 452 shows the amino acid sequence (SEQ ID NO:452) derived from the coding sequence of SEQ
ID NO:451 shown in Figure 451.

Figure 453 shows a nucleotide sequence (SEQ ID NO:453) of a native sequence PRO836 cDNA,
wherein SEQ ID NO:453 is a clone designated herein as "DNA59620-1463".

Figure 454 shows the amino acid sequence (SEQ ID NO:454) derived from the coding sequence of SEQ
ID NO:453 shown in Figure 453.

Figure 455 shows a nucleotide sequence (SEQ ID NO:455) of a native sequence PRO1132 cDNA,
wherein SEQ ID NO:455 is a clone designated herein as "DNA59767-1489".

Figure 456 shows the amino acid sequence (SEQ ID NO:456) derived from the coding sequence of SEQ
ID NO:455 shown in Figure 455.

Figure 457 shows a nucleotide sequence (SEQ ID NO:457) of a native sequence PRO1131 cDNA,
wherein SEQ ID NO:457 is a clone designated herein as "DNA59777-1480".

Figure 458 shows the amino acid sequence (SEQ ID NO:458) derived from the coding sequence of SEQ
ID NO:457 shown in Figure 457.

Figure 459 shows a nucleotide sequence (SEQ ID NO:459) of a native sequence PRO1130 cDNA,
wherein SEQ ID NO:459 is a clone designated herein as "DNA59814-1486".

Figure 460 shows the amino acid sequence (SEQ ID NO:460) derived from the coding sequence of SEQ
ID NO:459 shown in Figure 459.

Figure 461 shows a nucleotide sequence (SEQ ID NO:461) of a native sequence PRO844 cDNA,
wherein SEQ ID NO:461 is a clone designated herein as "DNA59839-1461".

Figure 462 shows the amino acid sequence (SEQ ID NO:462) derived from the coding sequence of SEQ
ID NO:461 shown in Figure 461.

Figure 463 shows a nucleotide sequence (SEQ ID NO:463) of a native sequence PRO1154 cDNA,
wherein SEQ ID NO:463 is a clone designated herein as "DNA59846-1503".

Figure 464 shows the amino acid sequence (SEQ ID NO:464) derived from the coding sequence of SEQ
ID NO:463 shown in Figure 463.

Figure 465 shows a nucleotide sequence (SEQ ID NO:465) of a native sequence PRO1181 cDNA,
wherein SEQ ID NO:465 is a clone designated herein as "DNA59847-1511".

Figure 466 shows the amino acid sequence (SEQ ID NO:466) derived from the coding sequence of SEQ
ID NO:465 shown in Figure 465.

Figure 467 shows a nucleotide sequence (SEQ ID NO:467) of a native sequence PRO1126 cDNA,
wherein SEQ ID NO:467 is a clone designated herein as "DNA60615-1483".

Figure 468 shows the amino acid sequence (SEQ ID NO:468) derived from the coding sequence of SEQ
ID NO:467 shown in Figure 467.

Figure 469 shows a nucleotide sequence (SEQ ID NO:469) of a native sequence PRO1186 cDNA,
wherein SEQ ID NO:469 is a clone designated herein as "DNA60621-1516".

Figure 470 shows the amino acid sequence (SEQ ID NO:470) derived from the coding sequence of SEQ
ID NO:469 shown in Figure 469.

Figure 471 shows a nucleotide sequence (SEQ ID NO:471) of a native sequence PRO1198 cDNA,
wherein SEQ ID NO:471 is a clone designated herein as "DNA60622-1525".

Figure 472 shows the amino acid sequence (SEQ ID NO:472) derived from the coding sequence of SEQ
ID NO:471 shown in Figure 471.

Figure 473 shows a nucleotide sequence (SEQ ID NO:473) of a native sequence PRO1159 cDNA,
wherein SEQ ID NO:473 is a clone designated herein as "DNA60627-1508".

Figure 474 shows the amino acid sequence (SEQ ID NO:474) derived from the coding sequence of SEQ
ID NO:473 shown in Figure 473.

Figure 475 shows a nucleotide sequence (SEQ ID NO:475) of a native sequence PRO1265 cDNA,
wherein SEQ ID NO:475 is a clone designated herein as "DNA60764-1533".

Figure 476 shows the amino acid sequence (SEQ ID NO:476) derived from the coding sequence of SEQ
ID NO:475 shown in Figure 475.

Figure 477 shows a nucleotide sequence (SEQ ID NO:477) of a native sequence PRO1250 cDNA,
wherein SEQ ID NO:477 is a clone designated herein as "DNA60775-1532".

Figure 478 shows the amino acid sequence (SEQ ID NO:478) derived from the coding sequence of SEQ
ID NO:477 shown in Figure 477.

Figure 479 shows a nucleotide sequence (SEQ ID NO:479) of a native sequence PRO1475 cDNA,
wherein SEQ ID NO:479 is a clone designated herein as "DNA61185-1646".

Figure 480 shows the amino acid sequence (SEQ ID NO:480) derived from the coding sequence of SEQ
ID NO:479 shown in Figure 479.

Figure 481 shows a nucleotide sequence (SEQ ID NO:481) of a native sequence PRO1312 cDNA,
wherein SEQ ID NO:481 is a clone designated herein as "DNA61873-1574".

Figure 482 shows the amino acid sequence (SEQ ID NO:482) derived from the coding sequence of SEQ
ID NO:481 shown in Figure 481.

Figure 483 shows a nucleotide sequence (SEQ ID NO:483) of a native sequence PRO1308 cDNA,
wherein SEQ ID NO:483 is a clone designated herein as "DNA62306-1570".

Figure 484 shows the amino acid sequence (SEQ ID NO:484) derived from the coding sequence of SEQ
ID NO:483 shown in Figure 483.

Figure 485 shows a nucleotide sequence (SEQ ID NO:485) of a native sequence PRO1326 cDNA,
wherein SEQ ID NO:485 is a clone designated herein as "DNA62808-1582".

Figure 486 shows the amino acid sequence (SEQ ID NO:486) derived from the coding sequence of SEQ
ID NO:485 shown in Figure 485.

Figure 487 shows a nucleotide sequence (SEQ ID NO:487) of a native sequence PRO1192 cDNA,
wherein SEQ ID NO:487 is a clone designated herein as "DNA62814-1521".

Figure 488 shows the amino acid sequence (SEQ ID NO:488) derived from the coding sequence of SEQ
ID NO:487 shown in Figure 487.

Figure 489 shows a nucleotide sequence (SEQ ID NO:489) of a native sequence PRO1246 cDNA,
wherein SEQ ID NO:489 is a clone designated herein as "DNA64885-1529".

Figure 490 shows the amino acid sequence (SEQ ID NO:490) derived from the coding sequence of SEQ
ID NO:489 shown in Figure 489.

Figure 491 shows a nucleotide sequence (SEQ ID NO:491) of a native sequence PRO1356 cDNA,
wherein SEQ ID NO:491 is a clone designated herein as "DNA64886-1601".

Figure 492 shows the amino acid sequence (SEQ ID NO:492) derived from the coding sequence of SEQ
ID NO:491 shown in Figure 491.

Figure 493 shows a nucleotide sequence (SEQ ID NO:493) of a native sequence PRO1275 cDNA,
wherein SEQ ID NO:493 is a clone designated herein as "DNA64888-1542".

Figure 494 shows the amino acid sequence (SEQ ID NO:494) derived from the coding sequence of SEQ
ID NO:493 shown in Figure 493.

Figure 495 shows a nucleotide sequence (SEQ ID NO:495) of a native sequence PRO1274 cDNA,
wherein SEQ ID NO:495 is a clone designated herein as "DNA64889-1541".

Figure 496 shows the amino acid sequence (SEQ ID NO:496) derived from the coding sequence of SEQ
ID NO:495 shown in Figure 495.

Figure 497 shows a nucleotide sequence (SEQ ID NO:497) of a native sequence PRO1358 cDNA,
wherein SEQ ID NO:497 is a clone designated herein as "DNA64890-1612".

Figure 498 shows the amino acid sequence (SEQ ID NO:498) derived from the coding sequence of SEQ
ID NO:497 shown in Figure 497.

Figure 499 shows a nucleotide sequence (SEQ ID NO:499) of a native sequence PRO1286 cDNA,
wherein SEQ ID NO:499 is a clone designated herein as "DNA64903-1553".

Figure 500 shows the amino acid sequence (SEQ ID NO:500) derived from the coding sequence of SEQ
ID NO:499 shown in Figure 499.

Figure 501 shows a nucleotide sequence (SEQ ID NO:501) of a native sequence PRO1294 cDNA,
wherein SEQ ID NO:501 is a clone designated herein as "DNA64905-1558".

Figure 502 shows the amino acid sequence (SEQ ID NO:502) derived from the coding sequence of SEQ
ID NO:501 shown in Figure 501.

Figure 503 shows a nucleotide sequence (SEQ ID NO:503) of a native sequence PRO1273 cDNA,
wherein SEQ ID NO:503 is a clone designated herein as "DNA65402-1540".

Figure 504 shows tfie amino acid sequence (SEQ ID NO:504) derived from the coding sequence of SEQ 
ID NO:503 shown in Figure 503. 

Figure 505 shows a nucleotide sequence (SEQ ID NO:505) of a native sequence PRO1279 cDNA,
wherein SEQ ID NO:505 is a clone designated herein as "DNA65405-1547".

Figure 506 shows the amino acid sequence (SEQ ID NO:506) derived from the coding sequence of SEQ
ID NO:505 shown in Figure 505.

Figure 507 shows a nucleotide sequence (SEQ ID NO:507) of a native sequence PRO1195 cDNA,
wherein SEQ ID NO:507 is a clone designated herein as "DNA65412-1523".

Figure 508 shows the amino acid sequence (SEQ ID NO:508) derived from the coding sequence of SEQ
ID NO:507 shown in Figure 507.

Figure 509 shows a nucleotide sequence (SEQ ID NO:509) of a native sequence PRO1271 cDNA,
wherein SEQ ID NO:509 is a clone designated herein as "DNA66309-1538".

Figure 510 shows the amino acid sequence (SEQ ID NO:510) derived from the coding sequence of SEQ
ID NO:509 shown in Figure 509.

Figure 511 shows a nucleotide sequence (SEQ ID NO:511) of a native sequence PRO1338 cDNA,
wherein SEQ ID NO:511 is a clone designated herein as "DNA66667-1596".

Figure 512 shows the amino acid sequence (SEQ ID NO:512) derived from the coding sequence of SEQ
ID NO:511 shown in Figure 511.

Figure 513 shows a nucleotide sequence (SEQ ID NO:513) of a native sequence PRO1343 cDNA,
wherein SEQ ID NO:513 is a clone designated herein as "DNA66675-1587".

Figure 514 shows the amino acid sequence (SEQ ID NO:514) derived from the coding sequence of SEQ
ID NO:513 shown in Figure 513.

Figure 515 shows a nucleotide sequence (SEQ ID NO:515) of a native sequence PRO1434 cDNA,
wherein SEQ ID NO:515 is a clone designated herein as "DNA68818-2536".

Figure 516 shows the amino acid sequence (SEQ ID NO:516) derived from the coding sequence of SEQ
ID NO:515 shown in Figure 515.

Figure 517 shows a nucleotide sequence (SEQ ID NO:517) of a native sequence PRO1418 cDNA,
wherein SEQ ID NO:517 is a clone designated herein as "DNA68864-1629".

Figure 518 shows the amino acid sequence (SEQ ID NO:518) derived from the coding sequence of SEQ
ID NO:517 shown in Figure 517.

Figure 519 shows a nucleotide sequence (SEQ ID NO:519) of a native sequence PRO1387 cDNA,
wherein SEQ ID NO:519 is a clone designated herein as "DNA68872-1620".

Figure 520 shows the amino acid sequence (SEQ ID NO:520) derived from the coding sequence of SEQ
ID NO:519 shown in Figure 519.

Figure 521 shows a nucleotide sequence (SEQ ID NO:521) of a native sequence PRO1384 cDNA,
wherein SEQ ID NO:521 is a clone designated herein as "DNA71159-1617".

Figure 522 shows the amino acid sequence (SEQ ID NO:522) derived from the coding sequence of SEQ
ID NO:521 shown in Figure 521.

Figure 523 shows a nucleotide sequence (SEQ ID NO:523) of a native sequence PRO1565 cDNA,
wherein SEQ ID NO:523 is a clone designated herein as "DNA73727-1673".

Figure 524 shows the amino acid sequence (SEQ ID NO:524) derived from the coding sequence of SEQ 
ID NO:523 shown in Figure 523. 

Figure 525 shows a nucleotide sequence (SEQ ID NO:525) of a native sequence PRO1474 cDNA,
wherein SEQ ID NO:525 is a clone designated herein as "DNA73739-1645".

Figure 526 shows the amino acid sequence (SEQ ID NO:526) derived from the coding sequence of SEQ
ID NO:525 shown in Figure 525.

Figure 527 shows a nucleotide sequence (SEQ ID NO:527) of a native sequence PRO1917 cDNA,
wherein SEQ ID NO:527 is a clone designated herein as "DNA76400-2528".

Figure 528 shows the amino acid sequence (SEQ ID NO:528) derived from the coding sequence of SEQ
ID NO:527 shown in Figure 527.

Figure 529 shows a nucleotide sequence (SEQ ID NO:529) of a native sequence PRO1787 cDNA,
wherein SEQ ID NO:529 is a clone designated herein as "DNA76510-2504".

Figure 530 shows the amino acid sequence (SEQ ID NO:530) derived from the coding sequence of SEQ
ID NO:529 shown in Figure 529.

Figure 531 shows a nucleotide sequence (SEQ ID NO:531) of a native sequence PRO1556 cDNA,
wherein SEQ ID NO:531 is a clone designated herein as "DNA76529-1666".

Figure 532 shows the amino acid sequence (SEQ ID NO:532) derived from the coding sequence of SEQ
ID NO:531 shown in Figure 531.

Figure 533 shows a nucleotide sequence (SEQ ID NO:533) of a native sequence PRO1561 cDNA,
wherein SEQ ID NO:533 is a clone designated herein as "DNA76538-1670".

Figure 534 shows the amino acid sequence (SEQ ID NO:534) derived from the coding sequence of SEQ
ID NO:533 shown in Figure 533.

Figure 535 shows a nucleotide sequence (SEQ ID NO:535) of a native sequence PRO1693 cDNA,
wherein SEQ ID NO:535 is a clone designated herein as "DNA77301-1708".

Figure 536 shows the amino acid sequence (SEQ ID NO:536) derived from the coding sequence of SEQ
ID NO:535 shown in Figure 535.

Figure 537 shows a nucleotide sequence (SEQ ID NO:537) of a native sequence PRO1868 cDNA,
wherein SEQ ID NO:537 is a clone designated herein as "DNA77624-2515".

Figure 538 shows the amino acid sequence (SEQ ID NO:538) derived from the coding sequence of SEQ
ID NO:537 shown in Figure 537.

Figure 539 shows a nucleotide sequence (SEQ ID NO:539) of a native sequence PRO1890 cDNA,
wherein SEQ ID NO:539 is a clone designated herein as "DNA79230-2525".

Figure 540 shows the amino acid sequence (SEQ ID NO:540) derived from the coding sequence of SEQ
ID NO:539 shown in Figure 539.

Figure 541 shows a nucleotide sequence (SEQ ID NO:541) of a native sequence PRO1887 cDNA,
wherein SEQ ID NO:541 is a clone designated herein as "DNA79862-2522".

Figure 542 shows the amino acid sequence (SEQ ID NO:542) derived from the coding sequence of SEQ
ID NO:541 shown in Figure 541.

Figure 543 shows a nucleotide sequence (SEQ ID NO:543) of a native sequence PRO4353 cDNA,
wherein SEQ ID NO:543 is a clone designated herein as "DNA80145-2594".

Figure 544 shows the amino acid sequence (SEQ ID NO:544) derived from the coding sequence of SEQ
ID NO:543 shown in Figure 543.

Figure 545 shows a nucleotide sequence (SEQ ID NO:545) of a native sequence PRO1801 cDNA,
wherein SEQ ID NO:545 is a clone designated herein as "DNA83500-2506".

Figure 546 shows the amino acid sequence (SEQ ID NO:546) derived from the coding sequence of SEQ
ID NO:545 shown in Figure 545.

Figure 547 shows a nucleotide sequence (SEQ ID NO:547) of a native sequence PRO4357 cDNA,
wherein SEQ ID NO:547 is a clone designated herein as "DNA84917-2597".

Figure 548 shows the amino acid sequence (SEQ ID NO:548) derived from the coding sequence of SEQ
ID NO:547 shown in Figure 547.

Figure 549 shows a nucleotide sequence (SEQ ID NO:549) of a native sequence PRO4302 cDNA,
wherein SEQ ID NO:549 is a clone designated herein as "DNA92218-2554".

Figure 550 shows the amino acid sequence (SEQ ID NO:550) derived from the coding sequence of SEQ 
ID NO:549 shown in Figure 549. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
I. Definitions

The terms "PRO polypeptide" and "PRO" as used herein and when immediately followed by a 
numerical designation refer to various polypeptides, wherein the complete designation (i.e., PRO/number) refers
to specific polypeptide sequences as described herein. The terms "PRO/number polypeptide" and
"PRO/number" wherein the term "number" is provided as an actual numerical designation as used herein
encompass native sequence polypeptides and polypeptide variants (which are further defined herein). The PRO
polypeptides described herein may be isolated from a variety of sources, such as from human tissue types or
from another source, or prepared by recombinant or synthetic methods. The term "PRO polypeptide" refers to
each individual PRO/number polypeptide disclosed herein. All disclosures in this specification which refer to
the "PRO polypeptide" refer to each of the polypeptides individually as well as jointly. For example,
descriptions of the preparation of, purification of, derivation of, formation of antibodies to or against,
administration of, compositions containing, treatment of a disease with, etc., pertain to each polypeptide of the
invention individually. The term "PRO polypeptide" also includes variants of the PRO/number polypeptides
disclosed herein. 

A "native sequence PRO polypeptide" comprises a polypeptide having the same amino acid sequence 
as the corresponding PRO polypeptide derived from nature. Such native sequence PRO polypeptides can be
isolated from nature or can be produced by recombinant or synthetic means. The term "native sequence PRO
polypeptide" specifically encompasses naturally-occurring truncated or secreted forms of the specific PRO
polypeptide (e.g., an extracellular domain sequence), naturally-occurring variant forms (e.g., alternatively
spliced forms) and naturally-occurring allelic variants of the polypeptide. In various embodiments of the
invention, the native sequence PRO polypeptides disclosed herein are mature or full-length native sequence
polypeptides comprising the full-length amino acid sequences shown in the accompanying figures. Start and
stop codons are shown in bold font and underlined in the figures. However, while the PRO polypeptides
disclosed in the accompanying figures are shown to begin with methionine residues designated herein as amino
acid position 1 in the figures, it is conceivable and possible that other methionine residues located either upstream
or downstream from the amino acid position 1 in the figures may be employed as the starting amino acid residue
for the PRO polypeptides.

The PRO polypeptide "extracellular domain" or "ECD" refers to a form of the PRO polypeptide which
is essentially free of the transmembrane and cytoplasmic domains. Ordinarily, a PRO polypeptide ECD will have
less than 1% of such transmembrane and/or cytoplasmic domains and preferably, will have less than 0.5% of
such domains. It will be understood that any transmembrane domains identified for the PRO polypeptides of
the present invention are identified pursuant to criteria routinely employed in the art for identifying that type of
hydrophobic domain. The exact boundaries of a transmembrane domain may vary but most likely by no more
than about 5 amino acids at either end of the domain as initially identified herein. Optionally, therefore, an
extracellular domain of a PRO polypeptide may contain from about 5 or fewer amino acids on either side of the
transmembrane domain/extracellular domain boundary as identified in the Examples or specification and such
polypeptides, with or without the associated signal peptide, and nucleic acid encoding them, are contemplated
by the present invention.

The approximate location of the "signal peptides" of the various PRO polypeptides disclosed herein is
shown in the present specification and/or the accompanying figures. It is noted, however, that the C-terminal
boundary of a signal peptide may vary, but most likely by no more than about 5 amino acids on either side of
the signal peptide C-terminal boundary as initially identified herein, wherein the C-terminal boundary of the
signal peptide may be identified pursuant to criteria routinely employed in the art for identifying that type of
amino acid sequence element (e.g., Nielsen et al., Prot. Eng. 10:1-6 (1997) and von Heijne et al., Nucl. Acids
Res. 14:4683-4690 (1986)). Moreover, it is also recognized that, in some cases, cleavage of a signal sequence
from a secreted polypeptide is not entirely uniform, resulting in more than one secreted species. These mature
polypeptides, where the signal peptide is cleaved within no more than about 5 amino acids on either side of the
C-terminal boundary of the signal peptide as identified herein, and the polynucleotides encoding them, are
contemplated by the present invention.
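
As an illustration of the boundary tolerance described above, the following Python sketch enumerates the mature forms that would result if a signal peptide were cleaved at any position within 5 residues on either side of a predicted C-terminal boundary. The function name, the toy precursor sequence and the predicted boundary are hypothetical and are not taken from the figures; the fixed tolerance of 5 is a simplification of "about 5".

```python
def mature_forms(sequence, boundary, tolerance=5):
    """Map each candidate cleavage position to the resulting mature sequence.

    `boundary` is the 1-based position of the last residue of the predicted
    signal peptide; candidate positions lie within `tolerance` residues of it.
    """
    low = max(1, boundary - tolerance)
    high = min(len(sequence) - 1, boundary + tolerance)
    forms = {}
    for last_signal_residue in range(low, high + 1):
        # The mature polypeptide begins immediately after the cleavage site.
        forms[last_signal_residue] = sequence[last_signal_residue:]
    return forms


# Hypothetical 42-residue precursor with a predicted 22-residue signal peptide.
toy = "M" + "A" * 21 + "QWERTYIPASDFGHKLCVNM"
forms = mature_forms(toy, boundary=22)
for position in sorted(forms):
    print(position, forms[position])
```

Each entry corresponds to one possible secreted species; the entry at the predicted boundary is the mature form as initially identified.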

"PRO polypeptide variant" means an active PRO polypeptide as defined above or below having at least 
about 80% amino acid sequence identity with a full-length native sequence PRO polypeptide sequence as
disclosed herein, a PRO polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular
domain of a PRO polypeptide, with or without the signal peptide, as disclosed herein or any other fragment of
a full-length PRO polypeptide sequence as disclosed herein. Such PRO polypeptide variants include, for
instance, PRO polypeptides wherein one or more amino acid residues are added, or deleted, at the N- or C-terminus
of the full-length native amino acid sequence. Ordinarily, a PRO polypeptide variant will have at least
about 80% amino acid sequence identity, alternatively at least about 81% amino acid sequence identity,
alternatively at least about 82% amino acid sequence identity, alternatively at least about 83% amino acid
sequence identity, alternatively at least about 84% amino acid sequence identity, alternatively at least about 85%
amino acid sequence identity, alternatively at least about 86% amino acid sequence identity, alternatively at least
about 87% amino acid sequence identity, alternatively at least about 88% amino acid sequence identity,
alternatively at least about 89% amino acid sequence identity, alternatively at least about 90% amino acid
sequence identity, alternatively at least about 91% amino acid sequence identity, alternatively at least about 92%
amino acid sequence identity, alternatively at least about 93% amino acid sequence identity, alternatively at least
about 94% amino acid sequence identity, alternatively at least about 95% amino acid sequence identity,
alternatively at least about 96% amino acid sequence identity, alternatively at least about 97% amino acid
sequence identity, alternatively at least about 98% amino acid sequence identity and alternatively at least about
99% amino acid sequence identity to a full-length native sequence PRO polypeptide sequence as disclosed herein,
a PRO polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular domain of a PRO
polypeptide, with or without the signal peptide, as disclosed herein or any other specifically defined fragment
of a full-length PRO polypeptide sequence as disclosed herein. Ordinarily, PRO variant polypeptides are at least
about 10 amino acids in length, alternatively at least about 20 amino acids in length, alternatively at least about
30 amino acids in length, alternatively at least about 40 amino acids in length, alternatively at least about 50
amino acids in length, alternatively at least about 60 amino acids in length, alternatively at least about 70 amino
acids in length, alternatively at least about 80 amino acids in length, alternatively at least about 90 amino acids
in length, alternatively at least about 100 amino acids in length, alternatively at least about 150 amino acids in
length, alternatively at least about 200 amino acids in length, alternatively at least about 300 amino acids in
length, or more.

"Percent (%) amino acid sequence identity" with respect to the PRO polypeptide sequences identified 
herein is defined as the percentage of amino acid residues in a candidate sequence that are identical with the
amino acid residues in the specific PRO polypeptide sequence, after aligning the sequences and introducing gaps,
if necessary, to achieve the maximum percent sequence identity, and not considering any conservative
substitutions as part of the sequence identity. Alignment for purposes of determining percent amino acid
sequence identity can be achieved in various ways that are within the skill in the art, for instance, using publicly
available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) software. Those
skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed
to achieve maximal alignment over the full length of the sequences being compared. For purposes herein,
however, % amino acid sequence identity values are generated using the sequence comparison computer program
ALIGN-2, wherein the complete source code for the ALIGN-2 program is provided in Table 1 below. The
ALIGN-2 sequence comparison computer program was authored by Genentech, Inc. and the source code shown
in Table 1 below has been filed with user documentation in the U.S. Copyright Office, Washington D.C., 20559,
where it is registered under U.S. Copyright Registration No. TXU510087. The ALIGN-2 program is publicly
available through Genentech, Inc., South San Francisco, California or may be compiled from the source code
provided in Table 1 below. The ALIGN-2 program should be compiled for use on a UNIX operating system,
preferably digital UNIX V4.0D. All sequence comparison parameters are set by the ALIGN-2 program and
do not vary.

In situations where ALIGN-2 is employed for amino acid sequence comparisons, the % amino acid
sequence identity of a given amino acid sequence A to, with, or against a given amino acid sequence B (which
can alternatively be phrased as a given amino acid sequence A that has or comprises a certain % amino acid
sequence identity to, with, or against a given amino acid sequence B) is calculated as follows:

100 times the fraction X/Y

where X is the number of amino acid residues scored as identical matches by the sequence alignment program
ALIGN-2 in that program's alignment of A and B, and where Y is the total number of amino acid residues in
B. It will be appreciated that where the length of amino acid sequence A is not equal to the length of amino acid
sequence B, the % amino acid sequence identity of A to B will not equal the % amino acid sequence identity of
B to A. As examples of % amino acid sequence identity calculations using this method, Tables 2 and 3
demonstrate how to calculate the % amino acid sequence identity of the amino acid sequence designated
"Comparison Protein" to the amino acid sequence designated "PRO", wherein "PRO" represents the amino acid
sequence of a hypothetical PRO polypeptide of interest, "Comparison Protein" represents the amino acid
sequence of a polypeptide against which the "PRO" polypeptide of interest is being compared, and "X", "Y" and
"Z" each represent different hypothetical amino acid residues.

Unless specifically stated otherwise, all % amino acid sequence identity values used herein are obtained
as described in the immediately preceding paragraph using the ALIGN-2 computer program. However, % amino
acid sequence identity values may also be obtained as described below by using the WU-BLAST-2 computer
program (Altschul et al., Methods in Enzymology 266:460-480 (1996)). Most of the WU-BLAST-2 search
parameters are set to the default values. Those not set to default values, i.e., the adjustable parameters, are set
with the following values: overlap span = 1, overlap fraction = 0.125, word threshold (T) = 11, and scoring
matrix = BLOSUM62. When WU-BLAST-2 is employed, a % amino acid sequence identity value is
determined by dividing (a) the number of matching identical amino acid residues between the amino acid
sequence of the PRO polypeptide of interest having a sequence derived from the native PRO polypeptide and
the comparison amino acid sequence of interest (i.e., the sequence against which the PRO polypeptide of interest
is being compared which may be a PRO variant polypeptide) as determined by WU-BLAST-2 by (b) the total
number of amino acid residues of the PRO polypeptide of interest. For example, in the statement "a polypeptide
comprising an amino acid sequence A which has or having at least 80% amino acid sequence identity to the
amino acid sequence B", the amino acid sequence A is the comparison amino acid sequence of interest and the
amino acid sequence B is the amino acid sequence of the PRO polypeptide of interest.
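
The arithmetic behind the percent identity values described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name and the example counts are hypothetical, and the number of identical matches is taken as an input rather than computed, since the alignment programs themselves (ALIGN-2, WU-BLAST-2) are not reproduced here.

```python
def percent_identity(identical_matches, reference_length):
    """Return 100 * X / Y.

    X is the number of residues scored as identical matches by the alignment
    program; Y is the total number of residues in the reference sequence
    (sequence B, i.e. the PRO polypeptide of interest).
    """
    if reference_length <= 0:
        raise ValueError("the reference sequence must contain at least one residue")
    return 100.0 * identical_matches / reference_length


# Hypothetical case: sequence A has 15 residues, sequence B has 12 residues,
# and the aligner scores 9 positions as identical.
x = 9
a_to_b = percent_identity(x, 12)  # denominator is the length of B
b_to_a = percent_identity(x, 15)  # denominator is the length of A
print(a_to_b, b_to_a)
```

Because the denominator is the length of whichever sequence is treated as the reference, the two values differ whenever the sequences differ in length, which is the asymmetry noted in the text.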

Percent amino acid sequence identity may also be determined using the sequence comparison program
NCBI-BLAST2 (Altschul et al., Nucleic Acids Res. 25:3389-3402 (1997)). The NCBI-BLAST2 sequence
comparison program may be downloaded from http://www.ncbi.nlm.nih.gov or otherwise obtained from the
National Institute of Health, Bethesda, MD. NCBI-BLAST2 uses several search parameters, wherein all of those
search parameters are set to default values including, for example, unmask = yes, strand = all, expected
occurrences = 10, minimum low complexity length = 15/5, multi-pass e-value = 0.01, constant for multi-pass
= 25, dropoff for final gapped alignment = 25 and scoring matrix = BLOSUM62.

In situations where NCBI-BLAST2 is employed for amino acid sequence comparisons, the % amino
acid sequence identity of a given amino acid sequence A to, with, or against a given amino acid sequence B
(which can alternatively be phrased as a given amino acid sequence A that has or comprises a certain % amino
acid sequence identity to, with, or against a given amino acid sequence B) is calculated as follows:

100 times the fraction X/Y

where X is the number of amino acid residues scored as identical matches by the sequence alignment program
NCBI-BLAST2 in that program's alignment of A and B, and where Y is the total number of amino acid residues
in B. It will be appreciated that where the length of amino acid sequence A is not equal to the length of amino
acid sequence B, the % amino acid sequence identity of A to B will not equal the % amino acid sequence identity
of B to A.

"PRO variant polynucleotide" or "PRO variant nucleic acid sequence" means a nucleic acid molecule
which encodes an active PRO polypeptide as defined below and which has at least about 80% nucleic acid
sequence identity with a nucleic acid sequence encoding a full-length native sequence PRO polypeptide
sequence as disclosed herein, a full-length native sequence PRO polypeptide sequence lacking the signal peptide
as disclosed herein, an extracellular domain of a PRO polypeptide, with or without the signal peptide, as
disclosed herein or any other fragment of a full-length PRO polypeptide sequence as disclosed herein.
Ordinarily, a PRO variant polynucleotide will have at least about 80% nucleic acid sequence identity,
alternatively at least about 81% nucleic acid sequence identity, alternatively at least about 82% nucleic acid
sequence identity, alternatively at least about 83% nucleic acid sequence identity, alternatively at least about 84%
nucleic acid sequence identity, alternatively at least about 85% nucleic acid sequence identity, alternatively at
least about 86% nucleic acid sequence identity, alternatively at least about 87% nucleic acid sequence identity,
alternatively at least about 88% nucleic acid sequence identity, alternatively at least about 89% nucleic acid
sequence identity, alternatively at least about 90% nucleic acid sequence identity, alternatively at least about 91%
nucleic acid sequence identity, alternatively at least about 92% nucleic acid sequence identity, alternatively at
least about 93% nucleic acid sequence identity, alternatively at least about 94% nucleic acid sequence identity,
alternatively at least about 95% nucleic acid sequence identity, alternatively at least about 96% nucleic acid
sequence identity, alternatively at least about 97% nucleic acid sequence identity, alternatively at least about 98%
nucleic acid sequence identity and alternatively at least about 99% nucleic acid sequence identity with a nucleic
acid sequence encoding a full-length native sequence PRO polypeptide sequence as disclosed herein, a full-length
native sequence PRO polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular domain
of a PRO polypeptide, with or without the signal sequence, as disclosed herein or any other fragment of a
full-length PRO polypeptide sequence as disclosed herein. Variants do not encompass the native nucleotide
sequence.

Ordinarily, PRO variant polynucleotides are at least about 30 nucleotides in length, alternatively at least
about 60 nucleotides in length, alternatively at least about 90 nucleotides in length, alternatively at least about
120 nucleotides in length, alternatively at least about 150 nucleotides in length, alternatively at least about 180
nucleotides in length, alternatively at least about 210 nucleotides in length, alternatively at least about 240
nucleotides in length, alternatively at least about 270 nucleotides in length, alternatively at least about 300
nucleotides in length, alternatively at least about 450 nucleotides in length, alternatively at least about 600
nucleotides in length, alternatively at least about 900 nucleotides in length, or more.

"Percent (%) nucleic acid sequence identity" with respect to PRO-encoding nucleic acid sequences 
identified herein is defined as the percentage of nucleotides in a candidate sequence that are identical with the 
nucleotides in the PRO nucleic acid sequence of interest, after aligning the sequences and introducing gaps, if 
necessary, to achieve the maximum percent sequence identity. Alignment for purposes of determining percent
nucleic acid sequence identity can be achieved in various ways that are within the skill in the art, for instance,
using publicly available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR)
software. For purposes herein, however, % nucleic acid sequence identity values are generated using the
sequence comparison computer program ALIGN-2, wherein the complete source code for the ALIGN-2 program
is provided in Table 1 below. The ALIGN-2 sequence comparison computer program was authored by
Genentech, Inc. and the source code shown in Table 1 below has been filed with user documentation in the U.S.
Copyright Office, Washington D.C., 20559, where it is registered under U.S. Copyright Registration No.
TXU510087. The ALIGN-2 program is publicly available through Genentech, Inc., South San Francisco,
California or may be compiled from the source code provided in Table 1 below. The ALIGN-2 program should
be compiled for use on a UNIX operating system, preferably digital UNIX V4.0D. All sequence comparison
parameters are set by the ALIGN-2 program and do not vary.

In situations where ALIGN-2 is employed for nucleic acid sequence comparisons, the % nucleic acid 
sequence identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence D (which
can alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain % nucleic acid
sequence identity to, with, or against a given nucleic acid sequence D) is calculated as follows:

100 times the fraction W/Z

where W is the number of nucleotides scored as identical matches by the sequence alignment program ALIGN-2
in that program's alignment of C and D, and where Z is the total number of nucleotides in D. It will be
appreciated that where the length of nucleic acid sequence C is not equal to the length of nucleic acid sequence
D, the % nucleic acid sequence identity of C to D will not equal the % nucleic acid sequence identity of D to
C. As examples of nucleic acid sequence identity calculations, Tables 4 and 5 demonstrate how to calculate
the % nucleic acid sequence identity of the nucleic acid sequence designated "Comparison DNA" to the nucleic
acid sequence designated "PRO-DNA", wherein "PRO-DNA" represents a hypothetical PRO-encoding nucleic
acid sequence of interest, "Comparison DNA" represents the nucleotide sequence of a nucleic acid molecule
against which the "PRO-DNA" nucleic acid molecule of interest is being compared, and "N", "L" and "V" each
represent different hypothetical nucleotides.
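
The W/Z calculation can be illustrated with a short Python sketch that works from an alignment that has already been produced. The assumptions here are loud ones: the two gapped strings below are hypothetical, the "-" gap character and the function name are illustrative choices, and the alignment step is not ALIGN-2 but simply given as input.

```python
def percent_nucleic_identity(aligned_c, aligned_d, gap="-"):
    """Return 100 * W / Z for a pre-computed pairwise alignment of C and D.

    W counts alignment columns where both sequences carry the same nucleotide;
    Z is the total number of nucleotides in D (gap characters excluded).
    """
    if len(aligned_c) != len(aligned_d):
        raise ValueError("aligned sequences must have the same number of columns")
    w = sum(1 for c, d in zip(aligned_c, aligned_d) if c == d and c != gap)
    z = sum(1 for d in aligned_d if d != gap)
    if z == 0:
        raise ValueError("sequence D must contain at least one nucleotide")
    return 100.0 * w / z


# Hypothetical alignment: C has 11 nucleotides, D has 10, and 9 columns match.
aligned_c = "ACGT-ACGTAAC"
aligned_d = "ACGTTACG--AC"
c_to_d = percent_nucleic_identity(aligned_c, aligned_d)
d_to_c = percent_nucleic_identity(aligned_d, aligned_c)
print(c_to_d, round(d_to_c, 2))
```

Swapping the argument order changes only the denominator, so the identity of C to D and of D to C differ when the two sequences are of unequal length.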

Unless specifically stated otherwise, all % nucleic acid sequence identity values used herein are obtained 
as described in the immediately preceding paragraph using the ALIGN-2 computer program. However, %
nucleic acid sequence identity values may also be obtained as described below by using the WU-BLAST-2
computer program (Altschul et al., Methods in Enzymology 266:460-480 (1996)). Most of the WU-BLAST-2
search parameters are set to the default values. Those not set to default values, i.e., the adjustable parameters,
are set with the following values: overlap span = 1, overlap fraction = 0.125, word threshold (T) = 11, and
scoring matrix = BLOSUM62. When WU-BLAST-2 is employed, a % nucleic acid sequence identity value
is determined by dividing (a) the number of matching identical nucleotides between the nucleic acid sequence
of the PRO polypeptide-encoding nucleic acid molecule of interest having a sequence derived from the native
sequence PRO polypeptide-encoding nucleic acid and the comparison nucleic acid molecule of interest (i.e., the
sequence against which the PRO polypeptide-encoding nucleic acid molecule of interest is being compared which
may be a variant PRO polynucleotide) as determined by WU-BLAST-2 by (b) the total number of nucleotides
of the PRO polypeptide-encoding nucleic acid molecule of interest. For example, in the statement "an isolated
nucleic acid molecule comprising a nucleic acid sequence A which has or having at least 80% nucleic acid
sequence identity to the nucleic acid sequence B", the nucleic acid sequence A is the comparison nucleic acid
molecule of interest and the nucleic acid sequence B is the nucleic acid sequence of the PRO polypeptide-encoding
nucleic acid molecule of interest.

Percent nucleic acid sequence identity may also be determined using the sequence comparison program
NCBI-BLAST2 (Altschul et al., Nucleic Acids Res. 25:3389-3402 (1997)). The NCBI-BLAST2 sequence
comparison program may be downloaded from http://www.ncbi.nlm.nih.gov or otherwise obtained from the
National Institute of Health, Bethesda, MD. NCBI-BLAST2 uses several search parameters, wherein all of those
search parameters are set to default values including, for example, unmask = yes, strand = all, expected
occurrences = 10, minimum low complexity length = 15/5, multi-pass e-value = 0.01, constant for multi-pass
= 25, dropoff for final gapped alignment = 25 and scoring matrix = BLOSUM62.

In situations where NCBI-BLAST2 is employed for sequence comparisons, the % nucleic acid sequence
identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence D (which can
alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain % nucleic acid
sequence identity to, with, or against a given nucleic acid sequence D) is calculated as follows:

100 times the fraction W/Z

where W is the number of nucleotides scored as identical matches by the sequence alignment program
NCBI-BLAST2 in that program's alignment of C and D, and where Z is the total number of nucleotides in D. It will
be appreciated that where the length of nucleic acid sequence C is not equal to the length of nucleic acid sequence
D, the % nucleic acid sequence identity of C to D will not equal the % nucleic acid sequence identity of D to
C.

In other embodiments, PRO variant polynucleotides are nucleic acid molecules that encode an active
PRO polypeptide and which are capable of hybridizing, preferably under stringent hybridization and wash
conditions, to nucleotide sequences encoding a full-length PRO polypeptide as disclosed herein. PRO variant
polypeptides may be those that are encoded by a PRO variant polynucleotide.

"Isolated," when used to describe the various polypeptides disclosed herein, means polypeptide that has
been identified and separated and/or recovered from a component of its natural environment. Contaminant
components of its natural environment are materials that would typically interfere with diagnostic or therapeutic
uses for the polypeptide, and may include enzymes, hormones, and other proteinaceous or non-proteinaceous
solutes. In preferred embodiments, the polypeptide will be purified (1) to a degree sufficient to obtain at least
15 residues of N-terminal or internal amino acid sequence by use of a spinning cup sequenator, or (2) to
homogeneity by SDS-PAGE under non-reducing or reducing conditions using Coomassie blue or, preferably,
silver stain. Isolated polypeptide includes polypeptide in situ within recombinant cells, since at least one
component of the PRO polypeptide natural environment will not be present. Ordinarily, however, isolated
polypeptide will be prepared by at least one purification step.

An "isolated" PRO polypeptide-encoding nucleic acid or other polypeptide-encoding nucleic acid is a
nucleic acid molecule that is identified and separated from at least one contaminant nucleic acid molecule with
which it is ordinarily associated in the natural source of the polypeptide-encoding nucleic acid. An isolated
polypeptide-encoding nucleic acid molecule is other than in the form or setting in which it is found in nature.
Isolated polypeptide-encoding nucleic acid molecules therefore are distinguished from the specific
polypeptide-encoding nucleic acid molecule as it exists in natural cells. However, an isolated polypeptide-encoding
nucleic acid molecule includes polypeptide-encoding nucleic acid molecules contained in cells that ordinarily
express the polypeptide where, for example, the nucleic acid molecule is in a chromosomal location different from
that of natural cells.

The term "control sequences" refers to DNA sequences necessary for the expression of an operably
linked coding sequence in a particular host organism. The control sequences that are suitable for prokaryotes,
for example, include a promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells
are known to utilize promoters, polyadenylation signals, and enhancers.

Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic
acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a
polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or
enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome
binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation. Generally,
"operably linked" means that the DNA sequences being linked are contiguous, and, in the case of a secretory
leader, contiguous and in reading phase. However, enhancers do not have to be contiguous. Linking is
accomplished by ligation at convenient restriction sites. If such sites do not exist, synthetic oligonucleotide
adaptors or linkers are used in accordance with conventional practice.

The term "antibody" is used in the broadest sense and specifically covers, for example, single anti-PRO
monoclonal antibodies (including agonist, antagonist, and neutralizing antibodies), anti-PRO antibody
compositions with polyepitopic specificity, single chain anti-PRO antibodies, and fragments of anti-PRO
antibodies (see below). The term "monoclonal antibody" as used herein refers to an antibody obtained from a
population of substantially homogeneous antibodies, i.e., the individual antibodies comprising the population are
identical except for possible naturally-occurring mutations that may be present in minor amounts.

"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the art, and
generally is an empirical calculation dependent upon probe length, washing temperature, and salt concentration.
In general, longer probes require higher temperatures for proper annealing, while shorter probes need lower
temperatures. Hybridization generally depends on the ability of denatured DNA to reanneal when
complementary strands are present in an environment below their melting temperature. The higher the degree
of desired homology between the probe and hybridizable sequence, the higher the relative temperature which
can be used. As a result, it follows that higher relative temperatures would tend to make the reaction conditions
more stringent, while lower temperatures less so. For additional details and explanation of stringency of
hybridization reactions, see Ausubel et al., Current Protocols in Molecular Biology, Wiley Interscience
Publishers, (1995).

"Stringent conditions" or "high stringency conditions", as defined herein, may be identified by those
that: (1) employ low ionic strength and high temperature for washing, for example 0.015 M sodium
chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50°C; (2) employ during hybridization a
denaturing agent, such as formamide, for example, 50% (v/v) formamide with 0.1% bovine serum
albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM sodium
chloride, 75 mM sodium citrate at 42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M NaCl, 0.075 M
sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's solution,
sonicated salmon sperm DNA (50 µg/ml), 0.1% SDS, and 10% dextran sulfate at 42°C, with washes at 42°C
in 0.2 x SSC (sodium chloride/sodium citrate) and 50% formamide at 55°C, followed by a high-stringency wash
consisting of 0.1 x SSC containing EDTA at 55°C.

"Moderately stringent conditions" may be identified as described by Sambrook et al., Molecular
Cloning: A Laboratory Manual, New York: Cold Spring Harbor Press, 1989, and include the use of washing
solution and hybridization conditions (e.g., temperature, ionic strength and %SDS) less stringent than those
described above. An example of moderately stringent conditions is overnight incubation at 37°C in a solution
comprising: 20% formamide, 5 x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH
7.6), 5 x Denhardt's solution, 10% dextran sulfate, and 20 mg/ml denatured sheared salmon sperm DNA,
followed by washing the filters in 1 x SSC at about 37-50°C. The skilled artisan will recognize how to adjust
the temperature, ionic strength, etc. as necessary to accommodate factors such as probe length and the like.
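
The SSC dilutions named in the stringent-conditions definition scale linearly, which the following C sketch makes explicit. It assumes that 1 x SSC corresponds to 0.15 M sodium chloride and 0.015 M sodium citrate, the values implied by the "5 x SSC (0.75 M NaCl, 0.075 M sodium citrate)" figure and by the 0.015 M/0.0015 M low-ionic-strength wash. The names `struct ssc` and `ssc_at` are hypothetical and chosen only for this example.

```c
/* Molar salt concentrations of an SSC buffer at a given fold-strength.
 * Assumption: 1 x SSC = 0.15 M NaCl / 0.015 M sodium citrate, as implied
 * by the 5 x SSC figure in the stringent-conditions definition. */
struct ssc {
    double nacl_molar;     /* sodium chloride, mol/L */
    double citrate_molar;  /* sodium citrate, mol/L */
};

struct ssc ssc_at(double fold)
{
    struct ssc s;
    s.nacl_molar = 0.15 * fold;
    s.citrate_molar = 0.015 * fold;
    return s;
}
```

Under that assumption, `ssc_at(5.0)` corresponds to the hybridization buffer of condition (3), and `ssc_at(0.1)` to the salt concentrations of the wash in condition (1).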

The term "epitope tagged" when used herein refers to a chimeric polypeptide comprising a PRO
polypeptide fused to a "tag polypeptide". The tag polypeptide has enough residues to provide an epitope against
which an antibody can be made, yet is short enough such that it does not interfere with activity of the polypeptide
to which it is fused. The tag polypeptide preferably also is fairly unique so that the antibody does not
substantially cross-react with other epitopes. Suitable tag polypeptides generally have at least six amino acid
residues and usually between about 8 and 50 amino acid residues (preferably, between about 10 and 20 amino
acid residues).

As used herein, the term "immunoadhesin" designates antibody-like molecules which combine the
binding specificity of a heterologous protein (an "adhesin") with the effector functions of immunoglobulin
constant domains. Structurally, the immunoadhesins comprise a fusion of an amino acid sequence with the
desired binding specificity which is other than the antigen recognition and binding site of an antibody (i.e., is
"heterologous"), and an immunoglobulin constant domain sequence. The adhesin part of an immunoadhesin
molecule typically is a contiguous amino acid sequence comprising at least the binding site of a receptor or a
ligand. The immunoglobulin constant domain sequence in the immunoadhesin may be obtained from any
immunoglobulin, such as IgG-1, IgG-2, IgG-3, or IgG-4 subtypes, IgA (including IgA-1 and IgA-2), IgE, IgD
or IgM.

"Active" or "activity" for the purposes herein refers to form(s) of a PRO polypeptide which retain a
biological and/or an immunological activity of native or naturally-occurring PRO, wherein "biological" activity
refers to a biological function (either inhibitory or stimulatory) caused by a native or naturally-occurring PRO
other than the ability to induce the production of an antibody against an antigenic epitope possessed by a native
or naturally-occurring PRO and an "immunological" activity refers to the ability to induce the production of an
antibody against an antigenic epitope possessed by a native or naturally-occurring PRO.

The term "antagonist" is used in the broadest sense, and includes any molecule that partially or fully
blocks, inhibits, or neutralizes a biological activity of a native PRO polypeptide disclosed herein. In a similar
manner, the term "agonist" is used in the broadest sense and includes any molecule that mimics a biological
activity of a native PRO polypeptide disclosed herein. Suitable agonist or antagonist molecules specifically
include agonist or antagonist antibodies or antibody fragments, fragments or amino acid sequence variants of
native PRO polypeptides, peptides, antisense oligonucleotides, small organic molecules, etc. Methods for
identifying agonists or antagonists of a PRO polypeptide may comprise contacting a PRO polypeptide with a
candidate agonist or antagonist molecule and measuring a detectable change in one or more biological activities
normally associated with the PRO polypeptide.

"Treatment" refers to both therapeutic treatment and prophylactic or preventative measures, wherein
the object is to prevent or slow down (lessen) the targeted pathologic condition or disorder. Those in need of
treatment include those already with the disorder as well as those prone to have the disorder or those in whom
the disorder is to be prevented.

"Chronic" administration refers to administration of the agent(s) in a continuous mode as opposed to
an acute mode, so as to maintain the initial therapeutic effect (activity) for an extended period of time.
"Intermittent" administration is treatment that is not consecutively done without interruption, but rather is cyclic
in nature.

"Mammal" for purposes of treatment refers to any animal classified as a mammal, including humans,
domestic and farm animals, and zoo, sports, or pet animals, such as dogs, cats, cattle, horses, sheep, pigs, goats,
rabbits, etc. Preferably, the mammal is human.




Administration "in combination with" one or more further therapeutic agents includes simultaneous
(concurrent) and consecutive administration in any order.

"Carriers" as used herein include pharmaceutically acceptable carriers, excipients, or stabilizers which
are nontoxic to the cell or mammal being exposed thereto at the dosages and concentrations employed. Often
the physiologically acceptable carrier is an aqueous pH buffered solution. Examples of physiologically
acceptable carriers include buffers such as phosphate, citrate, and other organic acids; antioxidants including
ascorbic acid; low molecular weight (less than about 10 residues) polypeptide; proteins, such as serum albumin,
gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine,
glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides, and other carbohydrates including
glucose, mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or sorbitol;
salt-forming counterions such as sodium; and/or nonionic surfactants such as TWEEN™, polyethylene glycol (PEG),
and PLURONICS™.

"Antibody fragments" comprise a portion of an intact antibody, preferably the antigen binding or
variable region of the intact antibody. Examples of antibody fragments include Fab, Fab', F(ab')2, and Fv
fragments; diabodies; linear antibodies (Zapata et al., Protein Eng. 8(10): 1057-1062 [1995]); single-chain
antibody molecules; and multispecific antibodies formed from antibody fragments.

Papain digestion of antibodies produces two identical antigen-binding fragments, called "Fab"
fragments, each with a single antigen-binding site, and a residual "Fc" fragment, a designation reflecting the
ability to crystallize readily. Pepsin treatment yields an F(ab')2 fragment that has two antigen-combining sites
and is still capable of cross-linking antigen.

"Fv" is the minimum antibody fragment which contains a complete antigen-recognition and -binding
site. This region consists of a dimer of one heavy- and one light-chain variable domain in tight, non-covalent
association. It is in this configuration that the three CDRs of each variable domain interact to define an
antigen-binding site on the surface of the VH-VL dimer. Collectively, the six CDRs confer antigen-binding specificity
to the antibody. However, even a single variable domain (or half of an Fv comprising only three CDRs specific
for an antigen) has the ability to recognize and bind antigen, although at a lower affinity than the entire binding
site.

The Fab fragment also contains the constant domain of the light chain and the first constant domain
(CH1) of the heavy chain. Fab fragments differ from Fab' fragments by the addition of a few residues at the
carboxy terminus of the heavy chain CH1 domain including one or more cysteines from the antibody hinge
region. Fab'-SH is the designation herein for Fab' in which the cysteine residue(s) of the constant domains bear
a free thiol group. F(ab')2 antibody fragments originally were produced as pairs of Fab' fragments which have
hinge cysteines between them. Other chemical couplings of antibody fragments are also known.

The "light chains" of antibodies (immunoglobulins) from any vertebrate species can be assigned to one 
of two clearly distinct types, called kappa and lambda, based on the amino acid sequences of their constant 
domains. 

Depending on the amino acid sequence of the constant domain of their heavy chains, immunoglobulins
can be assigned to different classes. There are five major classes of immunoglobulins: IgA, IgD, IgE, IgG, and
IgM, and several of these may be further divided into subclasses (isotypes), e.g., IgG1, IgG2, IgG3, IgG4, IgA,
and IgA2.

"Single-chain Fv" or "sFv" antibody fragments comprise the VH and VL domains of antibody, wherein
these domains are present in a single polypeptide chain. Preferably, the Fv polypeptide further comprises a
polypeptide linker between the VH and VL domains which enables the sFv to form the desired structure for
antigen binding. For a review of sFv, see Pluckthun in The Pharmacology of Monoclonal Antibodies, vol. 113,
Rosenburg and Moore eds., Springer-Verlag, New York, pp. 269-315 (1994).

The term "diabodies" refers to small antibody fragments with two antigen-binding sites, which
fragments comprise a heavy-chain variable domain (VH) connected to a light-chain variable domain (VL) in the
same polypeptide chain (VH-VL). By using a linker that is too short to allow pairing between the two domains
on the same chain, the domains are forced to pair with the complementary domains of another chain and create
two antigen-binding sites. Diabodies are described more fully in, for example, EP 404,097; WO 93/11161; and
Hollinger et al., Proc. Natl. Acad. Sci. USA, 90:6444-6448 (1993).

An "isolated" antibody is one which has been identified and separated and/or recovered from a
component of its natural environment. Contaminant components of its natural environment are materials which
would interfere with diagnostic or therapeutic uses for the antibody, and may include enzymes, hormones, and
other proteinaceous or nonproteinaceous solutes. In preferred embodiments, the antibody will be purified (1)
to greater than 95% by weight of antibody as determined by the Lowry method, and most preferably more than
99% by weight, (2) to a degree sufficient to obtain at least 15 residues of N-terminal or internal amino acid
sequence by use of a spinning cup sequenator, or (3) to homogeneity by SDS-PAGE under reducing or
nonreducing conditions using Coomassie blue or, preferably, silver stain. Isolated antibody includes the antibody
in situ within recombinant cells since at least one component of the antibody's natural environment will not be
present. Ordinarily, however, isolated antibody will be prepared by at least one purification step.

An antibody that "specifically binds to" or is "specific for" a particular polypeptide or an epitope on
a particular polypeptide is one that binds to that particular polypeptide or epitope on a particular polypeptide
without substantially binding to any other polypeptide or polypeptide epitope.

The word "label" when used herein refers to a detectable compound or composition which is conjugated
directly or indirectly to the antibody so as to generate a "labeled" antibody. The label may be detectable by itself
(e.g., radioisotope labels or fluorescent labels) or, in the case of an enzymatic label, may catalyze chemical
alteration of a substrate compound or composition which is detectable.

By "solid phase" is meant a non-aqueous matrix to which the antibody of the present invention can
adhere. Examples of solid phases encompassed herein include those formed partially or entirely of glass (e.g.,
controlled pore glass), polysaccharides (e.g., agarose), polyacrylamides, polystyrene, polyvinyl alcohol and
silicones. In certain embodiments, depending on the context, the solid phase can comprise the well of an assay
plate; in others it is a purification column (e.g., an affinity chromatography column). This term also includes
a discontinuous solid phase of discrete particles, such as those described in U.S. Patent No. 4,275,149.

A "liposome" is a small vesicle composed of various types of lipids, phospholipids and/or surfactant
which is useful for delivery of a drug (such as a PRO polypeptide or antibody thereto) to a mammal. The
components of the liposome are commonly arranged in a bilayer formation, similar to the lipid arrangement of
biological membranes.

A "small molecule" is defined herein to have a molecular weight below about 500 Daltons. 

An "effective amount" of a polypeptide disclosed herein or an agonist or antagonist thereof is an amount 
sufficient to carry out a specifically stated purpose. An "effective amount" may be determined empirically and 
in a routine manner, in relation to the stated purpose. 






WO 01/40466                                                    PCT/US00/32678



/*
 * C-C increased from 12 to 15
 * Z is average of EQ
 * B is average of ND
 * match with stop is _M; stop-stop = 0; J (joker) match = 0
 */
#define _M      -8      /* value of a match with a stop */

int     _day[26][26] = {
/*        A  B  C  D  E  F  G  H  I  J  K  L  M  N  O  P  Q  R  S  T  U  V  W  X  Y  Z */
/* A */ { 2, 0,-2, 0, 0,-4, 1,-1,-1, 0,-1,-2,-1, 0,_M, 1, 0,-2, 1, 1, 0, 0,-6, 0,-3, 0},
/* B */ { 0, 3,-4, 3, 2,-5, 0, 1,-2, 0, 0,-3,-2, 2,_M,-1, 1, 0, 0, 0, 0,-2,-5, 0,-3, 1},
/* C */ {-2,-4,15,-5,-5,-4,-3,-3,-2, 0,-5,-6,-5,-4,_M,-3,-5,-4, 0,-2, 0,-2,-8, 0, 0,-5},
/* D */ { 0, 3,-5, 4, 3,-6, 1, 1,-2, 0, 0,-4,-3, 2,_M,-1, 2,-1, 0, 0, 0,-2,-7, 0,-4, 2},
/* E */ { 0, 2,-5, 3, 4,-5, 0, 1,-2, 0, 0,-3,-2, 1,_M,-1, 2,-1, 0, 0, 0,-2,-7, 0,-4, 3},
/* F */ {-4,-5,-4,-6,-5, 9,-5,-2, 1, 0,-5, 2, 0,-4,_M,-5,-5,-4,-3,-3, 0,-1, 0, 0, 7,-5},
/* G */ { 1, 0,-3, 1, 0,-5, 5,-2,-3, 0,-2,-4,-3, 0,_M,-1,-1,-3, 1, 0, 0,-1,-7, 0,-5, 0},
/* H */ {-1, 1,-3, 1, 1,-2,-2, 6,-2, 0, 0,-2,-2, 2,_M, 0, 3, 2,-1,-1, 0,-2,-3, 0, 0, 2},
/* I */ {-1,-2,-2,-2,-2, 1,-3,-2, 5, 0,-2, 2, 2,-2,_M,-2,-2,-2,-1, 0, 0, 4,-5, 0,-1,-2},
/* J */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,_M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0},
/* K */ {-1, 0,-5, 0, 0,-5,-2, 0,-2, 0, 5,-3, 0, 1,_M,-1, 1, 3, 0, 0, 0,-2,-3, 0,-4, 0},
/* L */ {-2,-3,-6,-4,-3, 2,-4,-2, 2, 0,-3, 6, 4,-3,_M,-3,-2,-3,-3,-1, 0, 2,-2, 0,-1,-2},
/* M */ {-1,-2,-5,-3,-2, 0,-3,-2, 2, 0, 0, 4, 6,-2,_M,-2,-1, 0,-2,-1, 0, 2,-4, 0,-2,-1},
/* N */ { 0, 2,-4, 2, 1,-4, 0, 2,-2, 0, 1,-3,-2, 2,_M,-1, 1, 0, 1, 0, 0,-2,-4, 0,-2, 1},
/* O */ {_M,_M,_M,_M,_M,_M,_M,_M,_M,_M,_M,_M,_M,_M, 0,_M,_M,_M,_M,_M,_M,_M,_M,_M,_M,_M},
/* P */ { 1,-1,-3,-1,-1,-5,-1, 0,-2, 0,-1,-3,-2,-1,_M, 6, 0, 0, 1, 0, 0,-1,-6, 0,-5, 0},
/* Q */ { 0, 1,-5, 2, 2,-5,-1, 3,-2, 0, 1,-2,-1, 1,_M, 0, 4, 1,-1,-1, 0,-2,-5, 0,-4, 3},
/* R */ {-2, 0,-4,-1,-1,-4,-3, 2,-2, 0, 3,-3, 0, 0,_M, 0, 1, 6, 0,-1, 0,-2, 2, 0,-4, 0},
/* S */ { 1, 0, 0, 0, 0,-3, 1,-1,-1, 0, 0,-3,-2, 1,_M, 1,-1, 0, 2, 1, 0,-1,-2, 0,-3, 0},
/* T */ { 1, 0,-2, 0, 0,-3, 0,-1, 0, 0, 0,-1,-1, 0,_M, 0,-1,-1, 1, 3, 0, 0,-5, 0,-3, 0},
/* U */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,_M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0},
/* V */ { 0,-2,-2,-2,-2,-1,-1,-2, 4, 0,-2, 2, 2,-2,_M,-1,-2,-2,-1, 0, 0, 4,-6, 0,-2,-2},
/* W */ {-6,-5,-8,-7,-7, 0,-7,-3,-5, 0,-3,-2,-4,-4,_M,-6,-5, 2,-2,-5, 0,-6,17, 0, 0,-6},
/* X */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,_M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0},
/* Y */ {-3,-3, 0,-4,-4, 7,-5, 0,-1, 0,-4,-1,-2,-2,_M,-5,-4,-4,-3,-3, 0,-2, 0, 0,10,-4},
/* Z */ { 0, 1,-5, 2, 3,-5, 0, 2,-2, 0, 0,-2,-1, 1,_M, 0, 3, 0, 0, 0, 0,-2,-6, 0,-4, 4}
};

/*
 */
#include <stdio.h>
#include <ctype.h>

#define MAXJMP  16      /* max jumps in a diag */
#define MAXGAP  24      /* don't continue to penalize gaps larger than this */
#define JMPS    1024    /* max jmps in a path */
#define MX      4       /* save if there's at least MX-1 bases since last jmp */

#define DMAT    3       /* value of matching bases */
#define DMIS    0       /* penalty for mismatched bases */
#define DINS0   8       /* penalty for a gap */
#define DINS1   1       /* penalty per base */
#define PINS0   8       /* penalty for a gap */
#define PINS1   4       /* penalty per residue */

struct jmp {
    short           n[MAXJMP];  /* size of jmp (neg for dely) */
    unsigned short  x[MAXJMP];  /* base no. of jmp in seq x */
};                              /* limits seq to 2^16 -1 */

struct diag {
    int         score;          /* score at last jmp */
    long        offset;         /* offset of prev block */
    short       ijmp;           /* current jmp index */
    struct jmp  jp;             /* list of jmps */
};

struct path {
    int     spc;                /* number of leading spaces */
    short   n[JMPS];            /* size of jmp (gap) */
    int     x[JMPS];            /* loc of jmp (last elem before gap) */
};

char        *ofile;             /* output file name */
char        *namex[2];          /* seq names: getseqs() */
char        *prog;              /* prog name for err msgs */
char        *seqx[2];           /* seqs: getseqs() */
int         dmax;               /* best diag: nw() */
int         dmax0;              /* final diag */
int         dna;                /* set if dna: main() */
int         endgaps;            /* set if penalizing end gaps */
int         gapx, gapy;         /* total gaps in seqs */
int         len0, len1;         /* seq lens */
int         ngapx, ngapy;       /* total size of gaps */
int         smax;               /* max score: nw() */
int         *xbm;               /* bitmap for matching */
long        offset;             /* current offset in jmp file */
struct diag *dx;                /* holds diagonals */
struct path pp[2];              /* holds path for seqs */

char        *calloc(), *malloc(), *index(), *strcpy();
char        *getseq(), *g_calloc();

Table 1 (cont')

/* Needleman-Wunsch alignment program
 *
 * usage: progs file1 file2
 *  where file1 and file2 are two dna or two protein sequences.
 *  The sequences can be in upper- or lower-case and may contain ambiguity
 *  Any lines beginning with ';', '>' or '<' are ignored
 *  Max file length is 65535 (limited by unsigned short x in the jmp struct)
 *  A sequence with 1/3 or more of its elements ACGTU is assumed to be DNA
 *  Output is in the file "align.out"
 *
 * The program may create a tmp file in /tmp to hold info about traceback.
 * Original version developed under BSD 4.3 on a vax 8650
 */
#include "nw.h"
#include "day.h"

static  _dbval[26] = {
    1,14,2,13,0,0,4,11,0,0,12,0,3,15,0,0,0,5,6,8,8,7,9,0,10,0
};

static  _pbval[26] = {
    1, 2|(1<<('D'-'A'))|(1<<('N'-'A')), 4, 8, 16, 32, 64,
    128, 256, 0xFFFFFFF, 1<<10, 1<<11, 1<<12, 1<<13, 1<<14,
    1<<15, 1<<16, 1<<17, 1<<18, 1<<19, 1<<20, 1<<21, 1<<22,
    1<<23, 1<<24, 1<<25|(1<<('E'-'A'))|(1<<('Q'-'A'))
};

main(ac, av)
    int     ac;
    char    *av[];
{
    prog = av[0];
    if (ac != 3) {
        fprintf(stderr, "usage: %s file1 file2\n", prog);
        fprintf(stderr, "where file1 and file2 are two dna or two protein sequences.\n");
        fprintf(stderr, "The sequences can be in upper- or lower-case\n");
        fprintf(stderr, "Any lines beginning with ';' or '<' are ignored\n");
        fprintf(stderr, "Output is in the file \"align.out\"\n");
        exit(1);
    }
    namex[0] = av[1];
    namex[1] = av[2];
    seqx[0] = getseq(namex[0], &len0);
    seqx[1] = getseq(namex[1], &len1);
    xbm = (dna)? _dbval : _pbval;

    endgaps = 0;            /* 1 to penalize endgaps */
    ofile = "align.out";    /* output file */

    nw();                   /* fill in the matrix, get the possible jmps */
    readjmps();             /* get the actual jmps */
    print();                /* print stats, alignment */

    cleanup(0);             /* unlink any tmp files */
}
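
The `_dbval` table encodes each nucleotide letter as a set of bits, so that two letters match whenever their bit sets intersect, which is what the test `xbm[*px-'A']&xbm[*py-'A']` in the alignment routine relies on. The following C sketch isolates that idea for DNA. The table values are copied from `_dbval`; the helper name `bases_match` and its range check are assumptions added for the example and are not part of the listing.

```c
/* Nucleotide codes as 4-bit sets (A=1, C=2, G=4, T/U=8), indexed by
 * letter - 'A'. Values copied from the _dbval table in the listing;
 * e.g. N = 15 (any base), R = 5 (A or G), Y = 10 (C or T). */
static const int dbval[26] = {
    1,14,2,13,0,0,4,11,0,0,12,0,3,15,0,0,0,5,6,8,8,7,9,0,10,0
};

/* Returns 1 when the upper-case codes a and b share at least one base,
 * 0 otherwise. bases_match is a hypothetical helper for illustration;
 * the range check is an addition the listing does not make. */
int bases_match(char a, char b)
{
    if (a < 'A' || a > 'Z' || b < 'A' || b > 'Z')
        return 0;
    return (dbval[a - 'A'] & dbval[b - 'A']) != 0;
}
```

With this encoding an ambiguity code such as N matches every base, R matches A and G but not C, and T and U are treated as the same base.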

Table 1 (cont')

/* do the alignment, return best score: main()
 * dna: values in Fitch and Smith, PNAS, 80, 1382-1386, 1983
 * pro: PAM 250 values
 * When scores are equal, we prefer mismatches to any gap, prefer
 * a new gap to extending an ongoing gap, and prefer a gap in seqx
 * to a gap in seq y.
 */
nw()
{
    char        *px, *py;           /* seqs and ptrs */
    int         *ndely, *dely;      /* keep track of dely */
    int         ndelx, delx;        /* keep track of delx */
    int         *tmp;               /* for swapping row0, row1 */
    int         mis;                /* score for each type */
    int         ins0, ins1;         /* insertion penalties */
    register    id;                 /* diagonal index */
    register    ij;                 /* jmp index */
    register    *col0, *col1;       /* score for curr, last row */
    register    xx, yy;             /* index into seqs */

    dx = (struct diag *)g_calloc("to get diags", len0+len1+1, sizeof(struct diag));

    ndely = (int *)g_calloc("to get ndely", len1+1, sizeof(int));
    dely = (int *)g_calloc("to get dely", len1+1, sizeof(int));
    col0 = (int *)g_calloc("to get col0", len1+1, sizeof(int));
    col1 = (int *)g_calloc("to get col1", len1+1, sizeof(int));
    ins0 = (dna)? DINS0 : PINS0;
    ins1 = (dna)? DINS1 : PINS1;

    smax = -10000;
    if (endgaps) {
        for (col0[0] = dely[0] = -ins0, yy = 1; yy <= len1; yy++) {
            col0[yy] = dely[yy] = col0[yy-1] - ins1;
            ndely[yy] = yy;
        }
        col0[0] = 0;    /* Waterman Bull Math Biol 84 */
    }
    else
        for (yy = 1; yy <= len1; yy++)
            dely[yy] = -ins0;

    /* fill in match matrix
     */
    for (px = seqx[0], xx = 1; xx <= len0; px++, xx++) {
        /* initialize first entry in col
         */
        if (endgaps) {
            if (xx == 1)
                col1[0] = delx = -(ins0+ins1);
            else
                col1[0] = delx = col0[0] - ins1;
            ndelx = xx;
        }
        else {
            col1[0] = 0;
            delx = -ins0;
            ndelx = 0;
        }

Table 1 (cont')

        for (py = seqx[1], yy = 1; yy <= len1; py++, yy++) {
            mis = col0[yy-1];
            if (dna)
                mis += (xbm[*px-'A']&xbm[*py-'A'])? DMAT : DMIS;
            else
                mis += _day[*px-'A'][*py-'A'];

            /* update penalty for del in x seq;
             * favor new del over ongoing del
             * ignore MAXGAP if weighting endgaps
             */
            if (endgaps || ndely[yy] < MAXGAP) {
                if (col0[yy] - ins0 >= dely[yy]) {
                    dely[yy] = col0[yy] - (ins0+ins1);
                    ndely[yy] = 1;
                } else {
                    dely[yy] -= ins1;
                    ndely[yy]++;
                }
            } else {
                if (col0[yy] - (ins0+ins1) >= dely[yy]) {
                    dely[yy] = col0[yy] - (ins0+ins1);
                    ndely[yy] = 1;
                } else
                    ndely[yy]++;
            }

            /* update penalty for del in y seq;
             * favor new del over ongoing del
             */
            if (endgaps || ndelx < MAXGAP) {
                if (col1[yy-1] - ins0 >= delx) {
                    delx = col1[yy-1] - (ins0+ins1);
                    ndelx = 1;
                } else {
                    delx -= ins1;
                    ndelx++;
                }
            } else {
                if (col1[yy-1] - (ins0+ins1) >= delx) {
                    delx = col1[yy-1] - (ins0+ins1);
                    ndelx = 1;
                } else
                    ndelx++;
            }

            /* pick the maximum score; we're favoring
             * mis over any del and delx over dely
             */

Table 1 (cont')

            id = xx - yy + len1 - 1;
            if (mis >= delx && mis >= dely[yy])
                col1[yy] = mis;
            else if (delx >= dely[yy]) {
                col1[yy] = delx;
                ij = dx[id].ijmp;
                if (dx[id].jp.n[0] && (!dna || (ndelx >= MAXJMP
                && xx > dx[id].jp.x[ij]+MX) || mis > dx[id].score+DINS0)) {
                    dx[id].ijmp++;
                    if (++ij >= MAXJMP) {
                        writejmps(id);
                        ij = dx[id].ijmp = 0;
                        dx[id].offset = offset;
                        offset += sizeof(struct jmp) + sizeof(offset);
                    }
                }
                dx[id].jp.n[ij] = ndelx;
                dx[id].jp.x[ij] = xx;
                dx[id].score = delx;
            }
            else {
                col1[yy] = dely[yy];
                ij = dx[id].ijmp;
                if (dx[id].jp.n[0] && (!dna || (ndely[yy] >= MAXJMP
                && xx > dx[id].jp.x[ij]+MX) || mis > dx[id].score+DINS0)) {
                    dx[id].ijmp++;
                    if (++ij >= MAXJMP) {
                        writejmps(id);
                        ij = dx[id].ijmp = 0;
                        dx[id].offset = offset;
                        offset += sizeof(struct jmp) + sizeof(offset);
                    }
                }
                dx[id].jp.n[ij] = -ndely[yy];
                dx[id].jp.x[ij] = xx;
                dx[id].score = dely[yy];
            }
            if (xx == len0 && yy < len1) {
                /* last col
                 */
                if (endgaps)
                    col1[yy] -= ins0+ins1*(len1-yy);
                if (col1[yy] > smax) {
                    smax = col1[yy];
                    dmax = id;
                }
            }
        }
        if (endgaps && xx < len0)
            col1[yy-1] -= ins0+ins1*(len0-xx);
        if (col1[yy-1] > smax) {
            smax = col1[yy-1];
            dmax = id;
        }
        tmp = col0; col0 = col1; col1 = tmp;
    }
    (void) free((char *)ndely);
    (void) free((char *)dely);
    (void) free((char *)col0);
    (void) free((char *)col1);
}
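
The gap bookkeeping inside the alignment routine follows one small rule: opening a gap costs the open penalty plus one per-element penalty, extending costs one more per-element penalty, and a tie goes to the new gap. The sketch below restates that rule, together with the diagonal index, as two standalone C functions. The names `struct gap`, `update_gap` and `diag_index` are hypothetical and introduced only for this illustration, and the `MAXGAP` cap applied by the listing when end gaps are not penalized is deliberately left out.

```c
/* Running state of a gap: best score of a path ending in the gap,
 * and the current gap length. */
struct gap {
    int score;
    int len;
};

/* One update step of the affine gap rule.
 *   prev: score of the cell a new gap would open from
 *   ins0: gap-open penalty, ins1: per-element penalty
 * The >= comparison makes ties go to the new gap, as the listing's
 * "favor new del over ongoing del" comment describes. The MAXGAP
 * branch of the listing is omitted from this sketch. */
struct gap update_gap(struct gap g, int prev, int ins0, int ins1)
{
    if (prev - ins0 >= g.score) {
        g.score = prev - (ins0 + ins1);  /* open a fresh gap of length 1 */
        g.len = 1;
    } else {
        g.score -= ins1;                 /* extend the existing gap */
        g.len++;
    }
    return g;
}

/* Diagonal index as computed in the listing: id = xx - yy + len1 - 1,
 * which maps the len0 + len1 - 1 diagonals onto 0 .. len0 + len1 - 2. */
int diag_index(int xx, int yy, int len1)
{
    return xx - yy + len1 - 1;
}
```

With the DNA penalties of the header (open 8, extend 1), a gap opened from a cell scoring 10 starts at score 1 and drops to 0 when extended by one element, unless a better opening cell appears first.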

Table 1 (cont')

/* print() - only routine visible outside this module
 *
 * static:
 * getmat() - trace back best path, count matches: print()
 * pr_align() - print alignment described in array p[]: print()
 * dumpblock() - dump a block of lines with numbers, stars: pr_align()
 * nums() - put out a number line: dumpblock()
 * putline() - put out a line (name, [num], seq, [num]): dumpblock()
 * stars() - put a line of stars: dumpblock()
 * stripname() - strip any path and prefix from a seqname
 */

#include "nw.h"

#define SPC     3
#define P_LINE  256     /* maximum output line */
#define P_SPC   3       /* space between name or num and seq */

extern  _day[26][26];
int     olen;           /* set output line length */
FILE    *fx;            /* output file */

print()
{
    int     lx, ly, firstgap, lastgap;  /* overlap */

    if ((fx = fopen(ofile, "w")) == 0) {
        fprintf(stderr, "%s: can't write %s\n", prog, ofile);
        cleanup(1);
    }
    fprintf(fx, "<first sequence: %s (length = %d)\n", namex[0], len0);
    fprintf(fx, "<second sequence: %s (length = %d)\n", namex[1], len1);
    olen = 60;
    lx = len0;
    ly = len1;
    firstgap = lastgap = 0;
    if (dmax < len1 - 1) {          /* leading gap in x */
        pp[0].spc = firstgap = len1 - dmax - 1;
        ly -= pp[0].spc;
    }
    else if (dmax > len1 - 1) {     /* leading gap in y */
        pp[1].spc = firstgap = dmax - (len1 - 1);
        lx -= pp[1].spc;
    }
    if (dmax0 < len0 - 1) {         /* trailing gap in x */
        lastgap = len0 - dmax0 - 1;
        lx -= lastgap;
    }
    else if (dmax0 > len0 - 1) {    /* trailing gap in y */
        lastgap = dmax0 - (len0 - 1);
        ly -= lastgap;
    }
    getmat(lx, ly, firstgap, lastgap);
    pr_align();
}

Table 1 (cont')

/*
 * trace back the best path, count matches
 */
static
getmat(lx, ly, firstgap, lastgap)
    int     lx, ly;                 /* "core" (minus endgaps) */
    int     firstgap, lastgap;      /* leading trailing overlap */
{
    int             nm, i0, i1, siz0, siz1;
    char            outx[32];
    double          pct;
    register        n0, n1;
    register char   *p0, *p1;

    /* get total matches, score
     */
    i0 = i1 = siz0 = siz1 = 0;
    p0 = seqx[0] + pp[1].spc;
    p1 = seqx[1] + pp[0].spc;
    n0 = pp[1].spc + 1;
    n1 = pp[0].spc + 1;

    nm = 0;
    while (*p0 && *p1) {
        if (siz0) {
            p1++;
            n1++;
            siz0--;
        }
        else if (siz1) {
            p0++;
            n0++;
            siz1--;
        }
        else {
            if (xbm[*p0-'A']&xbm[*p1-'A'])
                nm++;
            if (n0++ == pp[0].x[i0])
                siz0 = pp[0].n[i0++];
            if (n1++ == pp[1].x[i1])
                siz1 = pp[1].n[i1++];
            p0++;
            p1++;
        }
    }

    /* pct homology:
     * if penalizing endgaps, base is the shorter seq
     * else, knock off overhangs and take shorter core
     */
    if (endgaps)
        lx = (len0 < len1)? len0 : len1;
    else
        lx = (lx < ly)? lx : ly;
    pct = 100.*(double)nm/(double)lx;
    fprintf(fx, "\n");
    fprintf(fx, "<%d match%s in an overlap of %d: %.2f percent similarity\n",
        nm, (nm == 1)? "" : "es", lx, pct);
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fiprintf(fx. • < gaps in first sequence; %d". gapx); opHtiaf 

if(gapx){ ...gemiat 

(void) sprintf(outx. " (%d %s%s)", 

ngapx, (dna)? "base": "residue", (ngapx 1)? "":-s"); 
fpriiitf(fx." %s", ouu); 

fprintf(fjt, ". gaps In second sequence: %d", gapy); 
if (gapy) { 

(void) sprintf(outx, " (%d %s%s)", 

ngapy. (dna)? "base": "residue", (ngapy == 1)? -;"s")- 

^rimf(6c/%s", ouix); 

} 

if (dna) 

•\n< score: %d (match = %d. mismatch = %d. gap penalty = %d + 9&d per base)\n" 
smax. DMAT. DMIS, DINSO. DINSl); 

else 

fprintf(fx. 

•\n<score: %d (Dayhoff PAM 250 matrix, gap penalty = %d + %d per fesidue)\n" 
smax. PINSO, PINSl); 
if (endgaps) 

^rintf(fx. 

•< endgaps penalized, leftendgap: %d %s%s, right endgap: %d %s%s\n". 
firstgap, (dna)? "base" : "residue", (firstgap 1)7 "" : "s", 
lastgap. (dna)? "base" : "residue", (lastgap == l)? - : "s"); 



else 



} 



fprintfCfx, " <eQdgaps not penalized\n"); 



static nm;                      /* matches in core -- for checking */
static lmax;                    /* lengths of stripped file names */
static ij[2];                   /* jmp index for a path */
static nc[2];                   /* number at start of current line */
static ni[2];                   /* current elem number -- for gapping */
static siz[2];
static char *ps[2];             /* ptr to current element */
static char *po[2];             /* ptr to next output char slot */
static char out[2][P_LINE];     /* output line */
static char star[P_LINE];       /* set by stars() */

/*
 * print alignment of described in struct path pp[]
 */
static
pr_align()
{
    int nn;                     /* char count */
    int more;
    register i;

    for (i = 0, lmax = 0; i < 2; i++) {
        nn = stripname(namex[i]);
        if (nn > lmax)
            lmax = nn;

        nc[i] = 1;
        ni[i] = 1;
        siz[i] = ij[i] = 0;
        ps[i] = seqx[i];
        po[i] = out[i];
    }

    for (nn = nm = 0, more = 1; more; ) {
        for (i = more = 0; i < 2; i++) {
            /*
             * do we have more of this sequence?
             */
            if (!*ps[i])
                continue;

            more++;

            if (pp[i].spc) {    /* leading space */
                *po[i]++ = ' ';
                pp[i].spc--;
            }
            else if (siz[i]) {  /* in a gap */
                *po[i]++ = '-';
                siz[i]--;
            }
            else {              /* we're putting a seq element
                                 */
                *po[i] = *ps[i];
                if (islower(*ps[i]))
                    *ps[i] = toupper(*ps[i]);
                po[i]++;
                ps[i]++;

                /*
                 * are we at next gap for this seq?
                 */
                if (ni[i] == pp[i].x[ij[i]]) {
                    /*
                     * we need to merge all gaps
                     * at this location
                     */
                    siz[i] = pp[i].n[ij[i]++];
                    while (ni[i] == pp[i].x[ij[i]])
                        siz[i] += pp[i].n[ij[i]++];
                }
                ni[i]++;
            }
        }
        if (++nn == olen || !more && nn) {
            dumpblock();
            for (i = 0; i < 2; i++)
                po[i] = out[i];
            nn = 0;
        }
    }
}

/*
 * dump a block of lines, including numbers, stars: pr_align()
 */
static
dumpblock()
{
    register i;

    for (i = 0; i < 2; i++)
        *po[i]-- = '\0';

    (void) putc('\n', fx);
    for (i = 0; i < 2; i++) {
        if (*out[i] && (*out[i] != ' ' || *(po[i]) != ' ')) {
            if (i == 0)
                nums(i);
            if (i == 0 && *out[1])
                stars();
            putline(i);
            if (i == 0 && *out[1])
                fprintf(fx, star);
            if (i == 1)
                nums(i);
        }
    }
}

/*
 * put out a number line: dumpblock()
 */
static
nums(ix)
    int ix;                     /* index in out[] holding seq line */
{
    char nline[P_LINE];
    register i, j;
    register char *pn, *px, *py;

    for (pn = nline, i = 0; i < lmax+P_SPC; i++, pn++)
        *pn = ' ';
    for (i = nc[ix], py = out[ix]; *py; py++, pn++) {
        if (*py == ' ' || *py == '-')
            *pn = ' ';
        else {
            if (i%10 == 0 || (i == 1 && nc[ix] != 1)) {
                j = (i < 0)? -i : i;
                for (px = pn; j; j /= 10, px--)
                    *px = j%10 + '0';
                if (i < 0)
                    *px = '-';
            }
            else
                *pn = ' ';
            i++;
        }
    }
    *pn = '\0';
    nc[ix] = i;
    for (pn = nline; *pn; pn++)
        (void) putc(*pn, fx);
    (void) putc('\n', fx);
}



/*
 * put out a line (name, [num], seq, [num]): dumpblock()
 */
static
putline(ix)
    int ix;
{
    int i;
    register char *px;

    for (px = namex[ix], i = 0; *px && *px != ':'; px++, i++)
        (void) putc(*px, fx);
    for (; i < lmax+P_SPC; i++)
        (void) putc(' ', fx);

    /* these count from 1:
     * ni[] is current element (from 1)
     * nc[] is number at start of current line
     */
    for (px = out[ix]; *px; px++)
        (void) putc(*px&0x7F, fx);
    (void) putc('\n', fx);
}

/*
 * put a line of stars (seqs always in out[0], out[1]): dumpblock()
 */
static
stars()
{
    int i;
    register char *p0, *p1, cx, *px;

    if (!*out[0] || (*out[0] == ' ' && *(po[0]) == ' ') ||
        !*out[1] || (*out[1] == ' ' && *(po[1]) == ' '))
        return;
    px = star;
    for (i = lmax+P_SPC; i; i--)
        *px++ = ' ';

    for (p0 = out[0], p1 = out[1]; *p0 && *p1; p0++, p1++) {
        if (isalpha(*p0) && isalpha(*p1)) {
            if (xbm[*p0-'A']&xbm[*p1-'A']) {
                cx = '*';
                nm++;
            }
            else if (!dna && _day[*p0-'A'][*p1-'A'] > 0)
                cx = '.';
            else
                cx = ' ';
        }
        else
            cx = ' ';
        *px++ = cx;
    }
    *px++ = '\n';
    *px = '\0';
}

/*
 * strip path or prefix from pn, return len: pr_align()
 */
static
stripname(pn)
    char *pn;                   /* file name (may be path) */
{
    register char *px, *py;

    py = 0;
    for (px = pn; *px; px++)
        if (*px == '/')
            py = px + 1;
    if (py)
        (void) strcpy(pn, py);
    return(strlen(pn));
}

/*
 * cleanup() -- cleanup any tmp file
 * getseq() -- read in seq, set dna, len, maxlen
 * g_calloc() -- calloc() with error checkin
 * readjmps() -- get the good jmps, from tmp file if necessary
 * writejmps() -- write a filled array of jmps to a tmp file: nw()
 */

#include "nw.h"
#include <sys/file.h>

char *jname = "/tmp/homgXXXXXX";        /* tmp file for jmps */
FILE *fj;

int cleanup();                          /* cleanup tmp file */
long lseek();

/*
 * remove any tmp file if we blow
 */
cleanup(i)
    int i;
{
    if (fj)
        (void) unlink(jname);
    exit(i);
}

/*
 * read, return ptr to seq, set dna, len, maxlen
 * skip lines starting with ';', '<', or '>'
 * seq in upper or lower case
 */
char *
getseq(file, len)
    char *file;                 /* file name */
    int *len;                   /* seq len */
{
    char line[1024], *pseq;
    register char *px, *py;
    int natgc, tlen;
    FILE *fp;

    if ((fp = fopen(file, "r")) == 0) {
        fprintf(stderr, "%s: can't read %s\n", prog, file);
        exit(1);
    }
    tlen = natgc = 0;
    while (fgets(line, 1024, fp)) {
        if (*line == ';' || *line == '<' || *line == '>')
            continue;
        for (px = line; *px != '\n'; px++)
            if (isupper(*px) || islower(*px))
                tlen++;
    }
    if ((pseq = malloc((unsigned)(tlen+6))) == 0) {
        fprintf(stderr, "%s: malloc() failed to get %d bytes for %s\n", prog, tlen+6, file);
        exit(1);
    }
    pseq[0] = pseq[1] = pseq[2] = pseq[3] = '\0';

    py = pseq + 4;
    *len = tlen;
    rewind(fp);

    while (fgets(line, 1024, fp)) {
        if (*line == ';' || *line == '<' || *line == '>')
            continue;
        for (px = line; *px != '\n'; px++) {
            if (isupper(*px))
                *py++ = *px;
            else if (islower(*px))
                *py++ = toupper(*px);
            if (index("ATGCU", *(py-1)))
                natgc++;
        }
    }
    *py++ = '\0';
    *py = '\0';
    (void) fclose(fp);
    dna = natgc > (tlen/3);
    return(pseq+4);
}



char *
g_calloc(msg, nx, sz)
    char *msg;                  /* program, calling routine */
    int nx, sz;                 /* number and size of elements */
{
    char *px, *calloc();

    if ((px = calloc((unsigned)nx, (unsigned)sz)) == 0) {
        if (*msg) {
            fprintf(stderr, "%s: g_calloc() failed %s (n=%d, sz=%d)\n", prog, msg, nx, sz);
            exit(1);
        }
    }
    return(px);
}

/*
 * get final jmps from dx[] or tmp file, set pp[], reset dmax: main()
 */
readjmps()
{
    int fd = -1;
    int siz, i0, i1;
    register i, j, xx;

    if (fj) {
        (void) fclose(fj);
        if ((fd = open(jname, O_RDONLY, 0)) < 0) {
            fprintf(stderr, "%s: can't open() %s\n", prog, jname);
            cleanup(1);
        }
    }
    for (i = i0 = i1 = 0, dmax0 = dmax, xx = len0; ; i++) {
        while (1) {
            for (j = dx[dmax].ijmp; j >= 0 && dx[dmax].jp.x[j] >= xx; j--)
                ;
            if (j < 0 && dx[dmax].offset && fj) {
                (void) lseek(fd, dx[dmax].offset, 0);
                (void) read(fd, (char *)&dx[dmax].jp, sizeof(struct jmp));
                (void) read(fd, (char *)&dx[dmax].offset, sizeof(dx[dmax].offset));
                dx[dmax].ijmp = MAXJMP-1;
            }
            else
                break;
        }
        if (i >= JMPS) {
            fprintf(stderr, "%s: too many gaps in alignment\n", prog);
            cleanup(1);
        }
        if (j >= 0) {
            siz = dx[dmax].jp.n[j];
            xx = dx[dmax].jp.x[j];
            dmax += siz;
            if (siz < 0) {              /* gap in second seq */
                pp[1].n[i1] = -siz;
                xx += siz;
                /* id = xx - yy + len1 - 1
                 */
                pp[1].x[i1] = xx - dmax + len1 - 1;
                gapy++;
                ngapy -= siz;
                /* ignore MAXGAP when doing endgaps */
                siz = (-siz < MAXGAP || endgaps)? -siz : MAXGAP;
                i1++;
            }
            else if (siz > 0) {         /* gap in first seq */
                pp[0].n[i0] = siz;
                pp[0].x[i0] = xx;
                gapx++;
                ngapx += siz;
                /* ignore MAXGAP when doing endgaps */
                siz = (siz < MAXGAP || endgaps)? siz : MAXGAP;
                i0++;
            }
        }
        else
            break;
    }

    /* reverse the order of jmps
     */
    for (j = 0, i0--; j < i0; j++, i0--) {
        i = pp[0].n[j]; pp[0].n[j] = pp[0].n[i0]; pp[0].n[i0] = i;
        i = pp[0].x[j]; pp[0].x[j] = pp[0].x[i0]; pp[0].x[i0] = i;
    }
    for (j = 0, i1--; j < i1; j++, i1--) {
        i = pp[1].n[j]; pp[1].n[j] = pp[1].n[i1]; pp[1].n[i1] = i;
        i = pp[1].x[j]; pp[1].x[j] = pp[1].x[i1]; pp[1].x[i1] = i;
    }
    if (fd >= 0)
        (void) close(fd);
    if (fj) {
        (void) unlink(jname);
        fj = 0;
        offset = 0;
    }
}

/*
 * write a filled jmp struct offset of the prev one (if any): nw()
 */
writejmps(ix)
    int ix;
{
    char *mktemp();

    if (!fj) {
        if (mktemp(jname) < 0) {
            fprintf(stderr, "%s: can't mktemp() %s\n", prog, jname);
            cleanup(1);
        }
        if ((fj = fopen(jname, "w")) == 0) {
            fprintf(stderr, "%s: can't write %s\n", prog, jname);
            exit(1);
        }
    }
    (void) fwrite((char *)&dx[ix].jp, sizeof(struct jmp), 1, fj);
    (void) fwrite((char *)&dx[ix].offset, sizeof(dx[ix].offset), 1, fj);
}

Table 2

PRO                   XXXXXXXXXXXXXXX    (Length = 15 amino acids)
Comparison Protein    XXXXXYYYYYYY       (Length = 12 amino acids)

% amino acid sequence identity =

(the number of identically matching amino acid residues between the two polypeptide sequences as determined
by ALIGN-2) divided by (the total number of amino acid residues of the PRO polypeptide) =

5 divided by 15 = 33.3%

Table 3

PRO                   XXXXXXXXXX         (Length = 10 amino acids)
Comparison Protein    XXXXXYYYYYYZZYZ    (Length = 15 amino acids)

% amino acid sequence identity =

(the number of identically matching amino acid residues between the two polypeptide sequences as determined
by ALIGN-2) divided by (the total number of amino acid residues of the PRO polypeptide) =

5 divided by 10 = 50%
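
The arithmetic of Tables 2 and 3 can be sketched in a few lines of C. This is an illustration only: the function name `percent_identity` is hypothetical and is not part of the ALIGN-2 listing, and the match count is assumed to have been obtained already from the alignment program.

```c
#include <assert.h>

/* Percent sequence identity as defined in Tables 2 and 3:
 * (identically matching residues) / (total residues of the PRO polypeptide) * 100.
 * "matches" is assumed to come from the alignment program; it is not computed here. */
double percent_identity(int matches, int pro_len)
{
    assert(pro_len > 0);                        /* the PRO sequence must be non-empty */
    assert(matches >= 0 && matches <= pro_len); /* cannot match more residues than exist */
    return 100.0 * (double)matches / (double)pro_len;
}
```

With the values of Table 2 the call would be `percent_identity(5, 15)`, and with those of Table 3 `percent_identity(5, 10)`. Note that the denominator is always the length of the PRO sequence, not that of the comparison protein.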

Table 4

PRO-DNA           NNNNNNNNNNNNNN      (Length = 14 nucleotides)
Comparison DNA    NNNNNNLLLLLLLLLL    (Length = 16 nucleotides)

% nucleic acid sequence identity =

(the number of identically matching nucleotides between the two nucleic acid sequences as determined by
ALIGN-2) divided by (the total number of nucleotides of the PRO-DNA nucleic acid sequence) =

6 divided by 14 = 42.9%

Table 5

PRO-DNA           NNNNNNNNNNNN    (Length = 12 nucleotides)
Comparison DNA    NNNNLLLVV       (Length = 9 nucleotides)

% nucleic acid sequence identity =

(the number of identically matching nucleotides between the two nucleic acid sequences as determined by
ALIGN-2) divided by (the total number of nucleotides of the PRO-DNA nucleic acid sequence) =

4 divided by 12 = 33.3%
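
The counts used in Tables 4 and 5 can be reproduced with the following sketch. It assumes, purely for illustration, that the two hypothetical sequences are compared position by position from their first symbol with no gaps; it does not reproduce the ALIGN-2 alignment itself. The function names are hypothetical.

```c
#include <assert.h>
#include <string.h>

/* Count positions at which two left-aligned, ungapped sequences carry the same symbol.
 * Comparison stops at the end of the shorter sequence. */
int count_identical(const char *a, const char *b)
{
    int n = 0;

    assert(a != NULL && b != NULL);
    while (*a && *b) {
        if (*a == *b)
            n++;
        a++;
        b++;
    }
    return n;
}

/* Percent identity relative to the length of the PRO-DNA sequence (first argument). */
double percent_identity_ungapped(const char *pro_dna, const char *comparison)
{
    size_t pro_len;

    assert(pro_dna != NULL && comparison != NULL);
    pro_len = strlen(pro_dna);
    assert(pro_len > 0);
    return 100.0 * (double)count_identical(pro_dna, comparison) / (double)pro_len;
}
```

Applied to the placeholder sequences of Table 4, `count_identical` yields 6 matches over a PRO-DNA length of 14; applied to those of Table 5 it yields 4 matches over a length of 12.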

Compositions and Methods of the Invention

A. Full-Length PRO Polypeptides

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides
referred to in the present application as PRO polypeptides. In particular, cDNAs encoding various PRO
polypeptides have been identified and isolated, as disclosed in further detail in the Examples below. It is noted
that proteins produced in separate expression rounds may be given different PRO numbers but the UNQ number
is unique for any given DNA and the encoded protein, and will not be changed. However, for sake of
simplicity, in the present specification the protein encoded by the full length native nucleic acid molecules
disclosed herein as well as all further native homologues and variants included in the foregoing definition of
PRO, will be referred to as "PRO/number", regardless of their origin or mode of preparation.

As disclosed in the Examples below, various cDNA clones have been deposited with the ATCC. The
actual nucleotide sequences of those clones can readily be determined by the skilled artisan by sequencing of the
deposited clone using routine methods in the art. The predicted amino acid sequence can be determined from
the nucleotide sequence using routine skill. For the PRO polypeptides and encoding nucleic acids described
herein, Applicants have identified what is believed to be the reading frame best identifiable with the sequence
information available at the time.

B. PRO Polypeptide Variants

In addition to the full-length native sequence PRO polypeptides described herein, it is contemplated that
PRO variants can be prepared. PRO variants can be prepared by introducing appropriate nucleotide changes into
the PRO DNA, and/or by synthesis of the desired PRO polypeptide. Those skilled in the art will appreciate that
amino acid changes may alter post-translational processes of the PRO, such as changing the number or position
of glycosylation sites or altering the membrane anchoring characteristics.

Variations in the native full-length sequence PRO or in various domains of the PRO described herein,
can be made, for example, using any of the techniques and guidelines for conservative and non-conservative
mutations set forth, for instance, in U.S. Patent No. 5,364,934. Variations may be a substitution, deletion or
insertion of one or more codons encoding the PRO that results in a change in the amino acid sequence of the
PRO as compared with the native sequence PRO. Optionally the variation is by substitution of at least one amino
acid with any other amino acid in one or more of the domains of the PRO. Guidance in determining which
amino acid residue may be inserted, substituted or deleted without adversely affecting the desired activity may
be found by comparing the sequence of the PRO with that of homologous known protein molecules and
minimizing the number of amino acid sequence changes made in regions of high homology. Amino acid
substitutions can be the result of replacing one amino acid with another amino acid having similar structural
and/or chemical properties, such as the replacement of a leucine with a serine, i.e., conservative amino acid
replacements. Insertions or deletions may optionally be in the range of about 1 to 5 amino acids. The variation
allowed may be determined by systematically making insertions, deletions or substitutions of amino acids in the
sequence and testing the resulting variants for activity exhibited by the full-length or mature native sequence.
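
As a minimal sketch of how such systematic deletions might be enumerated in silico before any variant is made and tested, the following C function removes a short internal segment from a one-letter amino acid string. The function name, the buffer-based interface, and the hard limit of 5 residues (taken from the "about 1 to 5 amino acids" range above) are assumptions made for illustration.

```c
#include <assert.h>
#include <string.h>

/* Write to "out" a copy of "seq" with del_len residues removed starting at index
 * "start" (0-based). Returns 0 on success, -1 if the request is out of range,
 * outside the illustrative 1..5 residue window, or does not fit in "out". */
int delete_segment(const char *seq, size_t start, size_t del_len,
                   char *out, size_t out_size)
{
    size_t len;

    assert(seq != NULL && out != NULL);
    len = strlen(seq);
    if (del_len < 1 || del_len > 5)
        return -1;
    if (start > len || del_len > len - start)
        return -1;
    if (out_size < len - del_len + 1)
        return -1;
    memcpy(out, seq, start);                    /* residues before the deletion */
    memcpy(out + start, seq + start + del_len,  /* residues after it, plus the NUL */
           len - start - del_len + 1);
    return 0;
}
```

Looping `start` over every position and `del_len` over 1 to 5 would give the full set of candidate deletion variants; the activity testing described above is of course an experimental step that code cannot replace.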

PRO polypeptide fragments are provided herein. Such fragments may be truncated at the N-terminus
or C-terminus, or may lack internal residues, for example, when compared with a full length native protein.
Certain fragments lack amino acid residues that are not essential for a desired biological activity of the PRO
polypeptide.

PRO fragments may be prepared by any of a number of conventional techniques. Desired peptide
fragments may be chemically synthesized. An alternative approach involves generating PRO fragments by
enzymatic digestion, e.g., by treating the protein with an enzyme known to cleave proteins at sites defined by
particular amino acid residues, or by digesting the DNA with suitable restriction enzymes and isolating the
desired fragment. Yet another suitable technique involves isolating and amplifying a DNA fragment encoding
a desired polypeptide fragment, by polymerase chain reaction (PCR). Oligonucleotides that define the desired
termini of the DNA fragment are employed at the 5' and 3' primers in the PCR. Preferably, PRO polypeptide
fragments share at least one biological and/or immunological activity with the native PRO polypeptide disclosed
herein.
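
The way a pair of oligonucleotides defines the termini of an amplified fragment can be illustrated as follows. This is a simplified sketch under stated assumptions: the template is given 5' to 3' as an upper-case A/C/G/T string, both primers have the same length `k`, and no account is taken of melting temperature, added restriction sites, or other practical primer design considerations. The function names are hypothetical.

```c
#include <assert.h>
#include <string.h>

/* Watson-Crick complement of a single upper-case DNA base; 'N' for anything else. */
static char complement(char base)
{
    switch (base) {
    case 'A': return 'T';
    case 'T': return 'A';
    case 'G': return 'C';
    case 'C': return 'G';
    default:  return 'N';
    }
}

/* Derive primers that define the two termini of "tmpl" (written 5'->3').
 * fwd receives the first k bases; rev receives the reverse complement of the
 * last k bases. Both buffers must hold at least k+1 chars.
 * Returns 0 on success, -1 if k is zero or longer than the template. */
int terminal_primers(const char *tmpl, size_t k, char *fwd, char *rev)
{
    size_t len, i;

    assert(tmpl != NULL && fwd != NULL && rev != NULL);
    len = strlen(tmpl);
    if (k == 0 || k > len)
        return -1;
    memcpy(fwd, tmpl, k);
    fwd[k] = '\0';
    for (i = 0; i < k; i++)
        rev[i] = complement(tmpl[len - 1 - i]);
    rev[k] = '\0';
    return 0;
}
```

The forward primer is identical to the 5' end of the desired fragment, while the reverse primer anneals to the opposite strand and is therefore the reverse complement of the 3' end.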

In particular embodiments, conservative substitutions of interest are shown in Table 6 under the heading
of preferred substitutions. If such substitutions result in a change in biological activity, then more substantial
changes, denominated exemplary substitutions in Table 6, or as further described below in reference to amino
acid classes, are introduced and the products screened.

Table 6

Original      Exemplary                                Preferred
Residue       Substitutions                            Substitutions

Ala (A)       val; leu; ile                            val
Arg (R)       lys; gln; asn                            lys
Asn (N)       gln; his; lys; arg                       gln
Asp (D)       glu                                      glu
Cys (C)       ser                                      ser
Gln (Q)       asn                                      asn
Glu (E)       asp                                      asp
Gly (G)       pro; ala                                 ala
His (H)       asn; gln; lys; arg                       arg
Ile (I)       leu; val; met; ala; phe; norleucine      leu
Leu (L)       norleucine; ile; val; met; ala; phe      ile
Lys (K)       arg; gln; asn                            arg
Met (M)       leu; phe; ile                            leu
Phe (F)       leu; val; ile; ala; tyr                  leu
Pro (P)       ala                                      ala
Ser (S)       thr                                      thr
Thr (T)       ser                                      ser
Trp (W)       tyr; phe                                 tyr
Tyr (Y)       trp; phe; thr; ser                       phe
Val (V)       ile; leu; met; phe; ala; norleucine      leu
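
For computational screening, the "Preferred Substitutions" column of Table 6 lends itself to a simple lookup. The sketch below encodes that column with one-letter residue codes; the function names are hypothetical, and the exemplary substitutions (including norleucine, which has no standard one-letter code) are deliberately left out.

```c
#include <assert.h>
#include <ctype.h>

/* Preferred substitution for a residue, per the last column of Table 6.
 * Input and output are one-letter amino acid codes; returns '\0' for any
 * character that is not one of the 20 residues listed in the table. */
char preferred_substitution(char residue)
{
    switch (toupper((unsigned char)residue)) {
    case 'A': return 'V';
    case 'R': return 'K';
    case 'N': return 'Q';
    case 'D': return 'E';
    case 'C': return 'S';
    case 'Q': return 'N';
    case 'E': return 'D';
    case 'G': return 'A';
    case 'H': return 'R';
    case 'I': return 'L';
    case 'L': return 'I';
    case 'K': return 'R';
    case 'M': return 'L';
    case 'F': return 'L';
    case 'P': return 'A';
    case 'S': return 'T';
    case 'T': return 'S';
    case 'W': return 'Y';
    case 'Y': return 'F';
    case 'V': return 'L';
    default:  return '\0';
    }
}

/* Sanity check on the encoding: every listed residue has a preferred
 * substitute that differs from it and is itself a listed residue. */
void table6_self_check(void)
{
    static const char residues[] = "ARNDCQEGHILKMFPSTWYV";
    int i;

    for (i = 0; residues[i] != '\0'; i++) {
        char sub = preferred_substitution(residues[i]);
        assert(sub != '\0');
        assert(sub != residues[i]);
        assert(preferred_substitution(sub) != '\0');
    }
}
```

A switch statement is used rather than a 26-entry array so that the correspondence with the rows of Table 6 can be checked by eye.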

Substantial modifications in function or immunological identity of the PRO polypeptide are accomplished
by selecting substitutions that differ significantly in their effect on maintaining (a) the structure of the polypeptide
backbone in the area of the substitution, for example, as a sheet or helical conformation, (b) the charge or
hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. Naturally occurring residues
are divided into groups based on common side-chain properties:

(1) hydrophobic: norleucine, met, ala, val, leu, ile;

(2) neutral hydrophilic: cys, ser, thr;

(3) acidic: asp, glu;

(4) basic: asn, gln, his, lys, arg;

(5) residues that influence chain orientation: gly, pro; and

(6) aromatic: trp, tyr, phe.

Non-conservative substitutions will entail exchanging a member of one of these classes for another class.
Such substituted residues also may be introduced into the conservative substitution sites or, more preferably, into
the remaining (non-conserved) sites.
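
The six side-chain classes listed above, and the resulting definition of a non-conservative substitution, can be expressed as a small classifier. In this sketch the class numbers follow the list exactly as given; the function names are hypothetical, and norleucine is omitted because it has no standard one-letter code.

```c
#include <assert.h>
#include <ctype.h>

/* Side-chain class (1..6) of a residue given by one-letter code, following
 * the grouping in the text; returns 0 for an unrecognized character. */
int side_chain_class(char residue)
{
    switch (toupper((unsigned char)residue)) {
    case 'M': case 'A': case 'V': case 'L': case 'I':
        return 1;                       /* hydrophobic */
    case 'C': case 'S': case 'T':
        return 2;                       /* neutral hydrophilic */
    case 'D': case 'E':
        return 3;                       /* acidic */
    case 'N': case 'Q': case 'H': case 'K': case 'R':
        return 4;                       /* basic, as grouped in the text */
    case 'G': case 'P':
        return 5;                       /* influence chain orientation */
    case 'W': case 'Y': case 'F':
        return 6;                       /* aromatic */
    default:
        return 0;
    }
}

/* Nonzero if replacing "from" with "to" exchanges one class for another. */
int is_non_conservative(char from, char to)
{
    int a = side_chain_class(from);
    int b = side_chain_class(to);

    assert(a != 0 && b != 0);           /* both must be listed residues */
    return a != b;
}
```

For example, an aspartate to lysine change crosses from class (3) to class (4) and is reported as non-conservative, whereas aspartate to glutamate stays within class (3).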

The variations can be made using methods known in the art such as oligonucleotide-mediated (site-directed)
mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed mutagenesis [Carter et al., Nucl.
Acids Res., 13:4331 (1986); Zoller et al., Nucl. Acids Res., 10:6487 (1987)], cassette mutagenesis [Wells et
al., Gene, 34:315 (1985)], restriction selection mutagenesis [Wells et al., Philos. Trans. R. Soc. London SerA,
317:415 (1986)] or other known techniques can be performed on the cloned DNA to produce the PRO variant
DNA.

Scanning amino acid analysis can also be employed to identify one or more amino acids along a
contiguous sequence. Among the preferred scanning amino acids are relatively small, neutral amino acids. Such
amino acids include alanine, glycine, serine, and cysteine. Alanine is typically a preferred scanning amino acid
among this group because it eliminates the side-chain beyond the beta-carbon and is less likely to alter the main-chain
conformation of the variant [Cunningham and Wells, Science, 244:1081-1085 (1989)]. Alanine is also
typically preferred because it is the most common amino acid. Further, it is frequently found in both buried and
exposed positions [Creighton, The Proteins, (W.H. Freeman & Co., N.Y.); Chothia, J. Mol. Biol., 150:1
(1975)]. If alanine substitution does not yield adequate amounts of variant, an isoteric amino acid can be used.
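
The set of variants examined in an alanine scan can be generated mechanically, one position at a time. The following is a minimal sketch with a hypothetical function name and interface; it only builds the variant sequence string and says nothing about expression yield or activity, which must be determined experimentally.

```c
#include <assert.h>
#include <string.h>

/* Write to "out" a copy of "seq" (one-letter codes) in which the residue at
 * 0-based index "pos" is replaced by alanine. Returns 0 on success, or -1 if
 * pos is out of range, the residue is already alanine (no variant results),
 * or "out" is too small to hold the sequence and its terminating NUL. */
int alanine_variant(const char *seq, size_t pos, char *out, size_t out_size)
{
    size_t len;

    assert(seq != NULL && out != NULL);
    len = strlen(seq);
    if (pos >= len || out_size < len + 1 || seq[pos] == 'A')
        return -1;
    memcpy(out, seq, len + 1);  /* copy including the terminating NUL */
    out[pos] = 'A';
    return 0;
}
```

Calling the function for every index of a contiguous stretch yields one single-substitution variant per non-alanine position, which matches the one-residue-at-a-time character of scanning analysis.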

C. Modifications of PRO

Covalent modifications of PRO are included within the scope of this invention. One type of covalent
modification includes reacting targeted amino acid residues of a PRO polypeptide with an organic derivatizing
agent that is capable of reacting with selected side chains or the N- or C-terminal residues of the PRO.
Derivatization with bifunctional agents is useful, for instance, for crosslinking PRO to a water-insoluble support
matrix or surface for use in the method for purifying anti-PRO antibodies, and vice-versa. Commonly used
crosslinking agents include, e.g., 1,1-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide
esters, for example, esters with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl
esters such as 3,3'-dithiobis(succinimidylpropionate), bifunctional maleimides such as bis-N-maleimido-1,8-octane
and agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate.

Other modifications include deamidation of glutaminyl and asparaginyl residues to the corresponding
glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, phosphorylation of hydroxyl
groups of seryl or threonyl residues, methylation of the α-amino groups of lysine, arginine, and histidine side
chains [T.E. Creighton, Proteins: Structure and Molecular Properties, W.H. Freeman & Co., San Francisco,
pp. 79-86 (1983)], acetylation of the N-terminal amine, and amidation of any C-terminal carboxyl group.

Another type of covalent modification of the PRO polypeptide included within the scope of this
invention comprises altering the native glycosylation pattern of the polypeptide. "Altering the native
glycosylation pattern" is intended for purposes herein to mean deleting one or more carbohydrate moieties found
in native sequence PRO (either by removing the underlying glycosylation site or by deleting the glycosylation
by chemical and/or enzymatic means), and/or adding one or more glycosylation sites that are not present in the
native sequence PRO. In addition, the phrase includes qualitative changes in the glycosylation of the native
proteins, involving a change in the nature and proportions of the various carbohydrate moieties present.

Addition of glycosylation sites to the PRO polypeptide may be accomplished by altering the amino acid
sequence. The alteration may be made, for example, by the addition of, or substitution by, one or more serine
or threonine residues to the native sequence PRO (for O-linked glycosylation sites). The PRO amino acid
sequence may optionally be altered through changes at the DNA level, particularly by mutating the DNA
encoding the PRO polypeptide at preselected bases such that codons are generated that will translate into the
desired amino acids.

Another means of increasing the number of carbohydrate moieties on the PRO polypeptide is by
chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described in the art, e.g.,
in WO 87/05330 published 11 September 1987, and in Aplin and Wriston, CRC Crit. Rev. Biochem., pp. 259-306
(1981).

Removal of carbohydrate moieties present on the PRO polypeptide may be accomplished chemically
or enzymatically or by mutational substitution of codons encoding for amino acid residues that serve as targets
for glycosylation. Chemical deglycosylation techniques are known in the art and described, for instance, by
Hakimuddin, et al., Arch. Biochem. Biophys., 259:52 (1987) and by Edge et al., Anal. Biochem., 118:131
(1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a variety
of endo- and exo-glycosidases as described by Thotakura et al., Meth. Enzymol., 138:350 (1987).

Another type of covalent modification of PRO comprises linking the PRO polypeptide to one of a variety
of nonproteinaceous polymers, e.g., polyethylene glycol (PEG), polypropylene glycol, or polyoxyalkylenes, in
the manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337.

The PRO of the present invention may also be modified in a way to form a chimeric molecule 
comprising PRO fused to another, heterologous polypeptide or amino acid sequence. 

In one embodiment, such a chimeric molecule comprises a fusion of the PRO with a tag polypeptide
which provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is generally placed
at the amino- or carboxyl-terminus of the PRO. The presence of such epitope-tagged forms of the PRO can be
detected using an antibody against the tag polypeptide. Also, provision of the epitope tag enables the PRO to
be readily purified by affinity purification using an anti-tag antibody or another type of affinity matrix that binds
to the epitope tag. Various tag polypeptides and their respective antibodies are well known in the art. Examples
include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; the flu HA tag polypeptide and its
antibody 12CA5 [Field et al., Mol. Cell. Biol., 8:2159-2165 (1988)]; the c-myc tag and the 8F9, 3C7, 6E10,
G4, B7 and 9E10 antibodies thereto [Evan et al., Molecular and Cellular Biology, 5:3610-3616 (1985)]; and the
Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Paborsky et al., Protein Engineering, 3(6):547-553
(1990)]. Other tag polypeptides include the Flag-peptide [Hopp et al., BioTechnology, 6:1204-1210
(1988)]; the KT3 epitope peptide [Martin et al., Science, 255:192-194 (1992)]; an α-tubulin epitope peptide
[Skinner et al., J. Biol. Chem., 266:15163-15166 (1991)]; and the T7 gene 10 protein peptide tag [Lutz-Freyermuth
et al., Proc. Natl. Acad. Sci. USA, 87:6393-6397 (1990)].

In an alternative embodiment, the chimeric molecule may comprise a fusion of the PRO with an
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the chimeric molecule (also
referred to as an "immunoadhesin"), such a fusion could be to the Fc region of an IgG molecule. The Ig fusions
preferably include the substitution of a soluble (transmembrane domain deleted or inactivated) form of a PRO
polypeptide in place of at least one variable region within an Ig molecule. In a particularly preferred
embodiment, the immunoglobulin fusion includes the hinge, CH2 and CH3, or the hinge, CH1, CH2 and CH3
regions of an IgG1 molecule. For the production of immunoglobulin fusions see also US Patent No. 5,428,130
issued June 27, 1995.

The description below relates primarily to production of PRO by culturing cells transformed or
transfected with a vector containing PRO nucleic acid. It is, of course, contemplated that alternative methods,
which are well known in the art, may be employed to prepare PRO. For instance, the PRO sequence, or
portions thereof, may be produced by direct peptide synthesis using solid-phase techniques [see, e.g., Stewart
et al., Solid-Phase Peptide Synthesis, W.H. Freeman Co., San Francisco, CA (1969); Merrifield, J. Am. Chem.
Soc., 85:2149-2154 (1963)]. In vitro protein synthesis may be performed using manual techniques or by
automation. Automated synthesis may be accomplished, for instance, using an Applied Biosystems Peptide
Synthesizer (Foster City, CA) using manufacturer's instructions. Various portions of the PRO may be
chemically synthesized separately and combined using chemical or enzymatic methods to produce the full-length
PRO.

1. Isolation of DNA Encoding PRO

DNA encoding PRO may be obtained from a cDNA library prepared from tissue believed to possess
the PRO mRNA and to express it at a detectable level. Accordingly, human PRO DNA can be conveniently
obtained from a cDNA library prepared from human tissue, such as described in the Examples. The PRO-encoding
gene may also be obtained from a genomic library or by known synthetic procedures (e.g., automated
nucleic acid synthesis).

Libraries can be screened with probes (such as antibodies to the PRO or oligonucleotides of at least
about 20-80 bases) designed to identify the gene of interest or the protein encoded by it. Screening the cDNA
or genomic library with the selected probe may be conducted using standard procedures, such as described in
Sambrook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press,
1989). An alternative means to isolate the gene encoding PRO is to use PCR methodology [Sambrook et al.,
supra; Dieffenbach et al., PCR Primer: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 1995)].

The Examples below describe techniques for screening a cDNA library. The oligonucleotide sequences
selected as probes should be of sufficient length and sufficiently unambiguous that false positives are minimized.
The oligonucleotide is preferably labeled such that it can be detected upon hybridization to DNA in the library
being screened. Methods of labeling are well known in the art, and include the use of radiolabels like 32P-labeled
ATP, biotinylation or enzyme labeling. Hybridization conditions, including moderate stringency and high
stringency, are provided in Sambrook et al., supra.

Sequences identified in such library screening methods can be compared and aligned to other known
sequences deposited and available in public databases such as GenBank or other private sequence databases.
Sequence identity (at either the amino acid or nucleotide level) within defined regions of the molecule or across
the full-length sequence can be determined using methods known in the art and as described herein.

Nucleic acid having protein coding sequence may be obtained by screening selected cDNA or genomic 
libraries using the deduced amino acid sequence disclosed herein for the first time, and, if necessary, using 
conventional primer extension procedures as described in Sambrook et al., supra , to detect precursors and 
processing intermediates of mRNA that may not have been reverse-transcribed into cDNA. 


2. Selection and Transformation of Host Cells

Host cells are transfected or transformed with expression or cloning vectors described herein for PRO
production and cultured in conventional nutrient media modified as appropriate for inducing promoters, selecting
transformants, or amplifying the genes encoding the desired sequences. The culture conditions, such as media,
temperature, pH and the like, can be selected by the skilled artisan without undue experimentation. In general,
principles, protocols, and practical techniques for maximizing the productivity of cell cultures can be found in
Mammalian Cell Biotechnology: a Practical Approach, M. Butler, ed. (IRL Press, 1991) and Sambrook et al.,
supra.

Methods of eukaryotic cell transfection and prokaryotic cell transformation are known to the ordinarily
skilled artisan, for example, CaCl2, CaPO4, liposome-mediated and electroporation. Depending on the host cell
used, transformation is performed using standard techniques appropriate to such cells. The calcium treatment
employing calcium chloride, as described in Sambrook et al., supra, or electroporation is generally used for
prokaryotes. Infection with Agrobacterium tumefaciens is used for transformation of certain plant cells, as
described by Shaw et al., Gene, 23:315 (1983) and WO 89/05859 published 29 June 1989. For mammalian cells
without such cell walls, the calcium phosphate precipitation method of Graham and van der Eb, Virology,
52:456-457 (1978) can be employed. General aspects of mammalian cell host system transfections have been
described in U.S. Patent No. 4,399,216. Transformations into yeast are typically carried out according to the
method of Van Solingen et al., J. Bact., 130:946 (1977) and Hsiao et al., Proc. Natl. Acad. Sci. (USA), 76:3829
(1979). However, other methods for introducing DNA into cells, such as by nuclear microinjection,
electroporation, bacterial protoplast fusion with intact cells, or polycations, e.g., polybrene, polyornithine, may
also be used. For various techniques for transforming mammalian cells, see Keown et al., Methods in
Enzymology, 185:527-537 (1990) and Mansour et al., Nature, 336:348-352 (1988).

Suitable host cells for cloning or expressing the DNA in the vectors herein include prokaryote, yeast,
or higher eukaryote cells. Suitable prokaryotes include but are not limited to eubacteria, such as Gram-negative
or Gram-positive organisms, for example, Enterobacteriaceae such as E. coli. Various E. coli strains are
publicly available, such as E. coli K12 strain MM294 (ATCC 31,446); E. coli X1776 (ATCC 31,537); E. coli
strain W3110 (ATCC 27,325) and K5 772 (ATCC 53,635). Other suitable prokaryotic host cells include
Enterobacteriaceae such as Escherichia, e.g., E. coli, Enterobacter, Erwinia, Klebsiella, Proteus, Salmonella,
e.g., Salmonella typhimurium, Serratia, e.g., Serratia marcescens, and Shigella, as well as Bacilli such as B.
subtilis and B. licheniformis (e.g., B. licheniformis 41P disclosed in DD 266,710 published 12 April 1989),
Pseudomonas such as P. aeruginosa, and Streptomyces. These examples are illustrative rather than limiting.
Strain W3110 is one particularly preferred host or parent host because it is a common host strain for recombinant
DNA product fermentations. Preferably, the host cell secretes minimal amounts of proteolytic enzymes. For
example, strain W3110 may be modified to effect a genetic mutation in the genes encoding proteins endogenous
to the host, with examples of such hosts including E. coli W3110 strain 1A2, which has the complete genotype
tonA; E. coli W3110 strain 9E4, which has the complete genotype tonA ptr3; E. coli W3110 strain 27C7
(ATCC 55,244), which has the complete genotype tonA ptr3 phoA E15 (argF-lac)169 degP ompT kanr; E. coli
W3110 strain 37D6, which has the complete genotype tonA ptr3 phoA E15 (argF-lac)169 degP ompT rbs7
ilvG kanr; E. coli W3110 strain 40B4, which is strain 37D6 with a non-kanamycin resistant degP deletion
mutation; and an E. coli strain having mutant periplasmic protease disclosed in U.S. Patent No. 4,946,783 issued
7 August 1990. Alternatively, in vitro methods of cloning, e.g., PCR or other nucleic acid polymerase
reactions, are suitable.

In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable cloning
or expression hosts for PRO-encoding vectors. Saccharomyces cerevisiae is a commonly used lower eukaryotic
host microorganism. Others include Schizosaccharomyces pombe (Beach and Nurse, Nature, 290:140 [1981];
EP 139,383 published 2 May 1985); Kluyveromyces hosts (U.S. Patent No. 4,943,529; Fleer et al.,
Bio/Technology, 9:968-975 (1991)) such as, e.g., K. lactis (MW98-8C, CBS683, CBS4574; Louvencourt et
al., J. Bacteriol., 154(2):737-742 [1983]), K. fragilis (ATCC 12,424), K. bulgaricus (ATCC 16,045), K.
wickeramii (ATCC 24,178), K. waltii (ATCC 56,500), K. drosophilarum (ATCC 36,906; Van den Berg et al.,
Bio/Technology, 8:135 (1990)), K. thermotolerans, and K. marxianus; yarrowia (EP 402,226); Pichia pastoris
(EP 183,070; Sreekrishna et al., J. Basic Microbiol., 28:265-278 [1988]); Candida; Trichoderma reesia (EP
244,234); Neurospora crassa (Case et al., Proc. Natl. Acad. Sci. USA, 76:5259-5263 [1979]); Schwanniomyces
such as Schwanniomyces occidentalis (EP 394,538 published 31 October 1990); and filamentous fungi such as,
e.g., Neurospora, Penicillium, Tolypocladium (WO 91/00357 published 10 January 1991), and Aspergillus hosts
such as A. nidulans (Ballance et al., Biochem. Biophys. Res. Commun., 112:284-289 [1983]; Tilburn et al.,
Gene, 26:205-221 [1983]; Yelton et al., Proc. Natl. Acad. Sci. USA, 81:1470-1474 [1984]) and A. niger (Kelly
and Hynes, EMBO J., 4:475-479 [1985]). Methylotrophic yeasts are suitable herein and include, but are not
limited to, yeast capable of growth on methanol selected from the genera consisting of Hansenula, Candida,
Kloeckera, Pichia, Saccharomyces, Torulopsis, and Rhodotorula. A list of specific species that are exemplary
of this class of yeasts may be found in C. Anthony, The Biochemistry of Methylotrophs, 269 (1982).

Suitable host cells for the expression of glycosylated PRO are derived from multicellular organisms.
Examples of invertebrate cells include insect cells such as Drosophila S2 and Spodoptera Sf9, as well as plant
cells. Examples of useful mammalian host cell lines include Chinese hamster ovary (CHO) and COS cells.
More specific examples include monkey kidney CV1 line transformed by SV40 (COS-7, ATCC CRL 1651);
human embryonic kidney line (293 or 293 cells subcloned for growth in suspension culture, Graham et al., J.
Gen Virol., 36:59 (1977)); Chinese hamster ovary cells/-DHFR (CHO, Urlaub and Chasin, Proc. Natl. Acad.
Sci. USA, 77:4216 (1980)); mouse Sertoli cells (TM4, Mather, Biol. Reprod., 23:243-251 (1980)); human lung
cells (W138, ATCC CCL 75); human liver cells (Hep G2, HB 8065); and mouse mammary tumor (MMT
060562, ATCC CCL51). The selection of the appropriate host cell is deemed to be within the skill in the art.

3. Selection and Use of a Replicable Vector
The nucleic acid (e.g., cDNA or genomic DNA) encoding PRO may be inserted into a replicable vector
for cloning (amplification of the DNA) or for expression. Various vectors are publicly available. The vector
may, for example, be in the form of a plasmid, cosmid, viral particle, or phage. The appropriate nucleic acid
sequence may be inserted into the vector by a variety of procedures. In general, DNA is inserted into an
appropriate restriction endonuclease site(s) using techniques known in the art. Vector components generally
include, but are not limited to, one or more of a signal sequence, an origin of replication, one or more marker
genes, an enhancer element, a promoter, and a transcription termination sequence. Construction of suitable
vectors containing one or more of these components employs standard ligation techniques which are known to
the skilled artisan.
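
When an insert is to be placed into a restriction endonuclease site, it is often useful to check first which enzymes would also cut within the insert itself. The sketch below is offered only as an illustration: the small enzyme table and the function names are assumptions chosen for the example, and the recognition sequences listed are the commonly cited ones for those enzymes. Because each listed site is palindromic, scanning one strand is sufficient.

```python
# Recognition sequences (5'->3') for a few commonly used restriction enzymes.
# Every site in this table is palindromic, so one strand is enough to scan.
RECOGNITION_SITES = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "HindIII": "AAGCTT",
    "XhoI": "CTCGAG",
    "NotI": "GCGGCCGC",
}


def find_sites(sequence: str, site: str) -> list:
    """Return 0-based start positions of every (possibly overlapping) match."""
    sequence = sequence.upper()
    positions = []
    start = sequence.find(site)
    while start != -1:
        positions.append(start)
        start = sequence.find(site, start + 1)
    return positions


def non_cutting_enzymes(insert: str) -> list:
    """Names of enzymes in the table that have no recognition site in the insert."""
    return sorted(
        name for name, site in RECOGNITION_SITES.items()
        if not find_sites(insert, site)
    )


insert = "ATGGAATTCAAAGGATCCTTT"  # hypothetical insert, not a PRO sequence
print(find_sites(insert, RECOGNITION_SITES["EcoRI"]))
print(non_cutting_enzymes(insert))
```

Enzymes reported as non-cutting are candidates for the cloning sites flanking the insert, subject to their presence in the chosen vector.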

The PRO may be produced recombinantly not only directly, but also as a fusion polypeptide with a
heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific cleavage site
at the N-terminus of the mature protein or polypeptide. In general, the signal sequence may be a component of
the vector, or it may be a part of the PRO-encoding DNA that is inserted into the vector. The signal sequence
may be a prokaryotic signal sequence selected, for example, from the group of the alkaline phosphatase,
penicillinase, lpp, or heat-stable enterotoxin II leaders. For yeast secretion the signal sequence may be, e.g.,
the yeast invertase leader, alpha factor leader (including Saccharomyces and Kluyveromyces α-factor leaders,
the latter described in U.S. Patent No. 5,010,182), or acid phosphatase leader, the C. albicans glucoamylase
leader (EP 362,179 published 4 April 1990), or the signal described in WO 90/13646 published 15 November
1990. In mammalian cell expression, mammalian signal sequences may be used to direct secretion of the
protein, such as signal sequences from secreted polypeptides of the same or related species, as well as viral
secretory leaders.

Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to replicate
in one or more selected host cells. Such sequences are well known for a variety of bacteria, yeast, and viruses.
The origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2μ plasmid
origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, VSV or BPV) are useful for
cloning vectors in mammalian cells.

Expression and cloning vectors will typically contain a selection gene, also termed a selectable marker.
Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxins, e.g., ampicillin,
neomycin, methotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) supply critical nutrients
not available from complex media, e.g., the gene encoding D-alanine racemase for Bacilli.

An example of suitable selectable markers for mammalian cells are those that enable the identification
of cells competent to take up the PRO-encoding nucleic acid, such as DHFR or thymidine kinase. An
appropriate host cell when wild-type DHFR is employed is the CHO cell line deficient in DHFR activity,
prepared and propagated as described by Urlaub et al., Proc. Natl. Acad. Sci. USA, 77:4216 (1980). A suitable
selection gene for use in yeast is the trp1 gene present in the yeast plasmid YRp7 [Stinchcomb et al., Nature,
282:39 (1979); Kingsman et al., Gene, 7:141 (1979); Tschemper et al., Gene, 10:157 (1980)]. The trp1 gene
provides a selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan, for example,
ATCC No. 44076 or PEP4-1 [Jones, Genetics, 85:12 (1977)].

Expression and cloning vectors usually contain a promoter operably linked to the PRO-encoding nucleic
acid sequence to direct mRNA synthesis. Promoters recognized by a variety of potential host cells are well
known. Promoters suitable for use with prokaryotic hosts include the β-lactamase and lactose promoter systems
[Chang et al., Nature, 275:615 (1978); Goeddel et al., Nature, 281:544 (1979)], alkaline phosphatase, a
tryptophan (trp) promoter system [Goeddel, Nucleic Acids Res., 8:4057 (1980); EP 36,776], and hybrid
promoters such as the tac promoter [deBoer et al., Proc. Natl. Acad. Sci. USA, 80:21-25 (1983)]. Promoters
for use in bacterial systems also will contain a Shine-Dalgarno (S.D.) sequence operably linked to the DNA
encoding PRO.

Examples of suitable promoting sequences for use with yeast hosts include the promoters for
3-phosphoglycerate kinase [Hitzeman et al., J. Biol. Chem., 255:2073 (1980)] or other glycolytic enzymes [Hess
et al., J. Adv. Enzyme Reg., 7:149 (1968); Holland, Biochemistry, 17:4900 (1978)], such as enolase,
glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase,
glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose
isomerase, and glucokinase.

Other yeast promoters, which are inducible promoters having the additional advantage of transcription
controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid
phosphatase, degradative enzymes associated with nitrogen metabolism, metallothionein,
glyceraldehyde-3-phosphate dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable vectors and
promoters for use in yeast expression are further described in EP 73,657.

PRO transcription from vectors in mammalian host cells is controlled, for example, by promoters
obtained from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,211,504 published 5 July
1989), adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a
retrovirus, hepatitis-B virus and Simian Virus 40 (SV40), from heterologous mammalian promoters, e.g., the
actin promoter or an immunoglobulin promoter, and from heat-shock promoters, provided such promoters are
compatible with the host cell systems.

Transcription of a DNA encoding the PRO by higher eukaryotes may be increased by inserting an
enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually about from 10 to 300
bp, that act on a promoter to increase its transcription. Many enhancer sequences are now known from
mammalian genes (globin, elastase, albumin, α-fetoprotein, and insulin). Typically, however, one will use an
enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer on the late side of the replication
origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the
replication origin, and adenovirus enhancers. The enhancer may be spliced into the vector at a position 5' or
3' to the PRO coding sequence, but is preferably located at a site 5' from the promoter.

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or nucleated
cells from other multicellular organisms) will also contain sequences necessary for the termination of
transcription and for stabilizing the mRNA. Such sequences are commonly available from the 5' and,
occasionally 3', untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide
segments transcribed as polyadenylated fragments in the untranslated portion of the mRNA encoding PRO.

Still other methods, vectors, and host cells suitable for adaptation to the synthesis of PRO in
recombinant vertebrate cell culture are described in Gething et al., Nature, 293:620-625 (1981); Mantei et al.,
Nature, 281:40-46 (1979); EP 117,060; and EP 117,058.

4. Detecting Gene Amplification/Expression

Gene amplification and/or expression may be measured in a sample directly, for example, by
conventional Southern blotting, Northern blotting to quantitate the transcription of mRNA [Thomas, Proc. Natl.
Acad. Sci. USA, 77:5201-5205 (1980)], dot blotting (DNA analysis), or in situ hybridization, using an
appropriately labeled probe, based on the sequences provided herein. Alternatively, antibodies may be employed
that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes
or DNA-protein duplexes. The antibodies in turn may be labeled and the assay may be carried out where the
duplex is bound to a surface, so that upon the formation of duplex on the surface, the presence of antibody bound
to the duplex can be detected.

Gene expression, alternatively, may be measured by immunological methods, such as
immunohistochemical staining of cells or tissue sections and assay of cell culture or body fluids, to quantitate
directly the expression of gene product. Antibodies useful for immunohistochemical staining and/or assay of
sample fluids may be either monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, the
antibodies may be prepared against a native sequence PRO polypeptide or against a synthetic peptide based on
the DNA sequences provided herein or against exogenous sequence fused to PRO DNA and encoding a specific
antibody epitope.

5. Purification of Polypeptide
Forms of PRO may be recovered from culture medium or from host cell lysates. If membrane-bound,
it can be released from the membrane using a suitable detergent solution (e.g., Triton-X 100) or by enzymatic
cleavage. Cells employed in expression of PRO can be disrupted by various physical or chemical means, such
as freeze-thaw cycling, sonication, mechanical disruption, or cell lysing agents.

It may be desired to purify PRO from recombinant cell proteins or polypeptides. The following
procedures are exemplary of suitable purification procedures: by fractionation on an ion-exchange column;
ethanol precipitation; reverse phase HPLC; chromatography on silica or on a cation-exchange resin such as
DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using, for example,
Sephadex G-75; protein A Sepharose columns to remove contaminants such as IgG; and metal chelating columns
to bind epitope-tagged forms of the PRO. Various methods of protein purification may be employed and such
methods are known in the art and described for example in Deutscher, Methods in Enzymology, 182 (1990);
Scopes, Protein Purification: Principles and Practice, Springer-Verlag, New York (1982). The purification
step(s) selected will depend, for example, on the nature of the production process used and the particular PRO



Nucleotide sequences (or their complement) encoding PRO have various applications in the art of
molecular biology, including uses as hybridization probes, in chromosome and gene mapping and in the
generation of antisense RNA and DNA. PRO nucleic acid will also be useful for the preparation of PRO
polypeptides by the recombinant techniques described herein.

The full-length native sequence PRO gene, or portions thereof, may be used as hybridization probes
for a cDNA library to isolate the full-length PRO cDNA or to isolate still other cDNAs (for instance, those
encoding naturally-occurring variants of PRO or PRO from other species) which have a desired sequence identity
to the native PRO sequence disclosed herein. Optionally, the length of the probes will be about 20 to about 50
bases. The hybridization probes may be derived from at least partially novel regions of the full length native
nucleotide sequence wherein those regions may be determined without undue experimentation or from genomic
sequences including promoters, enhancer elements and introns of native sequence PRO. By way of example,
a screening method will comprise isolating the coding region of the PRO gene using the known DNA sequence
to synthesize a selected probe of about 40 bases. Hybridization probes may be labeled by a variety of labels,
including radionucleotides such as 32P or 35S, or enzymatic labels such as alkaline phosphatase coupled to the
probe via avidin/biotin coupling systems. Labeled probes having a sequence complementary to that of the PRO
gene of the present invention can be used to screen libraries of human cDNA, genomic DNA or mRNA to
determine which members of such libraries the probe hybridizes to. Hybridization techniques are described in
further detail in the Examples below.
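
As a purely illustrative sketch, candidate probes of a chosen length that are complementary to a known coding sequence could be enumerated as shown below. The default length of 40 bases follows the example given above; the step size, the GC-fraction report and all function names are assumptions introduced for the sketch and are not part of the described method.

```python
# Translation table giving the complementary base for each DNA base.
DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, written 5'->3'."""
    return dna.upper().translate(DNA_COMPLEMENT)[::-1]


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = dna.upper()
    return (dna.count("G") + dna.count("C")) / len(dna)


def candidate_probes(coding_strand: str, length: int = 40, step: int = 10) -> list:
    """List (start, probe, gc_fraction) for windows tiled along the coding strand.

    Each probe is the reverse complement of its target window, so it is
    complementary to the coding strand at that position. An input shorter
    than `length` yields an empty list.
    """
    if length < 1 or step < 1:
        raise ValueError("length and step must be positive")
    probes = []
    for start in range(0, len(coding_strand) - length + 1, step):
        target = coding_strand[start:start + length]
        probes.append((start, reverse_complement(target), gc_fraction(target)))
    return probes


# Short hypothetical sequence and window so the output is easy to inspect.
for start, probe, gc in candidate_probes("ATGCATGCAT", length=4, step=3):
    print(start, probe, gc)
```

Any window length within the range of about 20 to about 50 bases mentioned above can be supplied through the `length` argument.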

Any EST sequences disclosed in the present application may similarly be employed as probes, using 
the methods disclosed herein.

Other useful fragments of the PRO nucleic acids include antisense or sense oligonucleotides comprising
a single-stranded nucleic acid sequence (either RNA or DNA) capable of binding to target PRO mRNA (sense)
or PRO DNA (antisense) sequences. Antisense or sense oligonucleotides, according to the present invention,
comprise a fragment of the coding region of PRO DNA. Such a fragment generally comprises at least about 14
nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive an antisense or a sense
oligonucleotide, based upon a cDNA sequence encoding a given protein is described in, for example, Stein and
Cohen (Cancer Res. 48:2659, 1988) and van der Krol et al. (BioTechniques, 6:958, 1988).
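
The derivation of an antisense oligonucleotide from a coding sequence can be illustrated with a short sketch. It assumes, for the purposes of the example only, that the target is given as an mRNA sequence (written with U) and that the antisense molecule is a DNA oligonucleotide of 14 to 30 nucleotides; the function names are hypothetical.

```python
# Maps each RNA base to the DNA base that pairs with it.
RNA_TO_DNA_COMPLEMENT = str.maketrans("ACGU", "TGCA")


def antisense_dna(mrna_fragment: str) -> str:
    """DNA oligonucleotide (5'->3') that base-pairs with the given mRNA fragment."""
    return mrna_fragment.upper().translate(RNA_TO_DNA_COMPLEMENT)[::-1]


def antisense_candidates(mrna: str, min_len: int = 14, max_len: int = 30) -> list:
    """List (start, length, oligo) for every window of min_len..max_len nucleotides."""
    candidates = []
    longest = min(max_len, len(mrna))
    for length in range(min_len, longest + 1):
        for start in range(len(mrna) - length + 1):
            fragment = mrna[start:start + length]
            candidates.append((start, length, antisense_dna(fragment)))
    return candidates


mrna = "AUGGCCAUUGUAAUGG"  # hypothetical 16-nucleotide target, not a PRO sequence
candidates = antisense_candidates(mrna)
print(len(candidates))
print(candidates[0])
```

In practice the enumerated candidates would be filtered further, for example for specificity to the target, which this sketch does not attempt.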

Binding of antisense or sense oligonucleotides to target nucleic acid sequences results in the formation
of duplexes that block transcription or translation of the target sequence by one of several means, including
enhanced degradation of the duplexes, premature termination of transcription or translation, or by other means.
The antisense oligonucleotides thus may be used to block expression of PRO proteins. Antisense or sense
oligonucleotides further comprise oligonucleotides having modified sugar-phosphodiester backbones (or other
sugar linkages, such as those described in WO 91/06629) and wherein such sugar linkages are resistant to
endogenous nucleases. Such oligonucleotides with resistant sugar linkages are stable in vivo (i.e., capable of
resisting enzymatic degradation) but retain sequence specificity to be able to bind to target nucleotide sequences.

Other examples of sense or antisense oligonucleotides include those oligonucleotides which are
covalently linked to organic moieties, such as those described in WO 90/10048, and other moieties that increase
affinity of the oligonucleotide for a target nucleic acid sequence, such as poly-(L-lysine). Further still,
intercalating agents, such as ellipticine, and alkylating agents or metal complexes may be attached to sense or
antisense oligonucleotides to modify binding specificities of the antisense or sense oligonucleotide for the target
nucleotide sequence.

Antisense or sense oligonucleotides may be introduced into a cell containing the target nucleic acid
sequence by any gene transfer method, including, for example, CaPO4-mediated DNA transfection,
electroporation, or by using gene transfer vectors such as Epstein-Barr virus. In a preferred procedure, an
antisense or sense oligonucleotide is inserted into a suitable retroviral vector. A cell containing the target nucleic
acid sequence is contacted with the recombinant retroviral vector, either in vivo or ex vivo. Suitable retroviral
vectors include, but are not limited to, those derived from the murine retrovirus M-MuLV, N2 (a retrovirus
derived from M-MuLV), or the double copy vectors designated DCT5A, DCT5B and DCT5C (see WO
90/13641).

Sense or antisense oligonucleotides also may be introduced into a cell containing the target nucleotide
sequence by formation of a conjugate with a ligand binding molecule, as described in WO 91/04753. Suitable
ligand binding molecules include, but are not limited to, cell surface receptors, growth factors, other cytokines,
or other ligands that bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does
not substantially interfere with the ability of the ligand binding molecule to bind to its corresponding molecule
or receptor, or block entry of the sense or antisense oligonucleotide or its conjugated version into the cell.

Alternatively, a sense or an antisense oligonucleotide may be introduced into a cell containing the target
nucleic acid sequence by formation of an oligonucleotide-lipid complex, as described in WO 90/10448. The
sense or antisense oligonucleotide-lipid complex is preferably dissociated within the cell by an endogenous

Antisense or sense RNA or DNA molecules are generally at least about 5 bases in length, about 10
bases in length, about 15 bases in length, about 20 bases in length, about 25 bases in length, about 30 bases in
length, about 35 bases in length, about 40 bases in length, about 45 bases in length, about 50 bases in length,
about 55 bases in length, about 60 bases in length, about 65 bases in length, about 70 bases in length, about 75
bases in length, about 80 bases in length, about 85 bases in length, about 90 bases in length, about 95 bases in
length, about 100 bases in length, or more.

The probes may also be employed in PCR techniques to generate a pool of sequences for identification
of closely related PRO coding sequences.

Nucleotide sequences encoding a PRO can also be used to construct hybridization probes for mapping 
the gene which encodes that PRO and for the genetic analysis of individuals with genetic disorders. The 
nucleotide sequences provided herein may be mapped to a chromosome and specific regions of a chromosome 
using known techniques, such as in situ hybridization, linkage analysis against known chromosomal markers, 
and hybridization screening with libraries. 

When the coding sequences for PRO encode a protein which binds to another protein (for example, where
the PRO is a receptor), the PRO can be used in assays to identify the other proteins or molecules involved in
the binding interaction. By such methods, inhibitors of the receptor/ligand binding interaction can be identified.
Proteins involved in such binding interactions can also be used to screen for peptide or small molecule inhibitors
or agonists of the binding interaction. Also, the receptor PRO can be used to isolate correlative ligand(s).
Screening assays can be designed to find lead compounds that mimic the biological activity of a native PRO or
a receptor for PRO. Such screening assays will include assays amenable to high-throughput screening of
chemical libraries, making them particularly suitable for identifying small molecule drug candidates. Small
molecules contemplated include synthetic organic or inorganic compounds. The assays can be performed in a
variety of formats, including protein-protein binding assays, biochemical screening assays, immunoassays and
cell based assays, which are well characterized in the art.

Nucleic acids which encode PRO or its modified forms can also be used to generate either transgenic
animals or "knock out" animals which, in turn, are useful in the development and screening of therapeutically
useful reagents. A transgenic animal (e.g., a mouse or rat) is an animal having cells that contain a transgene,
which transgene was introduced into the animal or an ancestor of the animal at a prenatal, e.g., an embryonic
stage. A transgene is a DNA which is integrated into the genome of a cell from which a transgenic animal
develops. In one embodiment, cDNA encoding PRO can be used to clone genomic DNA encoding PRO in
accordance with established techniques and the genomic sequences used to generate transgenic animals that
contain cells which express DNA encoding PRO. Methods for generating transgenic animals, particularly
animals such as mice or rats, have become conventional in the art and are described, for example, in U.S. Patent
Nos. 4,736,866 and 4,870,009. Typically, particular cells would be targeted for PRO transgene incorporation
with tissue-specific enhancers. Transgenic animals that include a copy of a transgene encoding PRO introduced
into the germ line of the animal at an embryonic stage can be used to examine the effect of increased expression
of DNA encoding PRO. Such animals can be used as tester animals for reagents thought to confer protection
from, for example, pathological conditions associated with its overexpression. In accordance with this facet of
the invention, an animal is treated with the reagent and a reduced incidence of the pathological condition,
compared to untreated animals bearing the transgene, would indicate a potential therapeutic intervention for the
pathological condition.

Alternatively, non-human homologues of PRO can be used to construct a PRO "knock out" animal
which has a defective or altered gene encoding PRO as a result of homologous recombination between the
endogenous gene encoding PRO and altered genomic DNA encoding PRO introduced into an embryonic stem
cell of the animal. For example, cDNA encoding PRO can be used to clone genomic DNA encoding PRO in
accordance with established techniques. A portion of the genomic DNA encoding PRO can be deleted or
replaced with another gene, such as a gene encoding a selectable marker which can be used to monitor
integration. Typically, several kilobases of unaltered flanking DNA (both at the 5' and 3' ends) are included
in the vector [see e.g., Thomas and Capecchi, Cell, 51:503 (1987) for a description of homologous
recombination vectors]. The vector is introduced into an embryonic stem cell line (e.g., by electroporation) and
cells in which the introduced DNA has homologously recombined with the endogenous DNA are selected [see
e.g., Li et al., Cell, 69:915 (1992)]. The selected cells are then injected into a blastocyst of an animal (e.g.,
a mouse or rat) to form aggregation chimeras [see e.g., Bradley, in Teratocarcinomas and Embryonic Stem
Cells: A Practical Approach, E. J. Robertson, ed. (IRL, Oxford, 1987), pp. 113-152]. A chimeric embryo can
then be implanted into a suitable pseudopregnant female foster animal and the embryo brought to term to create
a "knock out" animal. Progeny harboring the homologously recombined DNA in their germ cells can be
identified by standard techniques and used to breed animals in which all cells of the animal contain the
homologously recombined DNA. Knockout animals can be characterized, for instance, for their ability to defend
against certain pathological conditions and for their development of pathological conditions due to absence of
the PRO polypeptide.

Nucleic acid encoding the PRO polypeptides may also be used in gene therapy. In gene therapy
applications, genes are introduced into cells in order to achieve in vivo synthesis of a therapeutically effective
genetic product, for example for replacement of a defective gene. "Gene therapy" includes both conventional
gene therapy where a lasting effect is achieved by a single treatment, and the administration of gene therapeutic
agents, which involves the one time or repeated administration of a therapeutically effective DNA or mRNA.
Antisense RNAs and DNAs can be used as therapeutic agents for blocking the expression of certain genes in
vivo. It has already been shown that short antisense oligonucleotides can be imported into cells where they act
as inhibitors, despite their low intracellular concentrations caused by their restricted uptake by the cell
membrane (Zamecnik et al., Proc. Natl. Acad. Sci. USA 83:4143-4146 [1986]). The oligonucleotides can be
modified to enhance their uptake, e.g., by substituting their negatively charged phosphodiester groups by
uncharged groups.

There are a variety of techniques available for introducing nucleic acids into viable cells. The
techniques vary depending upon whether the nucleic acid is transferred into cultured cells in vitro, or in vivo in
the cells of the intended host. Techniques suitable for the transfer of nucleic acid into mammalian cells in vitro
include the use of liposomes, electroporation, microinjection, cell fusion, DEAE-dextran, the calcium phosphate
precipitation method, etc. The currently preferred in vivo gene transfer techniques include transfection with viral
(typically retroviral) vectors and viral coat protein-liposome mediated transfection (Dzau et al., Trends in
Biotechnology 11, 205-210 [1993]). In some situations it is desirable to provide the nucleic acid source with
an agent that targets the target cells, such as an antibody specific for a cell surface membrane protein or the
target cell, a ligand for a receptor on the target cell, etc. Where liposomes are employed, proteins which bind
to a cell surface membrane protein associated with endocytosis may be used for targeting and/or to facilitate
uptake, e.g., capsid proteins or fragments thereof tropic for a particular cell type, antibodies for proteins which
undergo internalization in cycling, proteins that target intracellular localization and enhance intracellular half-life.
The technique of receptor-mediated endocytosis is described, for example, by Wu et al., J. Biol. Chem. 262,
4429-4432 (1987); and Wagner et al., Proc. Natl. Acad. Sci. USA 87, 3410-3414 (1990). For review of gene
marking and gene therapy protocols see Anderson et al., Science 256, 808-813 (1992).

The PRO polypeptides described herein may also be employed as molecular weight markers for protein
electrophoresis purposes and the isolated nucleic acid sequences may be used for recombinantly expressing those
markers.

The nucleic acid molecules encoding the PRO polypeptides or fragments thereof described herein are
useful for chromosome identification. In this regard, there exists an ongoing need to identify new chromosome
markers, since relatively few chromosome marking reagents, based upon actual sequence data, are presently
available. Each PRO nucleic acid molecule of the present invention can be used as a chromosome marker.

The PRO polypeptides and nucleic acid molecules of the present invention may also be used
diagnostically for tissue typing, wherein the PRO polypeptides of the present invention may be differentially
expressed in one tissue as compared to another, preferably in a diseased tissue as compared to a normal tissue
of the same tissue type. PRO nucleic acid molecules will find use for generating probes for PCR, Northern
analysis, Southern analysis and Western analysis.

The PRO polypeptides described herein may also be employed as therapeutic agents. The PRO
polypeptides of the present invention can be formulated according to known methods to prepare pharmaceutically
useful compositions, whereby the PRO product hereof is combined in admixture with a pharmaceutically
acceptable carrier vehicle. Therapeutic formulations are prepared for storage by mixing the active ingredient
having the desired degree of purity with optional physiologically acceptable carriers, excipients or stabilizers
(Remington's Pharmaceutical Sciences 16th edition, Osol, A. Ed. (1980)), in the form of lyophilized
formulations or aqueous solutions. Acceptable carriers, excipients or stabilizers are nontoxic to recipients at the
dosages and concentrations employed, and include buffers such as phosphate, citrate and other organic acids;
antioxidants including ascorbic acid; low molecular weight (less than about 10 residues) polypeptides; proteins,
such as serum albumin, gelatin or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino
acids such as glycine, glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides and other
carbohydrates including glucose, mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as
mannitol or sorbitol; salt-forming counterions such as sodium; and/or nonionic surfactants such as TWEEN™,
PLURONICS™ or PEG.

The formulations to be used for in vivo administration must be sterile. This is readily accomplished by 
filtration through sterile filtration membranes, prior to or following lyophilization and reconstitution. 

Therapeutic compositions herein generally are placed into a container having a sterile access port, for
example, an intravenous solution bag or vial having a stopper pierceable by a hypodermic injection needle.

The route of administration is in accord with known methods, e.g. injection or infusion by intravenous,
intraperitoneal, intracerebral, intramuscular, intraocular, intraarterial or intralesional routes, topical
administration, or by sustained release systems.

Dosages and desired drug concentrations of pharmaceutical compositions of the present invention may
vary depending on the particular use envisioned. The determination of the appropriate dosage or route of
administration is well within the skill of an ordinary physician. Animal experiments provide reliable guidance
for the determination of effective doses for human therapy. Interspecies scaling of effective doses can be
performed following the principles laid down by Mordenti, J. and Chappell, W. "The use of interspecies scaling
in toxicokinetics" In Toxicokinetics and New Drug Development, Yacobi et al., Eds., Pergamon Press, New
York 1989, pp. 42-96.

When in vivo administration of a PRO polypeptide or agonist or antagonist thereof is employed, normal
dosage amounts may vary from about 10 ng/kg to up to 100 mg/kg of mammal body weight or more per day,
preferably about 1 μg/kg/day to 10 mg/kg/day, depending upon the route of administration. Guidance as to
particular dosages and methods of delivery is provided in the literature; see, for example, U.S. Pat. Nos.
4,657,760; 5,206,344; or 5,225,212. It is anticipated that different formulations will be effective for different
treatment compounds and different disorders, that administration targeting one organ or tissue, for example, may
necessitate delivery in a manner different from that to another organ or tissue.
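
As a purely arithmetic illustration of the ranges stated above, the following minimal Python sketch converts the
per-kilogram figures into total daily amounts for a given body weight. The 70 kg body weight, the function names
and the unit constants are assumptions made for the example and do not come from this specification; the sketch
is not dosing guidance.

```python
# Illustrative arithmetic only; not dosing guidance.
# Masses are held as integer nanograms to avoid floating-point rounding.
NG_PER_UG = 1_000
NG_PER_MG = 1_000_000

# Ranges quoted in the text, expressed in ng per kg of body weight per day.
NORMAL_RANGE_NG_PER_KG = (10, 100 * NG_PER_MG)               # about 10 ng/kg to 100 mg/kg
PREFERRED_RANGE_NG_PER_KG = (1 * NG_PER_UG, 10 * NG_PER_MG)  # about 1 ug/kg to 10 mg/kg


def daily_dose_range_ng(body_weight_kg, range_ng_per_kg):
    """Scale a (low, high) per-kg range to a total daily amount in ng."""
    low, high = range_ng_per_kg
    return low * body_weight_kg, high * body_weight_kg


def format_mass(ng):
    """Render a mass given in ng using the largest convenient unit."""
    if ng >= NG_PER_MG:
        return f"{ng / NG_PER_MG:g} mg"
    if ng >= NG_PER_UG:
        return f"{ng / NG_PER_UG:g} ug"
    return f"{ng:g} ng"


if __name__ == "__main__":
    weight_kg = 70  # assumed body weight for the example
    for label, rng in (("normal", NORMAL_RANGE_NG_PER_KG),
                       ("preferred", PREFERRED_RANGE_NG_PER_KG)):
        low, high = daily_dose_range_ng(weight_kg, rng)
        print(f"{label}: {format_mass(low)} to {format_mass(high)} per day")
```

Under the assumed 70 kg body weight, the preferred range corresponds to a total of 70 ug to 700 mg per day.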

Where sustained-release administration of a PRO polypeptide is desired in a formulation with release
characteristics suitable for the treatment of any disease or disorder requiring administration of the PRO
polypeptide, microencapsulation of the PRO polypeptide is contemplated. Microencapsulation of recombinant
proteins for sustained release has been successfully performed with human growth hormone (rhGH),
interferon- (rhIFN- ), interleukin-2, and MN rgp120. Johnson et al., Nat. Med., 2:795-799 (1996); Yasuda, Biomed.
Ther., 27:1221-1223 (1993); Hora et al., Bio/Technology, 8:755-758 (1990); Cleland, "Design and Production
of Single Immunization Vaccines Using Polylactide Polyglycolide Microsphere Systems," in Vaccine Design:
The Subunit and Adjuvant Approach, Powell and Newman, eds, (Plenum Press: New York, 1995), pp. 439-462;
WO 97/03692, WO 96/40072, WO 96/07399; and U.S. Pat. No. 5,654,010.

The sustained-release formulations of these proteins were developed using poly-lactic-coglycolic acid
(PLGA) polymer due to its biocompatibility and wide range of biodegradable properties. The degradation
products of PLGA, lactic and glycolic acids, can be cleared quickly within the human body. Moreover, the
degradability of this polymer can be adjusted from months to years depending on its molecular weight and
composition. Lewis, "Controlled release of bioactive agents from lactide/glycolide polymer," in: M. Chasin
and R. Langer (Eds.), Biodegradable Polymers as Drug Delivery Systems (Marcel Dekker: New York, 1990),
pp. 1-41.

This invention encompasses methods of screening compounds to identify those that mimic the PRO
polypeptide (agonists) or prevent the effect of the PRO polypeptide (antagonists). Screening assays for
antagonist drug candidates are designed to identify compounds that bind or complex with the PRO polypeptides
encoded by the genes identified herein, or otherwise interfere with the interaction of the encoded polypeptides
with other cellular proteins. Such screening assays will include assays amenable to high-throughput screening
of chemical libraries, making them particularly suitable for identifying small molecule drug candidates.

The assays can be performed in a variety of formats, including protein-protein binding assays,
biochemical screening assays, immunoassays, and cell-based assays, which are well characterized in the art.

All assays for antagonists are common in that they call for contacting the drug candidate with a PRO
polypeptide encoded by a nucleic acid identified herein under conditions and for a time sufficient to allow these
two components to interact.

In binding assays, the interaction is binding and the complex formed can be isolated or detected in the
reaction mixture. In a particular embodiment, the PRO polypeptide encoded by the gene identified herein or the
drug candidate is immobilized on a solid phase, e.g., on a microtiter plate, by covalent or non-covalent
attachments. Non-covalent attachment generally is accomplished by coating the solid surface with a solution of
the PRO polypeptide and drying. Alternatively, an immobilized antibody, e.g., a monoclonal antibody, specific
for the PRO polypeptide to be immobilized can be used to anchor it to a solid surface. The assay is performed
by adding the non-immobilized component, which may be labeled by a detectable label, to the immobilized
component, e.g., the coated surface containing the anchored component. When the reaction is complete, the
non-reacted components are removed, e.g., by washing, and complexes anchored on the solid surface are
detected. When the originally non-immobilized component carries a detectable label, the detection of label
immobilized on the surface indicates that complexing occurred. Where the originally non-immobilized
component does not carry a label, complexing can be detected, for example, by using a labeled antibody
specifically binding the immobilized complex.

If the candidate compound interacts with but does not bind to a particular PRO polypeptide encoded by
a gene identified herein, its interaction with that polypeptide can be assayed by methods well known for detecting
protein-protein interactions. Such assays include traditional approaches, such as, e.g., cross-linking,
co-immunoprecipitation, and co-purification through gradients or chromatographic columns. In addition,
protein-protein interactions can be monitored by using a yeast-based genetic system described by Fields and co-workers
(Fields and Song, Nature (London), 340:245-246 (1989); Chien et al., Proc. Natl. Acad. Sci. USA, 88:9578-9582
(1991)) as disclosed by Chevray and Nathans, Proc. Natl. Acad. Sci. USA, 89:5789-5793 (1991). Many
transcriptional activators, such as yeast GAL4, consist of two physically discrete modular domains, one acting
as the DNA-binding domain, the other one functioning as the transcription-activation domain. The yeast
expression system described in the foregoing publications (generally referred to as the "two-hybrid system")
takes advantage of this property, and employs two hybrid proteins, one in which the target protein is fused to
the DNA-binding domain of GAL4, and another, in which candidate activating proteins are fused to the
activation domain. The expression of a GAL1-lacZ reporter gene under control of a GAL4-activated promoter
depends on reconstitution of GAL4 activity via protein-protein interaction. Colonies containing interacting
polypeptides are detected with a chromogenic substrate for β-galactosidase. A complete kit
(MATCHMAKER™) for identifying protein-protein interactions between two specific proteins using the
two-hybrid technique is commercially available from Clontech. This system can also be extended to map protein
domains involved in specific protein interactions as well as to pinpoint amino acid residues that are crucial for
these interactions.

Compounds that interfere with the interaction of a gene encoding a PRO polypeptide identified herein
and other intra- or extracellular components can be tested as follows: usually a reaction mixture is prepared
containing the product of the gene and the intra- or extracellular component under conditions and for a time
allowing for the interaction and binding of the two products. To test the ability of a candidate compound to
inhibit binding, the reaction is run in the absence and in the presence of the test compound. In addition, a
placebo may be added to a third reaction mixture, to serve as positive control. The binding (complex formation)
between the test compound and the intra- or extracellular component present in the mixture is monitored as
described hereinabove. The formation of a complex in the control reaction(s) but not in the reaction mixture
containing the test compound indicates that the test compound interferes with the interaction of the test compound
and its reaction partner.

To assay for antagonists, the PRO polypeptide may be added to a cell along with the compound to be
screened for a particular activity and the ability of the compound to inhibit the activity of interest in the presence
of the PRO polypeptide indicates that the compound is an antagonist to the PRO polypeptide. Alternatively,
antagonists may be detected by combining the PRO polypeptide and a potential antagonist with membrane-bound
PRO polypeptide receptors or recombinant receptors under appropriate conditions for a competitive inhibition
assay. The PRO polypeptide can be labeled, such as by radioactivity, such that the number of PRO polypeptide
molecules bound to the receptor can be used to determine the effectiveness of the potential antagonist. The gene
encoding the receptor can be identified by numerous methods known to those of skill in the art, for example,
ligand panning and FACS sorting. Coligan et al., Current Protocols in Immun., 1(2): Chapter 5 (1991).
Preferably, expression cloning is employed wherein polyadenylated RNA is prepared from a cell responsive to
the PRO polypeptide and a cDNA library created from this RNA is divided into pools and used to transfect COS
cells or other cells that are not responsive to the PRO polypeptide. Transfected cells that are grown on glass
slides are exposed to labeled PRO polypeptide. The PRO polypeptide can be labeled by a variety of means
including iodination or inclusion of a recognition site for a site-specific protein kinase. Following fixation and
incubation, the slides are subjected to autoradiographic analysis. Positive pools are identified and sub-pools are
prepared and re-transfected using an interactive sub-pooling and re-screening process, eventually yielding a
single clone that encodes the putative receptor.
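
The sub-pooling and re-screening step can be sketched as a simple narrowing search. In the hypothetical Python
example below, `pool_is_positive` stands in for the whole wet-lab readout (transfection, exposure to labeled PRO
polypeptide, autoradiography), and the split into ten sub-pools per round is an assumption for illustration, not a
value given in the text.

```python
def find_positive_clone(clones, pool_is_positive, n_subpools=10):
    """Narrow a positive pool down to a single clone by repeated sub-pooling.

    clones           -- identifiers of the cDNA clones in the starting positive pool
    pool_is_positive -- callable(list_of_clones) -> bool; hypothetical stand-in
                        for the transfection and autoradiography readout
    n_subpools       -- number of sub-pools per round (assumed value, must be >= 2)
    Returns (clone, rounds_of_screening).
    """
    if n_subpools < 2:
        raise ValueError("n_subpools must be at least 2")
    pool = list(clones)
    rounds = 0
    while len(pool) > 1:
        size = -(-len(pool) // n_subpools)  # ceiling division
        subpools = [pool[i:i + size] for i in range(0, len(pool), size)]
        positives = [sp for sp in subpools if pool_is_positive(sp)]
        if not positives:
            raise ValueError("no positive sub-pool found; re-check the parent pool")
        pool = positives[0]  # follow the first positive sub-pool
        rounds += 1
    return pool[0], rounds


if __name__ == "__main__":
    library_pool = [f"clone_{i}" for i in range(1000)]
    receptor_clone = "clone_637"  # hypothetical clone encoding the receptor
    clone, rounds = find_positive_clone(
        library_pool, lambda subpool: receptor_clone in subpool
    )
    print(clone, rounds)
```

With ten sub-pools per round, a positive pool of 1000 clones is resolved to a single clone in three rounds of
re-screening, which illustrates why pooling is far cheaper than testing each clone individually.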

As an alternative approach for receptor identification, labeled PRO polypeptide can be photoaffinity-linked
with cell membrane or extract preparations that express the receptor molecule. Cross-linked material is
resolved by PAGE and exposed to X-ray film. The labeled complex containing the receptor can be excised,
resolved into peptide fragments, and subjected to protein micro-sequencing. The amino acid sequence obtained
from micro-sequencing would be used to design a set of degenerate oligonucleotide probes to screen a cDNA
library to identify the gene encoding the putative receptor.

In another assay for antagonists, mammalian cells or a membrane preparation expressing the receptor
would be incubated with labeled PRO polypeptide in the presence of the candidate compound. The ability of
the compound to enhance or block this interaction could then be measured.

More specific examples of potential antagonists include an oligonucleotide that binds to the fusions of
immunoglobulin with PRO polypeptide, and, in particular, antibodies including, without limitation, poly- and
monoclonal antibodies and antibody fragments, single-chain antibodies, anti-idiotypic antibodies, and chimeric
or humanized versions of such antibodies or fragments, as well as human antibodies and antibody fragments.
Alternatively, a potential antagonist may be a closely related protein, for example, a mutated form of the PRO
polypeptide that recognizes the receptor but imparts no effect, thereby competitively inhibiting the action of the
PRO polypeptide.

Another potential PRO polypeptide antagonist is an antisense RNA or DNA construct prepared using
antisense technology, where, e.g., an antisense RNA or DNA molecule acts to block directly the translation of
mRNA by hybridizing to targeted mRNA and preventing protein translation. Antisense technology can be used
to control gene expression through triple-helix formation or antisense DNA or RNA, both of which methods are
based on binding of a polynucleotide to DNA or RNA. For example, the 5' coding portion of the polynucleotide
sequence, which encodes the mature PRO polypeptides herein, is used to design an antisense RNA
oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide is designed to be
complementary to a region of the gene involved in transcription (triple helix - see Lee et al., Nucl. Acids Res.,
6:3073 (1979); Cooney et al., Science, 241:456 (1988); Dervan et al., Science, 251:1360 (1991)), thereby
preventing transcription and the production of the PRO polypeptide. The antisense RNA oligonucleotide
hybridizes to the mRNA in vivo and blocks translation of the mRNA molecule into the PRO polypeptide
(antisense - Okano, Neurochem., 56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene
Expression (CRC Press: Boca Raton, FL, 1988)). The oligonucleotides described above can also be delivered
to cells such that the antisense RNA or DNA may be expressed in vivo to inhibit production of the PRO
polypeptide. When antisense DNA is used, oligodeoxyribonucleotides derived from the translation-initiation site,
e.g., between about -10 and +10 positions of the target gene nucleotide sequence, are preferred.
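
The design rule for antisense DNA described above can be sketched in a few lines of Python. The example takes the
window from position -10 to +10 around the start codon (with +1 taken as the A of the start codon and no position
0, which is an assumed numbering convention) and returns its reverse complement as DNA. The mRNA sequence and all
function names are hypothetical; this is an illustrative sketch, not a procedure taken from this specification.

```python
# Complement table for building antisense DNA from an RNA or DNA template.
DNA_COMPLEMENT = str.maketrans("ACGTU", "TGCAA")


def initiation_window(mrna, start_index, upstream=10, downstream=10):
    """Return positions -upstream..+downstream around the start codon.

    start_index is the 0-based index of the A of the start codon (+1 position).
    """
    codon = mrna[start_index:start_index + 3].upper()
    if codon not in ("AUG", "ATG"):
        raise ValueError("start_index does not point at a start codon")
    if start_index < upstream or start_index + downstream > len(mrna):
        raise ValueError("sequence too short for the requested window")
    return mrna[start_index - upstream:start_index + downstream]


def antisense_dna(target):
    """Reverse complement of the target, written 5'->3' as DNA."""
    return target.upper().translate(DNA_COMPLEMENT)[::-1]


if __name__ == "__main__":
    # Hypothetical mRNA fragment, used only to exercise the functions.
    mrna = "GGCACGAGCCGCCACCAUGGCUCUGAAGCUUCUG"
    window = initiation_window(mrna, mrna.index("AUG"))
    print(window)                  # 20-nt target region around the start codon
    print(antisense_dna(window))   # candidate antisense oligodeoxyribonucleotide
```

The same `antisense_dna` helper can be applied to any 10 to 40 nucleotide stretch of the 5' coding portion; whether
a given oligonucleotide actually blocks translation has to be determined experimentally.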

Potential antagonists include small molecules that bind to the active site, the receptor binding site, or
growth factor or other relevant binding site of the PRO polypeptide, thereby blocking the normal biological
activity of the PRO polypeptide. Examples of small molecules include, but are not limited to, small peptides
or peptide-like molecules, preferably soluble peptides, and synthetic non-peptidyl organic or inorganic
compounds.

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA.
Ribozymes act by sequence-specific hybridization to the complementary target RNA, followed by
endonucleolytic cleavage. Specific ribozyme cleavage sites within a potential RNA target can be identified by
known techniques. For further details see, e.g., Rossi, Current Biology, 4:469-471 (1994), and PCT publication
No. WO 97/33551 (published September 18, 1997).

Nucleic acid molecules in triple-helix formation used to inhibit transcription should be single-stranded
and composed of deoxynucleotides. The base composition of these oligonucleotides is designed such that it
promotes triple-helix formation via Hoogsteen base-pairing rules, which generally require sizeable stretches of
purines or pyrimidines on one strand of a duplex. For further details see, e.g., PCT publication No. WO
97/33551, supra.

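The requirement for sizeable stretches of purines or pyrimidines on one strand, noted above for triple-helix
formation, lends itself to a simple sequence scan. The Python sketch below lists homopurine and homopyrimidine
runs in a DNA strand. The minimum run length of 15 and the example sequence are assumptions chosen for
illustration; the text does not specify a length.

```python
import re


def homopolymer_tracts(seq, min_len=15):
    """Find runs made only of purines (A/G) or only of pyrimidines (C/T).

    Returns a list of (start, end, tract) tuples with 0-based, end-exclusive
    coordinates. min_len is an assumed threshold, not a value from the text.
    """
    pattern = re.compile(rf"[AG]{{{min_len},}}|[CT]{{{min_len},}}")
    return [(m.start(), m.end(), m.group()) for m in pattern.finditer(seq.upper())]


if __name__ == "__main__":
    # Hypothetical strand: a 16-nt purine run and an 8-nt pyrimidine run.
    strand = "ACGT" + "AGGAGAAGGAGGAAGA" + "CATG" + "CTTCTCCT"
    for start, end, tract in homopolymer_tracts(strand, min_len=8):
        print(start, end, tract)
```

Tracts found this way are only candidate target sites; suitability for triple-helix formation depends on further
factors that this scan does not model.
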
These small molecules can be identified by any one or more of the screening assays discussed
hereinabove and/or by any other screening techniques well known for those skilled in the art.

Diagnostic and therapeutic uses of the herein disclosed molecules may also be based upon the positive 
functional assay hits disclosed and described below. 

F. Anti-PRO Antibodies
The present invention further provides anti-PRO antibodies. Exemplary antibodies include polyclonal,
monoclonal, humanized, bispecific, and heteroconjugate antibodies.

1. Polyclonal Antibodies
The anti-PRO antibodies may comprise polyclonal antibodies. Methods of preparing polyclonal
antibodies are known to the skilled artisan. Polyclonal antibodies can be raised in a mammal, for example, by
one or more injections of an immunizing agent and, if desired, an adjuvant. Typically, the immunizing agent
and/or adjuvant will be injected in the mammal by multiple subcutaneous or intraperitoneal injections. The
immunizing agent may include the PRO polypeptide or a fusion protein thereof. It may be useful to conjugate
the immunizing agent to a protein known to be immunogenic in the mammal being immunized. Examples of
such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, bovine
thyroglobulin, and soybean trypsin inhibitor. Examples of adjuvants which may be employed include Freund's
complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose dicorynomycolate).
The immunization protocol may be selected by one skilled in the art without undue experimentation.

2. Monoclonal Antibodies 

The anti-PRO antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies may be
prepared using hybridoma methods, such as those described by Kohler and Milstein, Nature, 256:495 (1975).
In a hybridoma method, a mouse, hamster, or other appropriate host animal, is typically immunized with an
immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that will specifically
bind to the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro.

The immunizing agent will typically include the PRO polypeptide or a fusion protein thereof.
Generally, either peripheral blood lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen
cells or lymph node cells are used if non-human mammalian sources are desired. The lymphocytes are then
fused with an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a
hybridoma cell [Goding, Monoclonal Antibodies: Principles and Practice, Academic Press, (1986) pp. 59-103].
Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells of rodent, bovine
and human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells may be
cultured in a suitable culture medium that preferably contains one or more substances that inhibit the growth or
survival of the unfused, immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine
guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will
include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances prevent the growth of
HGPRT-deficient cells.

Preferred immortalized cell lines are those that fuse efficiently, support stable high level expression of
antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. More
preferred immortalized cell lines are murine myeloma lines, which can be obtained, for instance, from the Salk
Institute Cell Distribution Center, San Diego, California and the American Type Culture Collection, Manassas,
Virginia. Human myeloma and mouse-human heteromyeloma cell lines also have been described for the
production of human monoclonal antibodies [Kozbor, J. Immunol., 133:3001 (1984); Brodeur et al., Monoclonal
Antibody Production Techniques and Applications, Marcel Dekker, Inc., New York, (1987) pp. 51-63].

The culture medium in which the hybridoma cells are cultured can then be assayed for the presence of
monoclonal antibodies directed against PRO. Preferably, the binding specificity of monoclonal antibodies
produced by the hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such as
radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are
known in the art. The binding affinity of the monoclonal antibody can, for example, be determined by the
Scatchard analysis of Munson and Pollard, Anal. Biochem., 107:220 (1980).
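
For orientation, a classical linear Scatchard analysis plots bound/free ligand against bound ligand; for a single
class of binding sites the slope is -1/Kd and the x-intercept is the total site concentration (Bmax). The Python
sketch below fits that line by ordinary least squares. It is a minimal illustration under the single-site
assumption, with synthetic data and hypothetical function names, and is not necessarily the exact procedure of the
cited reference.

```python
def scatchard_fit(bound, free):
    """Least-squares line through (bound, bound/free); returns (Kd, Bmax).

    bound, free -- paired equilibrium concentrations in the same units.
    Assumes a single class of non-interacting binding sites.
    """
    if len(bound) != len(free) or len(bound) < 2:
        raise ValueError("need at least two paired measurements")
    ratios = [b / f for b, f in zip(bound, free)]
    n = len(bound)
    mean_x = sum(bound) / n
    mean_y = sum(ratios) / n
    sxx = sum((x - mean_x) ** 2 for x in bound)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(bound, ratios))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope          # slope of a Scatchard plot is -1/Kd
    bmax = -intercept / slope  # x-intercept of the line is Bmax
    return kd, bmax


def simulate_binding(free, kd, bmax):
    """Synthetic single-site binding data: B = Bmax * F / (Kd + F)."""
    return [bmax * f / (kd + f) for f in free]


if __name__ == "__main__":
    free = [0.5, 1.0, 2.0, 4.0, 8.0]                   # assumed concentrations, nM
    bound = simulate_binding(free, kd=2.0, bmax=10.0)  # noise-free test data
    kd, bmax = scatchard_fit(bound, free)
    print(f"Kd = {kd:.3f} nM, Bmax = {bmax:.3f} nM")
```

Real binding data carry measurement error in both axes of the Scatchard plot, so a direct nonlinear fit of the
binding curve is often preferred in practice; the linear form is shown here only because it is the one named in
the text.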

After the desired hybridoma cells are identified, the clones may be subcloned by limiting dilution
procedures and grown by standard methods [Goding, supra]. Suitable culture media for this purpose include,
for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. Alternatively, the hybridoma cells
may be grown in vivo as ascites in a mammal.

The monoclonal antibodies secreted by the subclones may be isolated or purified from the culture
medium or ascites fluid by conventional immunoglobulin purification procedures such as, for example, protein
A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography.

The monoclonal antibodies may also be made by recombinant DNA methods, such as those described
in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the invention can be readily isolated
and sequenced using conventional procedures (e.g., by using oligonucleotide probes that are capable of binding
specifically to genes encoding the heavy and light chains of murine antibodies). The hybridoma cells of the
invention serve as a preferred source of such DNA. Once isolated, the DNA may be placed into expression
vectors, which are then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells,
or myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal
antibodies in the recombinant host cells. The DNA also may be modified, for example, by substituting the
coding sequence for human heavy and light chain constant domains in place of the homologous murine sequences
(U.S. Patent No. 4,816,567; Morrison et al., supra) or by covalently joining to the immunoglobulin coding
sequence all or part of the coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin
polypeptide can be substituted for the constant domains of an antibody of the invention, or can be substituted for
the variable domains of one antigen-combining site of an antibody of the invention to create a chimeric bivalent
antibody.

The antibodies may be monovalent antibodies. Methods for preparing monovalent antibodies are well
known in the art. For example, one method involves recombinant expression of immunoglobulin light chain and
modified heavy chain. The heavy chain is truncated generally at any point in the Fc region so as to prevent
heavy chain crosslinking. Alternatively, the relevant cysteine residues are substituted with another amino acid
residue or are deleted so as to prevent crosslinking.

In vitro methods are also suitable for preparing monovalent antibodies. Digestion of antibodies to
produce fragments thereof, particularly, Fab fragments, can be accomplished using routine techniques known
in the art.



3. Human and Humanized Antibodies
The anti-PRO antibodies of the invention may further comprise humanized antibodies or human
antibodies. Humanized forms of non-human (e.g., murine) antibodies are chimeric immunoglobulins,
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2 or other antigen-binding
subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin.
Humanized antibodies include human immunoglobulins (recipient antibody) in which residues from a
complementary determining region (CDR) of the recipient are replaced by residues from a CDR of a non-human
species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and capacity. In
some instances, Fv framework residues of the human immunoglobulin are replaced by corresponding non-human
residues. Humanized antibodies may also comprise residues which are found neither in the recipient antibody
nor in the imported CDR or framework sequences. In general, the humanized antibody will comprise
substantially all of at least one, and typically two, variable domains, in which all or substantially all of the CDR
regions correspond to those of a non-human immunoglobulin and all or substantially all of the FR regions are
those of a human immunoglobulin consensus sequence. The humanized antibody optimally also will comprise
at least a portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin [Jones
et al., Nature, 321:522-525 (1986); Riechmann et al., Nature, 332:323-329 (1988); and Presta, Curr. Op.
Struct. Biol., 2:593-596 (1992)].

Methods for humanizing non-human antibodies are well known in the art. Generally, a humanized
antibody has one or more amino acid residues introduced into it from a source which is non-human. These
non-human amino acid residues are often referred to as "import" residues, which are typically taken from an "import"
variable domain. Humanization can be essentially performed following the method of Winter and co-workers
[Jones et al., Nature, 321:522-525 (1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et al.,
Science, 239:1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the corresponding
sequences of a human antibody. Accordingly, such "humanized" antibodies are chimeric antibodies (U.S. Patent
No. 4,816,567), wherein substantially less than an intact human variable domain has been substituted by the
corresponding sequence from a non-human species. In practice, humanized antibodies are typically human
antibodies in which some CDR residues and possibly some FR residues are substituted by residues from
analogous sites in rodent antibodies.

Human antibodies can also be produced using various techniques known in the art, including phage
display libraries [Hoogenboom and Winter, J. Mol. Biol., 227:381 (1991); Marks et al., J. Mol. Biol., 222:581
(1991)]. The techniques of Cole et al. and Boerner et al. are also available for the preparation of human
monoclonal antibodies [Cole et al., Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, p. 77 (1985) and
Boerner et al., J. Immunol., 147(1):86-95 (1991)]. Similarly, human antibodies can be made by introducing
human immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous immunoglobulin
genes have been partially or completely inactivated. Upon challenge, human antibody production is observed,
which closely resembles that seen in humans in all respects, including gene rearrangement, assembly, and
antibody repertoire. This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806;
5,569,825; 5,625,126; 5,633,425; 5,661,016, and in the following scientific publications: Marks et al.,
Bio/Technology 10, 779-783 (1992); Lonberg et al., Nature 368 856-859 (1994); Morrison, Nature 368, 812-13
(1994); Fishwild et al., Nature Biotechnology 14, 845-51 (1996); Neuberger, Nature Biotechnology 14, 826
(1996); Lonberg and Huszar, Intern. Rev. Immunol. 13 65-93 (1995).

The antibodies may also be affinity matured using known selection and/or mutagenesis methods as
described above. Preferred affinity matured antibodies have an affinity which is five times, more preferably 10
times, even more preferably 20 or 30 times greater than the starting antibody (generally murine, humanized or
human) from which the matured antibody is prepared.

4. Bispecific Antibodies 
Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have binding
specificities for at least two different antigens. In the present case, one of the binding specificities is for the
PRO, the other one is for any other antigen, and preferably for a cell-surface protein or receptor or receptor
subunit.

Methods for making bispecific antibodies are known in the art. Traditionally, the recombinant
production of bispecific antibodies is based on the co-expression of two immunoglobulin heavy-chain/light-chain
pairs, where the two heavy chains have different specificities [Milstein and Cuello, Nature, 305:537-539 (1983)].
Because of the random assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas)
produce a potential mixture of ten different antibody molecules, of which only one has the correct bispecific
structure. The purification of the correct molecule is usually accomplished by affinity chromatography steps.
Similar procedures are disclosed in WO 93/08829, published 13 May 1993, and in Traunecker et al., EMBO
J., 10:3655-3659 (1991).
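
The figure of ten molecules can be checked by simple enumeration. The following is a minimal sketch in Python, assuming two heavy chains and two light chains that pair at random; the labels H1, H2, L1 and L2 are hypothetical and do not appear in the specification.

```python
from itertools import combinations_with_replacement, product

# Hypothetical labels: two heavy chains and two light chains expressed in one quadroma.
heavy_chains = ("H1", "H2")
light_chains = ("L1", "L2")

# Each antibody arm is one heavy chain paired with one light chain (4 possible arms).
arms = [h + l for h, l in product(heavy_chains, light_chains)]

# A complete antibody is an unordered pair of arms; the two arms may be identical.
molecules = list(combinations_with_replacement(arms, 2))

# The desired bispecific molecule pairs each heavy chain with its cognate light chain.
bispecific = ("H1L1", "H2L2")

print(len(molecules))               # number of distinct molecules
print(molecules.count(bispecific))  # how many of them are the desired product
```

Under these assumptions the enumeration yields ten distinct molecules, exactly one of which carries both correctly paired arms.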

Antibody variable domains with the desired binding specificities (antibody-antigen combining sites) can
be fused to immunoglobulin constant domain sequences. The fusion preferably is with an immunoglobulin
heavy-chain constant domain, comprising at least part of the hinge, CH2, and CH3 regions. It is preferred to
have the first heavy-chain constant region (CH1) containing the site necessary for light-chain binding present in
at least one of the fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the
immunoglobulin light chain, are inserted into separate expression vectors, and are co-transfected into a suitable
host organism. For further details of generating bispecific antibodies see, for example, Suresh et al., Methods
in Enzymology, 121:210 (1986).

According to another approach described in WO 96/27011, the interface between a pair of antibody
molecules can be engineered to maximize the percentage of heterodimers which are recovered from recombinant
cell culture. The preferred interface comprises at least a part of the CH3 region of an antibody constant domain.
In this method, one or more small amino acid side chains from the interface of the first antibody molecule are
replaced with larger side chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of identical or similar
size to the large side chain(s) are created on the interface of the second antibody molecule by replacing large
amino acid side chains with smaller ones (e.g. alanine or threonine). This provides a mechanism for increasing
the yield of the heterodimer over other unwanted end-products such as homodimers.

Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g. F(ab')2
bispecific antibodies). Techniques for generating bispecific antibodies from antibody fragments have been
described in the literature. For example, bispecific antibodies can be prepared using chemical
linkage. Brennan et al., Science 229: 81 (1985) describe a procedure wherein intact antibodies are proteolytically
cleaved to generate F(ab')2 fragments. These fragments are reduced in the presence of the dithiol complexing
agent sodium arsenite to stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab'
fragments generated are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB
derivatives is then reconverted to the Fab'-thiol by reduction with mercaptoethylamine and is mixed with an
equimolar amount of the other Fab'-TNB derivative to form the bispecific antibody. The bispecific antibodies
produced can be used as agents for the selective immobilization of enzymes.

Fab' fragments may be directly recovered from E. coli and chemically coupled to form bispecific
antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992) describe the production of a fully humanized
bispecific antibody F(ab')2 molecule. Each Fab' fragment was separately secreted from E. coli and subjected
to directed chemical coupling in vitro to form the bispecific antibody. The bispecific antibody thus formed was
able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic
activity of human cytotoxic lymphocytes against human breast tumor targets.

Various techniques for making and isolating bispecific antibody fragments directly from recombinant cell
culture have also been described. For example, bispecific antibodies have been produced using leucine zippers.
Kostelny et al., J. Immunol. 148(5):1547-1553 (1992). The leucine zipper peptides from the Fos and Jun
proteins were linked to the Fab' portions of two different antibodies by gene fusion. The antibody homodimers
were reduced at the hinge region to form monomers and then re-oxidized to form the antibody heterodimers.
This method can also be utilized for the production of antibody homodimers. The "diabody" technology
described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an alternative
mechanism for making bispecific antibody fragments. The fragments comprise a heavy-chain variable domain
(VH) connected to a light-chain variable domain (VL) by a linker which is too short to allow pairing between the
two domains on the same chain. Accordingly, the VH and VL domains of one fragment are forced to pair with
the complementary VL and VH domains of another fragment, thereby forming two antigen-binding sites. Another
strategy for making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been
reported. See Gruber et al., J. Immunol. 152:5368 (1994).

Antibodies with more than two valencies are contemplated. For example, trispecific antibodies can be prepared.
Tutt et al., J. Immunol. 147:60 (1991).

Exemplary bispecific antibodies may bind to two different epitopes on a given PRO polypeptide herein.
Alternatively, an anti-PRO polypeptide arm may be combined with an arm which binds to a triggering molecule
on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, or B7), or Fc receptors for IgG
(FcγR), such as FcγRI (CD64), FcγRII (CD32) and FcγRIII (CD16) so as to focus cellular defense mechanisms
to the cell expressing the particular PRO polypeptide. Bispecific antibodies may also be used to localize
cytotoxic agents to cells which express a particular PRO polypeptide. These antibodies possess a PRO-binding
arm and an arm which binds a cytotoxic agent or a radionuclide chelator, such as EOTUBE, DTPA, DOTA,
or TETA. Another bispecific antibody of interest binds the PRO polypeptide and further binds tissue factor
(TF).



5. Heteroconjugate Antibodies
Heteroconjugate antibodies are also within the scope of the present invention. Heteroconjugate
antibodies are composed of two covalently joined antibodies. Such antibodies have, for example, been proposed
to target immune system cells to unwanted cells [U.S. Patent No. 4,676,980], and for treatment of HIV infection
[WO 91/00360; WO 92/200373; EP 03089]. It is contemplated that the antibodies may be prepared in vitro
using known methods in synthetic protein chemistry, including those involving crosslinking agents. For
example, immunotoxins may be constructed using a disulfide exchange reaction or by forming a thioether bond.
Examples of suitable reagents for this purpose include iminothiolate and methyl-4-mercaptobutyrimidate and
those disclosed, for example, in U.S. Patent No. 4,676,980.

6. Effector Function Engineering
It may be desirable to modify the antibody of the invention with respect to effector function, so as to
enhance, e.g., the effectiveness of the antibody in treating cancer. For example, cysteine residue(s) may be
introduced into the Fc region, thereby allowing interchain disulfide bond formation in this region. The
homodimeric antibody thus generated may have improved internalization capability and/or increased
complement-mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron et al.,
J. Exp Med., 176: 1191-1195 (1992) and Shopes, J. Immunol., 148: 2918-2922 (1992). Homodimeric antibodies
with enhanced anti-tumor activity may also be prepared using heterobifunctional cross-linkers as described in
Wolff et al., Cancer Research, 53: 2560-2565 (1993). Alternatively, an antibody can be engineered that has dual
Fc regions and may thereby have enhanced complement lysis and ADCC capabilities. See Stevenson et al.,
Anti-Cancer Drug Design, 3: 219-230 (1989).

7. Immunoconjugates
The invention also pertains to immunoconjugates comprising an antibody conjugated to a cytotoxic agent
such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin of bacterial, fungal, plant, or animal
origin, or fragments thereof), or a radioactive isotope (i.e., a radioconjugate).

Chemotherapeutic agents useful in the generation of such immunoconjugates have been described above.
Enzymatically active toxins and fragments thereof that can be used include diphtheria A chain, nonbinding active
fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain,
modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolacca americana proteins
(PAPI, PAPII, and PAP-S), Momordica charantia inhibitor, curcin, crotin, Saponaria officinalis inhibitor,
gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of radionuclides are
available for the production of radioconjugated antibodies. Examples include 212Bi, 131In, 90Y, and 186Re.

Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional protein-coupling
agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), iminothiolane (IT), bifunctional
derivatives of imidoesters (such as dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate),
aldehydes (such as glutaraldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine),
bis-diazonium derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene
2,6-diisocyanate), and bis-active fluorine compounds (such as 1,5-difluoro-2,4-dinitrobenzene). For example, a
ricin immunotoxin can be prepared as described in Vitetta et al., Science, 238: 1098 (1987). Carbon-14-labeled
1-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent
for conjugation of radionucleotide to the antibody. See WO94/11026.

In another embodiment, the antibody may be conjugated to a "receptor" (such as streptavidin) for
utilization in tumor pretargeting wherein the antibody-receptor conjugate is administered to the patient, followed
by removal of unbound conjugate from the circulation using a clearing agent and then administration of a
"ligand" (e.g., avidin) that is conjugated to a cytotoxic agent (e.g., a radionucleotide).



8. Immunoliposomes 
The antibodies disclosed herein may also be formulated as immunoliposomes. Liposomes containing
the antibody are prepared by methods known in the art, such as described in Epstein et al., Proc. Natl. Acad.
Sci. USA, 82: 3688 (1985); Hwang et al., Proc. Natl Acad. Sci. USA, 77: 4030 (1980); and U.S. Pat. Nos.
4,485,045 and 4,544,545. Liposomes with enhanced circulation time are disclosed in U.S. Patent No.
5,013,556.

Particularly useful liposomes can be generated by the reverse-phase evaporation method with a lipid
composition comprising phosphatidylcholine, cholesterol, and PEG-derivatized phosphatidylethanolamine
(PEG-PE). Liposomes are extruded through filters of defined pore size to yield liposomes with the desired
diameter. Fab' fragments of the antibody of the present invention can be conjugated to the liposomes as
described in Martin et al., J. Biol. Chem., 257: 286-288 (1982) via a disulfide-interchange reaction. A
chemotherapeutic agent (such as Doxorubicin) is optionally contained within the liposome. See Gabizon et al.,
J. National Cancer Inst., 81(19): 1484 (1989).



9. Pharmaceutical Compositions of Antibodies

Antibodies specifically binding a PRO polypeptide identified herein, as well as other molecules 
identified by the screening assays disclosed hereinbefore, can be administered for the treatment of various 
disorders in the form of pharmaceutical compositions. 

If the PRO polypeptide is intracellular and whole antibodies are used as inhibitors, internalizing
antibodies are preferred. However, lipofections or liposomes can also be used to deliver the antibody, or an
antibody fragment, into cells. Where antibody fragments are used, the smallest inhibitory fragment that
specifically binds to the binding domain of the target protein is preferred. For example, based upon the
variable-region sequences of an antibody, peptide molecules can be designed that retain the ability to bind the
target protein sequence. Such peptides can be synthesized chemically and/or produced by recombinant DNA
technology. See, e.g., Marasco et al., Proc. Natl. Acad. Sci. USA, 90: 7889-7893 (1993). The formulation
herein may also contain more than one active compound as necessary for the particular indication being treated,
preferably those with complementary activities that do not adversely affect each other. Alternatively, or in
addition, the composition may comprise an agent that enhances its function, such as, for example, a cytotoxic
agent, cytokine, chemotherapeutic agent, or growth-inhibitory agent. Such molecules are suitably present in
combination in amounts that are effective for the purpose intended.

The active ingredients may also be entrapped in microcapsules prepared, for example, by coacervation
techniques or by interfacial polymerization, for example, hydroxymethylcellulose or gelatin-microcapsules and
poly-(methylmethacrylate) microcapsules, respectively, in colloidal drug delivery systems (for example,
liposomes, albumin microspheres, microemulsions, nano-particles, and nanocapsules) or in macroemulsions.
Such techniques are disclosed in Remington's Pharmaceutical Sciences, supra.

The formulations to be used for in vivo administration must be sterile. This is readily accomplished by 
filtration through sterile filtration membranes. 

Sustained-release preparations may be prepared. Suitable examples of sustained-release preparations
include semipermeable matrices of solid hydrophobic polymers containing the antibody, which matrices are in
the form of shaped articles, e.g., films, or microcapsules. Examples of sustained-release matrices include
polyesters, hydrogels (for example, poly(2-hydroxyethyl-methacrylate), or poly(vinylalcohol)), polylactides (U.S.
Pat. No. 3,773,919), copolymers of L-glutamic acid and γ-ethyl-L-glutamate, non-degradable ethylene-vinyl
acetate, degradable lactic acid-glycolic acid copolymers such as the LUPRON DEPOT™ (injectable
microspheres composed of lactic acid-glycolic acid copolymer and leuprolide acetate), and
poly-D-(-)-3-hydroxybutyric acid. While polymers such as ethylene-vinyl acetate and lactic acid-glycolic acid
enable release of molecules for over 100 days, certain hydrogels release proteins for shorter time periods. When
encapsulated antibodies remain in the body for a long time, they may denature or aggregate as a result of
exposure to moisture at 37°C, resulting in a loss of biological activity and possible changes in immunogenicity.
Rational strategies can be devised for stabilization depending on the mechanism involved. For example, if the
aggregation mechanism is discovered to be intermolecular S-S bond formation through thio-disulfide interchange,
stabilization may be achieved by modifying sulfhydryl residues, lyophilizing from acidic solutions, controlling
moisture content, using appropriate additives, and developing specific polymer matrix compositions.

G. Uses for anti-PRO Antibodies

The anti-PRO antibodies of the invention have various utilities. For example, anti-PRO antibodies may
be used in diagnostic assays for PRO, e.g., detecting its expression (and in some cases, differential expression)
in specific cells, tissues, or serum. Various diagnostic assay techniques known in the art may be used, such as
competitive binding assays, direct or indirect sandwich assays and immunoprecipitation assays conducted in
either heterogeneous or homogeneous phases [Zola, Monoclonal Antibodies: A Manual of Techniques, CRC
Press, Inc. (1987) pp. 147-158]. The antibodies used in the diagnostic assays can be labeled with a detectable
moiety. The detectable moiety should be capable of producing, either directly or indirectly, a detectable signal.
For example, the detectable moiety may be a radioisotope, such as 3H, 14C, 32P, 35S, or 125I, a fluorescent or
chemiluminescent compound, such as fluorescein isothiocyanate, rhodamine, or luciferin, or an enzyme, such
as alkaline phosphatase, beta-galactosidase or horseradish peroxidase. Any method known in the art for
conjugating the antibody to the detectable moiety may be employed, including those methods described by Hunter
et al., Nature, 144:945 (1962); David et al., Biochemistry, 13:1014 (1974); Pain et al., J. Immunol. Meth.,
40:219 (1981); and Nygren, J. Histochem. and Cytochem., 30:407 (1982).

Anti-PRO antibodies also are useful for the affinity purification of PRO from recombinant cell culture
or natural sources. In this process, the antibodies against PRO are immobilized on a suitable support, such as a
Sephadex resin or filter paper, using methods well known in the art. The immobilized antibody then is contacted
with a sample containing the PRO to be purified, and thereafter the support is washed with a suitable solvent that
will remove substantially all the material in the sample except the PRO, which is bound to the immobilized
antibody. Finally, the support is washed with another suitable solvent that will release the PRO from the
antibody.

The following examples are offered for illustrative purposes only, and are not intended to limit the scope
of the present invention in any way.

All patent and literature references cited in the present specification are hereby incorporated by reference
in their entirety.



EXAMPLES 

Commercially available reagents referred to in the examples were used according to manufacturer's
instructions unless otherwise indicated. The source of those cells identified in the following examples, and
throughout the specification, by ATCC accession numbers is the American Type Culture Collection, Manassas,
VA.



EXAMPLE 1: Extracellular Domain Homology Screening to Identify Novel Polypeptides and cDNA Encoding
Therefor

The extracellular domain (ECD) sequences (including the secretion signal sequence, if any) from about
950 known secreted proteins from the Swiss-Prot public database were used to search EST databases. The EST
databases included public databases (e.g., Dayhoff, GenBank), and proprietary databases (e.g. LIFESEQ™,
Incyte Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program BLAST or
BLAST-2 (Altschul et al., Methods in Enzymology 266:460-480 (1996)) as a comparison of the ECD protein
sequences to a 6 frame translation of the EST sequences. Those comparisons with a BLAST score of 70 (or in
some cases 90) or greater that did not encode known proteins were clustered and assembled into consensus DNA
sequences with the program "phrap" (Phil Green, University of Washington, Seattle, WA).
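
The 6 frame translation and score cutoff mentioned above can be illustrated with a minimal Python sketch. It is not the BLAST implementation: it assumes the standard genetic code, and the function names and the `cutoff` parameter are hypothetical names chosen for illustration.

```python
# Standard genetic code, laid out in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate complete codons only; codons with ambiguous bases become 'X'."""
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3))


def six_frame_translation(dna):
    """Map frame number (+1, +2, +3, -1, -2, -3) to the translated protein string."""
    dna = dna.upper()
    rc = reverse_complement(dna)
    frames = {}
    for offset in range(3):
        frames[offset + 1] = translate(dna[offset:])
        frames[-(offset + 1)] = translate(rc[offset:])
    return frames


def passes_score_cutoff(score, cutoff=70):
    """Keep hits scoring at or above the cutoff (70, or 90 in some cases, per the text)."""
    return score >= cutoff


frames = six_frame_translation("ATGGCCTAA")
print(frames[1], frames[-1])
```

Each ECD protein query would then be compared against all six translated frames of every EST, and only hits passing the cutoff would go forward to clustering.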

Using this extracellular domain homology screen, consensus DNA sequences were assembled relative
to the other identified EST sequences using phrap. In addition, the consensus DNA sequences obtained were
often (but not always) extended using repeated cycles of BLAST or BLAST-2 and phrap to extend the consensus
sequence as far as possible using the sources of EST sequences discussed above.

Based upon the consensus sequences obtained as described above, oligonucleotides were then
synthesized and used to identify by PCR a cDNA library that contained the sequence of interest and for use as
probes to isolate a clone of the full-length coding sequence for a PRO polypeptide. Forward and reverse PCR
primers generally range from 20 to 30 nucleotides and are often designed to give a PCR product of about
100-1000 bp in length. The probe sequences are typically 40-55 bp in length. In some cases, additional
oligonucleotides are synthesized when the consensus sequence is greater than about 1-1.5 kbp. In order to screen
several libraries for a full-length clone, DNA from the libraries was screened by PCR amplification, as per
Ausubel et al., Current Protocols in Molecular Biology, with the PCR primer pair. A positive library was then
used to isolate clones encoding the gene of interest using the probe oligonucleotide and one of the primer pairs.
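
The size guidelines given above for primers, PCR products and probes can be captured in a small checking routine. This is a sketch only: the function name and the example sequences are hypothetical, and because the text describes the ranges as general guidelines the routine reports warnings instead of rejecting a design.

```python
def check_oligo_design(forward_primer, reverse_primer, product_bp, probe):
    """Return a list of warnings for values outside the size ranges stated in the text."""
    warnings = []
    for name, primer in (("forward primer", forward_primer), ("reverse primer", reverse_primer)):
        if not 20 <= len(primer) <= 30:
            warnings.append(f"{name} is {len(primer)} nt; 20-30 nt is the general range")
    if not 100 <= product_bp <= 1000:
        warnings.append(f"PCR product is {product_bp} bp; about 100-1000 bp is the usual target")
    if not 40 <= len(probe) <= 55:
        warnings.append(f"probe is {len(probe)} bp; 40-55 bp is typical")
    return warnings


# Hypothetical placeholder sequences, used only to exercise the length checks.
print(check_oligo_design("ACGT" * 6, "TGCA" * 6, 450, "ACGTA" * 9))
print(check_oligo_design("ACGT" * 4, "TGCA" * 6, 1500, "ACGTA" * 9))
```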

The cDNA libraries used to isolate the cDNA clones were constructed by standard methods using
commercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with
oligo dT containing a NotI site, linked with blunt to SalI hemikinased adaptors, cleaved with NotI, sized
appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as
pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the SfiI site; see, Holmes et al., Science,
253: 1278-1280 (1991)) in the unique XhoI and NotI sites.

EXAMPLE 2: Isolation of cDNA clones by Amylase Screening

1. Preparation of oligo dT primed cDNA library

mRNA was isolated from a human tissue of interest using reagents and protocols from Invitrogen, San
Diego, CA (Fast Track 2). This RNA was used to generate an oligo dT primed cDNA library in the vector
pRK5D using reagents and protocols from Life Technologies, Gaithersburg, MD (Super Script Plasmid System).
In this procedure, the double stranded cDNA was sized to greater than 1000 bp and the SalI/NotI linkered cDNA
was cloned into XhoI/NotI cleaved vector. pRK5D is a cloning vector that has an sp6 transcription initiation
site followed by an SfiI restriction enzyme site preceding the XhoI/NotI cDNA cloning sites.

2. Preparation of random primed cDNA library 

A secondary cDNA library was generated in order to preferentially represent the 5' ends of the primary
cDNA clones. Sp6 RNA was generated from the primary library (described above), and this RNA was used
to generate a random primed cDNA library in the vector pSST-AMY.0 using reagents and protocols from Life
Technologies (Super Script Plasmid System, referenced above). In this procedure the double stranded cDNA
was sized to 500-1000 bp, linkered with blunt to NotI adaptors, cleaved with SfiI, and cloned into SfiI/NotI
cleaved vector. pSST-AMY.0 is a cloning vector that has a yeast alcohol dehydrogenase promoter preceding
the cDNA cloning sites and the mouse amylase sequence (the mature sequence without the secretion signal)
followed by the yeast alcohol dehydrogenase terminator, after the cloning sites. Thus, cDNAs cloned into this
vector that are fused in frame with amylase sequence will lead to the secretion of amylase from appropriately
transfected yeast colonies.

3. Transformation and Detection

DNA from the library described in paragraph 2 above was chilled on ice to which was added
electrocompetent DH10B bacteria (Life Technologies, 20 ml). The bacteria and vector mixture was then
electroporated as recommended by the manufacturer. Subsequently, SOC media (Life Technologies, 1 ml) was
added and the mixture was incubated at 37°C for 30 minutes. The transformants were then plated onto 20
standard 150 mm LB plates containing ampicillin and incubated for 16 hours (37°C). Positive colonies were
scraped off the plates and the DNA was isolated from the bacterial pellet using standard protocols, e.g.
CsCl-gradient. The purified DNA was then carried on to the yeast protocols below.

The yeast methods were divided into three categories: (1) Transformation of yeast with the
plasmid/cDNA combined vector; (2) Detection and isolation of yeast clones secreting amylase; and (3) PCR
amplification of the insert directly from the yeast colony and purification of the DNA for sequencing and further
analysis.

The yeast strain used was HD56-5A (ATCC-90785). This strain has the following genotype: MAT
alpha, ura3-52, leu2-3, leu2-112, his3-11, his3-15, MAL+, SUC+, GAL+. Preferably, yeast mutants can be
employed that have deficient post-translational pathways. Such mutants may have translocation deficient alleles
in sec71, sec72, sec62, with truncated sec71 being most preferred. Alternatively, antagonists (including
antisense nucleotides and/or ligands) which interfere with the normal operation of these genes, other proteins
implicated in this post translation pathway (e.g., SEC61p, SEC72p, SEC62p, SEC63p, TDJ1p or SSA1p-4p)
or the complex formation of these proteins may also be preferably employed in combination with the
amylase-expressing yeast.

Transformation was performed based on the protocol outlined by Gietz et al., Nucl. Acid. Res.,
20:1425 (1992). Transformed cells were then inoculated from agar into YEPD complex media broth (100 ml)
and grown overnight at 30°C. The YEPD broth was prepared as described in Kaiser et al., Methods in Yeast
Genetics, Cold Spring Harbor Press, Cold Spring Harbor, NY, p. 207 (1994). The overnight culture was then
diluted to about 2 x 10^6 cells/ml (approx. OD600=0.1) into fresh YEPD broth (500 ml) and regrown to 1 x 10^7
cells/ml (approx. OD600=0.4-0.5).

The cells were then harvested and prepared for transformation by transfer into GS3 rotor bottles in a
Sorval GS3 rotor at 5,000 rpm for 5 minutes, the supernatant discarded, and then resuspended into sterile water,
and centrifuged again in 50 ml falcon tubes at 3,500 rpm in a Beckman GS-6KR centrifuge. The supernatant
was discarded and the cells were subsequently washed with LiAc/TE (10 ml, 10 mM Tris-HCl, 1 mM EDTA
pH 7.5, 100 mM Li2OOCCH3), and resuspended into LiAc/TE (2.5 ml).

Transformation took place by mixing the prepared cells (100 µl) with freshly denatured single stranded
salmon testes DNA (Lofstrand Labs, Gaithersburg, MD) and transforming DNA (1 µg, vol. < 10 µl) in
microfuge tubes. The mixture was mixed briefly by vortexing, then 40% PEG/TE (600 µl, 40% polyethylene
glycol-4000, 10 mM Tris-HCl, 1 mM EDTA, 100 mM Li2OOCCH3, pH 7.5) was added. This mixture was
gently mixed and incubated at 30°C while agitating for 30 minutes. The cells were then heat shocked at 42°C
for 15 minutes, and the reaction vessel centrifuged in a microfuge at 12,000 rpm for 5-10 seconds, decanted and
resuspended into TE (500 µl, 10 mM Tris-HCl, 1 mM EDTA pH 7.5) followed by recentrifugation. The cells
were then diluted into TE (1 ml) and aliquots (200 µl) were spread onto the selective media previously prepared
in 150 mm growth plates (VWR).

Alternatively, instead of multiple small reactions, the transformation was performed using a single, large 
scale reaction, wherein reagent amounts were scaled up accordingly. 

The selective media used was a synthetic complete dextrose agar lacking uracil (SCD-Ura) prepared as
described in Kaiser et al., Methods in Yeast Genetics, Cold Spring Harbor Press, Cold Spring Harbor, NY, p.
208-210 (1994). Transformants were grown at 30°C for 2-3 days.

The detection of colonies secreting amylase was performed by including red starch in the selective
growth media. Starch was coupled to the red dye (Reactive Red-120, Sigma) as per the procedure described by
Biely et al., Anal. Biochem., 172: 176-179 (1988). The coupled starch was incorporated into the SCD-Ura agar
plates at a final concentration of 0.15% (w/v), and was buffered with potassium phosphate to a pH of 7.0
(50-100 mM final concentration).

The positive colonies were picked and streaked across fresh selective media (onto 150 mm plates) in
order to obtain well isolated and identifiable single colonies. Well isolated single colonies positive for amylase
secretion were detected by direct incorporation of red starch into buffered SCD-Ura agar. Positive colonies were
determined by their ability to break down starch resulting in a clear halo around the positive colony visualized
directly.

4. Isolation of DNA by PCR Amplification

When a positive colony was isolated, a portion of it was picked by a toothpick and diluted into sterile
water (30 µl) in a 96 well plate. At this time, the positive colonies were either frozen and stored for subsequent
analysis or immediately amplified. An aliquot of cells (5 µl) was used as a template for the PCR reaction in a
25 µl volume containing: 0.5 µl Klentaq (Clontech, Palo Alto, CA); 4.0 µl 10 mM dNTP's (Perkin Elmer-Cetus);
2.5 µl Klentaq buffer (Clontech); 0.25 µl forward oligo 1; 0.25 µl reverse oligo 2; 12.5 µl distilled water.
The sequence of the forward oligonucleotide 1 was:

5'-TGTAAAACGACGGCCAGTTAAATAGACCTGCAATTATTAATCT-3' (SEQ ID NO:553)
The sequence of reverse oligonucleotide 2 was:

5'-CAGGAAACAGCTATGACCACCTGCACACCTGCAAATCCATT-3' (SEQ ID NO:554)

PCR was then performed as follows:

a.  Denature                     92°C, 5 minutes

b.  3 cycles of:    Denature     92°C, 30 seconds
                    Anneal       59°C, 30 seconds
                    Extend       72°C, 60 seconds

c.  3 cycles of:    Denature     92°C, 30 seconds
                    Anneal       57°C, 30 seconds
                    Extend       72°C, 60 seconds

d.  25 cycles of:   Denature     92°C, 30 seconds
                    Anneal       55°C, 30 seconds
                    Extend       72°C, 60 seconds

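
As a cross-check on the reaction set-up and thermal program described above, the following Python sketch sums the component volumes and the programmed hold times. The dictionary keys and variable names are illustrative only, and the time estimate ignores temperature ramping, which depends on the instrument.

```python
# Component volumes in microlitres, as listed for the 25 µl reaction.
reaction_ul = {
    "cell suspension (template)": 5.0,
    "Klentaq": 0.5,
    "10 mM dNTPs": 4.0,
    "Klentaq buffer": 2.5,
    "forward oligo 1": 0.25,
    "reverse oligo 2": 0.25,
    "distilled water": 12.5,
}
total_volume_ul = sum(reaction_ul.values())

# Thermal program: (number of cycles, [(step name, temperature in °C, seconds), ...]).
program = [
    (1, [("denature", 92, 300)]),
    (3, [("denature", 92, 30), ("anneal", 59, 30), ("extend", 72, 60)]),
    (3, [("denature", 92, 30), ("anneal", 57, 30), ("extend", 72, 60)]),
    (25, [("denature", 92, 30), ("anneal", 55, 30), ("extend", 72, 60)]),
]
amplification_cycles = sum(cycles for cycles, steps in program if len(steps) > 1)
total_hold_seconds = sum(cycles * sum(sec for _, _, sec in steps) for cycles, steps in program)

print(total_volume_ul)           # 25.0
print(amplification_cycles)      # 31
print(total_hold_seconds / 60)   # 67.0 minutes of programmed hold time
```

The annealing temperature steps down from 59°C to 55°C over the first six cycles, so the program amounts to 31 amplification cycles after the initial 5 minute denaturation.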
The underlined regions of the oligonucleotides annealed to the ADH promoter region and the amylase
region, respectively, and amplified a 307 bp region from vector pSST-AMY.0 when no insert was present.
Typically, the first 18 nucleotides of the 5' end of these oligonucleotides contained annealing sites for the
sequencing primers. Thus, the total product of the PCR reaction from an empty vector was 343 bp. However,
signal sequence-fused cDNA resulted in considerably longer nucleotide sequences.

Following the PCR, an aliquot of the reaction (5 µl) was examined by agarose gel electrophoresis in
a 1% agarose gel using a Tris-Borate-EDTA (TBE) buffering system as described by Sambrook et al., supra.
Clones resulting in a single strong PCR product larger than 400 bp were further analyzed by DNA sequencing
after purification with a 96 Qiaquick PCR clean-up column (Qiagen Inc., Chatsworth, CA).
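
The product sizes quoted above follow from simple arithmetic: the 307 bp vector region plus the two 18 nucleotide sequencing-primer tails gives the 343 bp empty-vector product. The Python sketch below makes this explicit; the constant and function names are hypothetical, and treating an insert as adding its full length to the product is a simplifying assumption.

```python
VECTOR_REGION_BP = 307      # amplified from pSST-AMY.0 when no insert is present
SEQUENCING_TAIL_NT = 18     # 5' sequencing-primer site carried by each oligonucleotide
SIZE_CUTOFF_BP = 400        # products larger than this were sent for sequencing


def expected_product_bp(insert_bp=0):
    """Expected PCR product length for a clone carrying an insert of the given size."""
    return VECTOR_REGION_BP + 2 * SEQUENCING_TAIL_NT + insert_bp


def worth_sequencing(product_bp):
    """Apply the size criterion from the text (single strong band assumed)."""
    return product_bp > SIZE_CUTOFF_BP


print(expected_product_bp())                         # 343, the empty-vector product
print(worth_sequencing(expected_product_bp()))       # False
print(worth_sequencing(expected_product_bp(500)))    # True for a hypothetical 500 bp insert
```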

EXAMPLE 3: Isolation of cDNA Clones Using Signal Algorithm Analysis

Various polypeptide-encoding nucleic acid sequences were identified by applying a proprietary signal
sequence finding algorithm developed by Genentech, Inc. (South San Francisco, CA) upon ESTs as well as
clustered and assembled EST fragments from public (e.g., GenBank) and/or private (LIFESEQ®, Incyte
Pharmaceuticals, Inc., Palo Alto, CA) databases. The signal sequence algorithm computes a secretion signal
score based on the character of the DNA nucleotides surrounding the first and optionally the second methionine
codon(s) (ATG) at the 5'-end of the sequence or sequence fragment under consideration. The nucleotides
following the first ATG must code for at least 35 unambiguous amino acids without any stop codons. If the first
ATG has the required amino acids, the second is not examined. If neither meets the requirement, the candidate
sequence is not scored. In order to determine whether the EST sequence contains an authentic signal sequence,
the DNA and corresponding amino acid sequences surrounding the ATG codon are scored using a set of seven
sensors (evaluation parameters) known to be associated with secretion signals. Use of this algorithm resulted
in the identification of numerous polypeptide-encoding nucleic acid sequences.
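
Only the ATG selection rule is described in enough detail to sketch; the seven scoring sensors are proprietary and are not reproduced here. The Python below is one possible reading of that rule, under the assumptions that the first two ATG triplets are located by a plain string search and that "unambiguous" means every codon contains only A, C, G or T. All function names are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
MIN_CODONS = 35  # at least 35 unambiguous amino acids must follow the ATG


def has_required_orf(seq, atg_pos):
    """True if the 35 codons after the ATG are complete, unambiguous and stop-free."""
    start = atg_pos + 3
    codons = [seq[i:i + 3] for i in range(start, start + 3 * MIN_CODONS, 3)]
    return all(
        len(codon) == 3 and set(codon) <= set("ACGT") and codon not in STOP_CODONS
        for codon in codons
    )


def pick_start_codon(seq):
    """Return the position of the ATG to score, or None if the sequence is not scored."""
    seq = seq.upper()
    first = seq.find("ATG")
    if first == -1:
        return None
    if has_required_orf(seq, first):
        return first  # the second ATG is not examined
    second = seq.find("ATG", first + 1)
    if second != -1 and has_required_orf(seq, second):
        return second
    return None


print(pick_start_codon("ATG" + "GCT" * 35))                    # first ATG qualifies
print(pick_start_codon("ATG" + "TAA" + "ATG" + "GCT" * 35))    # falls back to second ATG
print(pick_start_codon("ATG" + "GCN" + "GCT" * 34))            # ambiguous base: not scored
```

A sequence passing this gate would then be handed to the scoring step, which is outside the scope of this sketch.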

EXAMPLE 4: Isolation of cDNA clones Encoding Human PRO Polypeptides

Using the techniques described in Examples 1 to 3 above, numerous full-length cDNA clones were
identified as encoding PRO polypeptides as disclosed herein. These cDNAs were then deposited under the terms
of the Budapest Treaty with the American Type Culture Collection, 10801 University Blvd., Manassas, VA
20110-2209, USA (ATCC) as shown in Table 7 below.



Table 7

Material            ATCC Dep. No.   Deposit Date
DNA16438-1387       209771          April 14, 1998
DNA19360-2552       203654          February 9, 1999
DNA33455-1548       PTA-127         May 25, 1999
DNA37I5S-2651       PTA-429         July 27, 1999
DNA38269-2654       PTA-432         July 27, 1999
DNA40619-1220       209525          December 10, 1997

Table 7 (cont'd)

Material            ATCC Dep. No.   Deposit Date
DNA44174-2513       203577          January 12, 1999
DNA44675-2662       PTA-430         July 27, 1999
DNA45408-2615       PTA-203         June 8, 1999
DNA48606-1479       203040          July 1, 1998
DNA52753-2656       PTA-611         August 31, 1999
DNA53915-1258       209593          January 21, 1998
DNA53991-2553       203649          February 9, 1999
DNA54009-2517       203574          January 12, 1999
DNA56055-1643       PTA-129         May 25, 1999
DNA57033-1403       209905          May 27, 1998
DNA57252-1453       203585          January 12, 1999
DNA58799-1652       203665          February 9, 1999
DNA59770-2652       PTA-427         July 27, 1999
DNA59774-2665       PTA-615         August 31, 1999
DNA60281-2518       203582          January 12, 1999
DNA60736-2559       203838          March 9, 1999
DNA61875-2653       PTA-428         July 27, 1999
DNA62312-2558       203836          March 9, 1999
DNA62849-1604       PTA-205         June 8, 1999
DNA66307-2661       PTA-431         July 27, 1999
DNA66677-2535       203659          February 9, 1999
DNA71235-1706       203584          January 12, 1999
DNA71289-2547       PTA-126         May 25, 1999
DNA73775-1707       PTA-128         May 25, 1999
DNA76385-1692       203664          February 9, 1999
DNA76395-2527       203578          January 12, 1999
DNA77622-2516       203554          December 22, 1998
DNA77629-2573       203850          March 16, 1999
DNA77645-2648       PTA-45          May 11, 1999
DNA79302-2521       203545          December 22, 1998
DNA79865-2519       203544          December 22, 1998
DNA80135-2655       PTA-234         June 15, 1999
DNA80794-2568       203848          March 16, 1999
DNA80796-2523       203555          December 22, 1998
DNA80840-2605       203949          April 20, 1999
DNA80899-2501       203539          December 15, 1998
DNA81228-2580       203871          March 23, 1999
DNA81761-2583       203862          March 23, 1999
DNA82358-2738       PTA-510         August 10, 1999
DNA82364-2538       203603          January 20, 1999
DNA82424-2566       203813          March 2, 1999
DNA82430-2557       203812          March 2, 1999
DNA83500-2506       203391          October 29, 1998
DNA83509-2612       203965          April 27, 1999
DNA83560-2569       203816          March 2, 1999
DNA84139-2555       203814          March 2, 1999
DNA84141-2556       203810          March 2, 1999
DNA84142-2613       PTA-22          May 4, 1999
DNA84318-2520       203580          January 12, 1999
DNA84909-2590       203889          March 30, 1999
DNA84912-2610       203964          April 27, 1999
DNA84925-2514       203548          December 22, 1998
DNA84928-2564       203817          March 2, 1999
DNA84932-2657       PTA-235         June 15, 1999



Table 7 (cont'd)

Material            ATCC Dep. No.   Deposit Date
DNA86592-2607       203968          April 27, 1999
DNA86594-2587       203894          March 30, 1999
DNA86647-2591       203893          March 30, 1999
DNA87185-2563       203811          March 2, 1999
DNA87656-2582       203867          March 23, 1999
DNA87974-2609       203963          April 27, 1999
DNA88001-2565       203815          March 2, 1999
DNA88004-2575       203890          March 30, 1999
DNA89220-2608       PTA-130         May 25, 1999
DNA89947-2618       203970          April 27, 1999
DNA90842-2574       203845          March 16, 1999
DNA91775-2581       203861          March 23, 1999
DNA91779-2571       203844          March 16, 1999
DNA92217-2697       PTA-513         August 10, 1999
DNA92219-2541       203663          February 9, 1999
DNA92223-2567       203851          March 16, 1999
DNA92225-2603       203950          April 20, 1999
DNA92232-2589       203895          March 30, 1999
DNA92233-2599       PTA-134         May 25, 1999
DNA92243-2549       203852          March 16, 1999
DNA92253-2671       PTA-258         June 22, 1999
DNA92254-2672       PTA-259         June 22, 1999
DNA92255-2584       203866          March 23, 1999
DNA92269-2570       203853          March 16, 1999
DNA92288-2588       203892          March 30, 1999
DNA92290-2550       203847          March 16, 1999
DNA93012-2622       PTA-21          May 4, 1999
DNA93020-2642       PTA-121         May 25, 1999
DNA94830-2604       203951          April 20, 1999
DNA94833-2579       203869          March 23, 1999
DNA94838-2658       PTA-232         June 15, 1999
DNA94844-2686       PTA-385         July 20, 1999
DNA94854-2586       203864          March 23, 1999
DNA96868-2677       PTA-262         June 22, 1999
DNA9687N2683        PTA-381         July 20, 1999
DNA96880-2624       PTA-15          May 4, 1999
DNA96986-2660       PTA-239         June 15, 1999
DNA96988-2685       PTA-384         July 20, 1999
DNA96995-2709       PTA-475         August 3, 1999
DNA97004-2562       203854          March 16, 1999
DNA97005-2687       PTA-378         July 20, 1999
DNA97009-2668       PTA-257         June 22, 1999
DNA97013-2667       PTA-231         June 15, 1999
DNA98380-2690       PTA-388         July 20, 1999
DNA98561-2696       PTA-620         August 31, 1999
DNA98575-2644       PTA-118         May 25, 1999
DNA98593-2694       PTA-477         August 3, 1999
DNA98600-2703       PTA-488         August 3, 1999
DNA99391-2572       203849          March 16, 1999
DNA99393-2560       203837          March 9, 1999
DNA100276-2684      PTA-380         July 20, 1999
DNA100312-2645      PTA-44          May 11, 1999
DNA100902-2646      PTA-42          May 11, 1999
DNA102899-2679      PTA-123         May 25, 1999

Table 7 (cont'd)

Material            ATCC Dep. No.   Deposit Date
DNA104875-2720      PTA-482         August 3, 1999
DNA105680-2710      PTA-483         August 3, 1999
DNA105779-2708      PTA-485         August 3, 1999
DNA105794-2695      PTA-480         August 3, 1999
DNA105838-2702      PTA-476         August 3, 1999
DNA107698-2715      PTA-472         August 3, 1999
DNA107701-27n       PTA-487         August 3, 1999
DNA107781-2707      PTA-484         August 3, 1999
DNA108670-2744      PTA-546         August 17, 1999
DNA108688-2725      PTA-515         August 10, 1999
DNA108769-2765      PTA-861         October 19, 1999
DNA108935-2721      PTA-518         August 10, 1999
DNA110700-2716      PTA-512         August 10, 1999
DNA111750-2706      PTA-489         August 3, 1999
DNA123430-2755      PTA-614         August 31, 1999
DNA125154-2785      PTA-957         November 16, 1999
DNA142238-2768      PTA-819         October 5, 1999
DNA22779-1130       209280          September 18, 1997
DNA26847-1395       209772          April 14, 1998
DNA27864-1155       209375          October 16, 1997
DNA27865-1091       209296          September 23, 1997
DNA28497-1130       209279          September 18, 1997
DNA29101-1122       209653          March 5, 1998
DNA32286-1191       209385          October 16, 1997
DNA32288-1132       209261          September 16, 1997
DNA32290-1164       209384          October 16, 1997
DNA32292-1131       209258          September 16, 1997
DNA32298-1132       209257          September 16, 1997
DNA33085-1110       209087          May 30, 1997
DNA33087-1158       209381          October 16, 1997
DNA33089-1132       209262          September 16, 1997
DNA33092-1202       209420          October 28, 1997
DNA33094-1131       209256          September 16, 1997
DNA33107-1135       209251          September 16, 1997
DNA33221-1133       209263          September 16, 1997
DNA33223-1136       209264          September 16, 1997
DNA33460-1166       209376          October 16, 1997
DNA33473-1176       209391          October 17, 1997
DNA33785-1143       209417          October 28, 1997
DNA33786-1132       209253          September 16, 1997
DNA34353-1428       209855          May 12, 1998
DNA34392-1170       209526          December 10, 1997
DNA34434-1139       209252          September 16, 1997
DNA35558-1167       209374          October 16, 1997
DNA35595-1228       209528          December 10, 1997
DNA35638-1216       209265          September 16, 1997
DNA35639-1172       209396          October 17, 1997
DNA35663-1129       209201          August 18, 1997
DNA35674-1142       209416          October 28, 1997
DNA35841-1173       209403          October 17, 1997
DNA35916-1161       209419          October 28, 1997
DNA35918-1174       209402          October 17, 1997
DNA36350-1158       209378          October 16, 1997
DNA37140-1234       209489          November 21, 1997

Table 7 (cont'd)

Material            ATCC Dep. No.   Deposit Date
DNA37150-1178       209401          October 17, 1997
DNA38260-1180       209397          October 17, 1997
DNA40021-1154       209389          October 17, 1997
DNA40587-1231       209438          November 7, 1997
DNA40592-1242       209492          November 21, 1997
DNA40620-1183       209388          October 17, 1997
DNA40628-1216       209432          November 7, 1997
DNA40981-1234       209439          November 7, 1997
DNA40982-1235       209433          November 7, 1997
DNA41234-1242       209618          February 5, 1998
DNA43046-1225       209484          November 21, 1997
DNA43316-1237       209487          November 21, 1997
DNA44167-1243       209434          November 7, 1997
DNA44184-1319       209704          March 26, 1998
DNA44194-1317       209808          April 28, 1998
DNA44196-1353       209847          May 6, 1998
DNA45419-1252       209616          February 5, 1998
DNA46777-1253       209619          February 5, 1998
DNA47394-1572       203109          August 11, 1998
DNA48331-1329       209715          March 31, 1998
DNA48336-1309       209669          March 11, 1998
DNA49142-1430       203002          June 23, 1998
DNA49646-1327       209705          March 26, 1998
DNA49821-1562       209981          June 16, 1998
DNA49829-1346       209749          April 7, 1998
DNA50921-1458       209859          May 12, 1998
DNA52187-1354       209845          May 6, 1998
DNA52196-1348       209748          April 7, 1998
DNA52598-1518       203107          August 11, 1998
DNA54228-1366       209801          April 23, 1998
DNA56047-1456       209948          June 9, 1998
DNA56112-1379       209883          May 20, 1998
DNA56113-1378       203049          July 1, 1998
DNA56352-1358       209846          May 6, 1998
DNA56433-1406       209857          May 12, 1998
DNA56439-1376       209864          May 14, 1998
DNA57530-1375       209880          May 20, 1998
DNA57689-1385       209869          May 14, 1998
DNA57690-1374       209950          June 9, 1998
DNA57693-1424       203008          June 23, 1998
DNA57838-1337       203014          June 23, 1998
DNA58721-1475       203110          August 11, 1998
DNA59205-1421       203009          June 23, 1998
DNA59215-1425       209961          June 9, 1998
DNA59220-1514       209962          June 9, 1998
DNA59294-1381       209866          May 14, 1998
DNA59488-1603       203157          August 25, 1998
DNA59588-1571       203106          August 11, 1998
DNA59606-1471       209945          June 9, 1998
DNA59620-1463       209989          June 16, 1998
DNA59767-1489       203108          August 11, 1998
DNA59777-1480       203111          August 11, 1998
DNA59814-1486       203359          October 20, 1998
DNA59839-1461       209988          June 16, 1998

Table 7 (cont'd)

Material            ATCC Dep. No.   Deposit Date
DNA59846-1503       209978          June 16, 1998
DNA59847-1511       203098          August 4, 1998
DNA60615-1483       209980          June 16, 1998
DNA60621-1516       [illegible]     August 4, 1998
DNA60622-1525       203090          August 4, 1998
DNA60627-1508       [illegible]     August 4, 1998
DNA60764-1533       203452          November 10, 1998
DNA60775-1532       203173          September 1, 1998
DNA61185-1646       203464          November 17, 1998
DNA61873-1574       [illegible]     August [illegible]
DNA62306-1570       [illegible]     September 9, 1998
DNA62808-1582       [illegible]     October 20, 1998
[illegible]         [illegible]     August 4, 1998
[illegible]         [illegible]     November 3, 1998
[illegible]         [illegible]     September 9, 1998
[illegible]         [illegible]     September 9, 1998
[illegible]         [illegible]     September 9, 1998
[illegible]         [illegible]     August 18, 1998
[illegible]         [illegible]     September 15, 1998
[illegible]         [illegible]     September 15, 1998
[illegible]         [illegible]     September 9, 1998
[illegible]         [illegible]     November 17, 1998
[illegible]         203094          August 4, 1998
[illegible]         [illegible]     September 15, 1998
[illegible]         203267          September 22, 1998
[illegible]         203282          September 22, 1998
[illegible]         203657          February 9, 1999
[illegible]         203276          September 22, 1998
[illegible]         [illegible]     August 25, 1998
[illegible]         [illegible]     August 18, 1998
[illegible]         203459          November 3, 1998
[illegible]         203270          September 22, 1998
[illegible]         203573          January 12, 1999
[illegible]         203477          November 17, 1998
[illegible]         203315          October 6, 1998
[illegible]         203313          October 6, 1998
DNA77301-1708       203407          October [illegible], 1998
DNA77624-2515       203553          December 22, 1998
DNA79230-2525       203549          December 22, 1998
DNA79862-2522       203550          December 22, 1998
DNA80145-2594       PTA-204         June 8, 1999
DNA83500-2506       203391          October 29, 1998
DNA84917-2597       203863          March 23, 1999
DNA92218-2554       203834          March 9, 1999
DNA96042-2682       PTA-382         July 20, 1999

These deposits were made under the provisions of the Budapest Treaty on the International Recognition
of the Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations thereunder (Budapest
Treaty). This assures maintenance of a viable culture of the deposit for 30 years from the date of deposit. The
deposits will be made available by ATCC under the terms of the Budapest Treaty, and subject to an agreement
between Genentech, Inc. and ATCC, which assures permanent and unrestricted availability of the progeny of
the culture of the deposit to the public upon issuance of the pertinent U.S. patent or upon laying open to the
public of any U.S. or foreign patent application, whichever comes first, and assures availability of the progeny
to one determined by the U.S. Commissioner of Patents and Trademarks to be entitled thereto according to 35
USC § 122 and the Commissioner's rules pursuant thereto (including 37 CFR § 1.14 with particular reference
to 886 OG 638).

The assignee of the present application has agreed that if a culture of the materials on deposit should
die or be lost or destroyed when cultivated under suitable conditions, the materials will be promptly replaced on
notification with another of the same. Availability of the deposited material is not to be construed as a license
to practice the invention in contravention of the rights granted under the authority of any government in
accordance with its patent laws.

EXAMPLE 5: Use of PRO as a hybridization probe

The following method describes use of a nucleotide sequence encoding PRO as a hybridization probe. 

DNA comprising the coding sequence of full-length or mature PRO as disclosed herein is employed as
a probe to screen for homologous DNAs (such as those encoding naturally-occurring variants of PRO) in human
tissue cDNA libraries or human tissue genomic libraries.

Hybridization and washing of filters containing either library DNAs is performed under the following
high stringency conditions. Hybridization of radiolabeled PRO-derived probe to the filters is performed in a
solution of 50% formamide, 5x SSC, 0.1% SDS, 0.1% sodium pyrophosphate, 50 mM sodium phosphate, pH
6.8, 2x Denhardt's solution, and 10% dextran sulfate at 42°C for 20 hours. Washing of the filters is performed
in an aqueous solution of 0.1x SSC and 0.1% SDS at 42°C.
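
To make the difference in ionic strength between the hybridization and wash steps concrete, the following Python sketch converts an SSC fold-concentration into salt concentrations. It relies on the standard definition of 1x SSC (150 mM NaCl, 15 mM trisodium citrate), which is not stated in the text above; the function names are illustrative, and only sodium contributed by SSC is counted (the sodium phosphate and other components of the hybridization solution are ignored).

```python
# Standard composition of 1x SSC in millimolar (assumed, not given in the text).
SSC_1X_MM = {"NaCl": 150.0, "trisodium citrate": 15.0}


def ssc_components(fold):
    """Return the NaCl and citrate concentrations (mM) of an n-fold SSC solution."""
    return {name: conc * fold for name, conc in SSC_1X_MM.items()}


def ssc_sodium_mm(fold):
    """Return the sodium ion concentration (mM) contributed by SSC alone.

    Each trisodium citrate contributes three sodium ions.
    """
    parts = ssc_components(fold)
    return parts["NaCl"] + 3.0 * parts["trisodium citrate"]


if __name__ == "__main__":
    for label, fold in (("hybridization, 5x SSC", 5.0), ("wash, 0.1x SSC", 0.1)):
        print(label, ssc_components(fold), "Na+ (mM):", ssc_sodium_mm(fold))
```

Under these assumptions the wash solution carries roughly one fiftieth of the SSC-derived sodium of the hybridization solution, which is what makes the wash the high-stringency step.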

DNAs having a desired sequence identity with the DNA encoding full-length native sequence PRO can
then be identified using standard techniques known in the art.

EXAMPLE 6: Expression of PRO in E. coli

This example illustrates preparation of an unglycosylated form of PRO by recombinant expression in
E. coli.

The DNA sequence encoding PRO is initially amplified using selected PCR primers. The primers
should contain restriction enzyme sites which correspond to the restriction enzyme sites on the selected
expression vector. A variety of expression vectors may be employed. An example of a suitable vector is
pBR322 (derived from E. coli; see Bolivar et al., Gene, 2:95 (1977)) which contains genes for ampicillin and
tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylated. The PCR
amplified sequences are then ligated into the vector. The vector will preferably include sequences which encode
for an antibiotic resistance gene, a trp promoter, a polyhis leader (including the first six STII codons, polyhis
sequence, and enterokinase cleavage site), the PRO coding region, lambda transcriptional terminator, and an
argU gene.

The ligation mixture is then used to transform a selected E. coli strain using the methods described in
Sambrook et al., supra. Transformants are identified by their ability to grow on LB plates and antibiotic resistant
colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and DNA
sequencing.

Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with
antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The cells are
then grown to a desired optical density, during which the expression promoter is turned on.

After culturing the cells for several more hours, the cells can be harvested by centrifugation. The cell
pellet obtained by the centrifugation can be solubilized using various agents known in the art, and the solubilized
PRO protein can then be purified using a metal chelating column under conditions that allow tight binding of the
protein.



PRO may be expressed in E. coli in a poly-His tagged form, using the following procedure. The DNA
encoding PRO is initially amplified using selected PCR primers. The primers will contain restriction enzyme
sites which correspond to the restriction enzyme sites on the selected expression vector, and other useful
sequences providing for efficient and reliable translation initiation, rapid purification on a metal chelation
column, and proteolytic removal with enterokinase. The PCR-amplified, poly-His tagged sequences are then
ligated into an expression vector, which is used to transform an E. coli host based on strain 52 (W3110
fuhA(tonA) lon galE rpoHts(htpRts) clpP(lacIq)). Transformants are first grown in LB containing 50 mg/ml
carbenicillin at 30°C with shaking until an O.D.600 of 3-5 is reached. Cultures are then diluted 50-100 fold into
CRAP media (prepared by mixing 3.57 g (NH4)2SO4, 0.71 g sodium citrate·2H2O, 1.07 g KCl, 5.36 g Difco
yeast extract, 5.36 g Sheffield hycase SF in 500 mL water, as well as 110 mM MOPS, pH 7.3, 0.55% (w/v)
glucose and 7 mM MgSO4) and grown for approximately 20-30 hours at 30°C with shaking. Samples are
removed to verify expression by SDS-PAGE analysis, and the bulk culture is centrifuged to pellet the cells. Cell
pellets are frozen until purification and refolding.

E. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volumes (w/v) in 7 M
guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate are added to make final
concentrations of 0.1 M and 0.02 M, respectively, and the solution is stirred overnight at 4°C. This step results
in a denatured protein with all cysteine residues blocked by sulfitolization. The solution is centrifuged at 40,000
rpm in a Beckman Ultracentrifuge for 30 min. The supernatant is diluted with 3-5 volumes of metal chelate
column buffer (6 M guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify. The
clarified extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate column equilibrated in the metal chelate
column buffer. The column is washed with additional buffer containing 50 mM imidazole (Calbiochem, Utrol
grade), pH 7.4. The protein is eluted with buffer containing 250 mM imidazole. Fractions containing the
desired protein are pooled and stored at 4°C. Protein concentration is estimated by its absorbance at 280 nm
using the calculated extinction coefficient based on its amino acid sequence.
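
The text does not say how the extinction coefficient is calculated. One common approach, shown here purely as an assumed illustration, is the residue-count estimate of Pace et al. (5500 per tryptophan, 1490 per tyrosine and 125 per disulfide-bonded cystine, in M^-1 cm^-1), combined with the Beer-Lambert law A = epsilon x c x l. The function names and the example sequence are hypothetical. Because the cysteines are blocked at this stage of the protocol, the cystine count defaults to zero.

```python
def extinction_coefficient(sequence, cystines=0):
    """Estimate the molar extinction coefficient at 280 nm (M^-1 cm^-1).

    Uses per-residue contributions for Trp (5500), Tyr (1490) and cystine (125).
    """
    seq = sequence.upper()
    return 5500 * seq.count("W") + 1490 * seq.count("Y") + 125 * cystines


def molar_concentration(a280, sequence, path_cm=1.0, cystines=0):
    """Return the protein concentration in mol/L from an A280 reading."""
    epsilon = extinction_coefficient(sequence, cystines)
    if epsilon == 0:
        raise ValueError("sequence has no Trp, Tyr or cystine; A280 cannot be used")
    return a280 / (epsilon * path_cm)


def mg_per_ml(a280, sequence, molecular_weight_da, path_cm=1.0, cystines=0):
    """Convert the molar concentration to mg/mL using a user-supplied molecular weight."""
    return molar_concentration(a280, sequence, path_cm, cystines) * molecular_weight_da
```

For a protein measured in 6 M guanidine, slightly different per-residue values are sometimes used; the coefficients above are a starting assumption, not a value taken from this document.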

The proteins are refolded by diluting the sample slowly into freshly prepared refolding buffer consisting
of: 20 mM Tris, pH 8.6, 0.3 M NaCl, 2.5 M urea, 5 mM cysteine, 20 mM glycine and 1 mM EDTA.
Refolding volumes are chosen so that the final protein concentration is between 50 to 100 micrograms/ml. The
refolding solution is stirred gently at 4°C for 12-36 hours. The refolding reaction is quenched by the addition
of TFA to a final concentration of 0.4% (pH of approximately 3). Before further purification of the protein, the
solution is filtered through a 0.22 micron filter and acetonitrile is added to 2-10% final concentration. The
refolded protein is chromatographed on a Poros R1/H reversed phase column using a mobile buffer of 0.1%
TFA with elution with a gradient of acetonitrile from 10 to 80%. Aliquots of fractions with A280 absorbance
are analyzed on SDS polyacrylamide gels and fractions containing homogeneous refolded protein are pooled.
Generally, the properly refolded species of most proteins are eluted at the lowest concentrations of acetonitrile
since those species are the most compact with their hydrophobic interiors shielded from interaction with the
reversed phase resin. Aggregated species are usually eluted at higher acetonitrile concentrations. In addition
to resolving misfolded forms of proteins from the desired form, the reversed phase step also removes endotoxin
from the samples.
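
The choice of refolding volume in the step above is a simple dilution calculation, sketched below in Python. The 50 and 100 microgram/ml bounds come from the text; the function name and the example protein mass are hypothetical.

```python
def refolding_volume_ml(protein_mg, low_ug_per_ml=50.0, high_ug_per_ml=100.0):
    """Return (minimum, maximum) refolding volume in mL for a given protein mass.

    The minimum volume gives the highest allowed final concentration and the
    maximum volume gives the lowest allowed final concentration.
    """
    if protein_mg <= 0 or low_ug_per_ml <= 0 or high_ug_per_ml < low_ug_per_ml:
        raise ValueError("mass and concentration bounds must be positive and ordered")
    protein_ug = protein_mg * 1000.0
    return protein_ug / high_ug_per_ml, protein_ug / low_ug_per_ml


if __name__ == "__main__":
    # Hypothetical pool containing 5 mg of protein after the metal chelate column.
    print(refolding_volume_ml(5.0))
```

For the hypothetical 5 mg pool, any final volume between the two returned values keeps the protein within the stated concentration window.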

Fractions containing the desired folded PRO polypeptide are pooled and the acetonitrile removed using
a gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, pH 6.8 with
0.14 M sodium chloride and 4% mannitol by dialysis or by gel filtration using G25 Superfine (Pharmacia) resins
equilibrated in the formulation buffer and sterile filtered.

Many of the PRO polypeptides disclosed herein were successfully expressed as described above.

EXAMPLE 7: Expression of PRO in mammalian cells

This example illustrates preparation of a potentially glycosylated form of PRO by recombinant
expression in mammalian cells.

The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the expression vector.
Optionally, the PRO DNA is ligated into pRK5 with selected restriction enzymes to allow insertion of the PRO
DNA using ligation methods such as described in Sambrook et al., supra. The resulting vector is called
pRK5-PRO.

In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) are
grown to confluence in tissue culture plates in medium such as DMEM supplemented with fetal calf serum and
optionally, nutrient components and/or antibiotics. About 10 µg pRK5-PRO DNA is mixed with about 1 µg
DNA encoding the VA RNA gene [Thimmappaya et al., Cell, 31:543 (1982)] and dissolved in 500 µl of 1 mM
Tris-HCl, 0.1 mM EDTA, 0.227 M CaCl2. To this mixture is added, dropwise, 500 µl of 50 mM HEPES (pH
7.35), 280 mM NaCl, 1.5 mM NaPO4, and a precipitate is allowed to form for 10 minutes at 25°C. The
precipitate is suspended and added to the 293 cells and allowed to settle for about four hours at 37°C. The
culture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 293 cells are
then washed with serum free medium, fresh medium is added and the cells are incubated for about 5 days.
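
The final composition of the calcium phosphate mixture follows from combining the two 500 µl solutions described above. The short Python sketch below performs that dilution arithmetic; the component concentrations are taken from the text, while the function name and data layout are illustrative assumptions. Volume additivity is assumed and the small contribution of the DNA itself is ignored.

```python
def mix(solutions):
    """Return final concentrations (mM) after combining solutions.

    solutions is a list of (volume_ul, {component: concentration_mM}) pairs.
    """
    total_volume = sum(volume for volume, _ in solutions)
    final = {}
    for volume, components in solutions:
        for name, conc in components.items():
            final[name] = final.get(name, 0.0) + conc * volume / total_volume
    return final


# DNA/calcium solution and HEPES-buffered phosphate solution, 500 µl each.
dna_calcium = (500.0, {"Tris-HCl": 1.0, "EDTA": 0.1, "CaCl2": 227.0})
hepes_phosphate = (500.0, {"HEPES": 50.0, "NaCl": 280.0, "phosphate": 1.5})

if __name__ == "__main__":
    for name, conc in mix([dna_calcium, hepes_phosphate]).items():
        print(f"{name}: {conc:g} mM")
```

Because the two volumes are equal, every component ends up at half its starting concentration in the 1 ml precipitation mixture.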

Approximately 24 hours after the transfections, the culture medium is removed and replaced with culture
medium (alone) or culture medium containing 200 µCi/ml 35S-cysteine and 200 µCi/ml 35S-methionine. After
a 12 hour incubation, the conditioned medium is collected, concentrated on a spin filter, and loaded onto a 15%
SDS gel. The processed gel may be dried and exposed to film for a selected period of time to reveal the
presence of PRO polypeptide. The cultures containing transfected cells may undergo further incubation (in
serum free medium) and the medium is tested in selected bioassays.

In an alternative technique, PRO may be introduced into 293 cells transiently using the dextran sulfate
method described by Somparyrac et al., Proc. Natl. Acad. Sci., 12:7575 (1981). 293 cells are grown to
maximal density in a spinner flask and 700 µg pRK5-PRO DNA is added. The cells are first concentrated from
the spinner flask by centrifugation and washed with PBS. The DNA-dextran precipitate is incubated on the cell
pellet for four hours. The cells are treated with 20% glycerol for 90 seconds, washed with tissue culture
medium, and re-introduced into the spinner flask containing tissue culture medium, 5 µg/ml bovine insulin and
0.1 µg/ml bovine transferrin. After about four days, the conditioned media is centrifuged and filtered to remove
cells and debris. The sample containing expressed PRO can then be concentrated and purified by any selected
method, such as dialysis and/or column chromatography.

In another embodiment, PRO can be expressed in CHO cells. The pRK5-PRO can be transfected into
CHO cells using known reagents such as CaPO4 or DEAE-dextran. As described above, the cell cultures can
be incubated, and the medium replaced with culture medium (alone) or medium containing a radiolabel such as
35S-methionine. After determining the presence of PRO polypeptide, the culture medium may be replaced with
serum free medium. Preferably, the cultures are incubated for about 6 days, and then the conditioned medium
is harvested. The medium containing the expressed PRO can then be concentrated and purified by any selected
method.

Epitope-tagged PRO may also be expressed in host CHO cells. The PRO may be subcloned out of the
pRK5 vector. The subclone insert can undergo PCR to fuse in frame with a selected epitope tag such as a
poly-his tag into a Baculovirus expression vector. The poly-his tagged PRO insert can then be subcloned into a SV40
driven vector containing a selection marker such as DHFR for selection of stable clones. Finally, the CHO cells
can be transfected (as described above) with the SV40 driven vector. Labeling may be performed, as described
above, to verify expression. The culture medium containing the expressed poly-His tagged PRO can then be
concentrated and purified by any selected method, such as by Ni2+-chelate affinity chromatography.

PRO may also be expressed in CHO and/or COS cells by a transient expression procedure or in CHO
cells by another stable expression procedure.

Stable expression in CHO cells is performed using the following procedure. The proteins are expressed
as an IgG construct (immunoadhesin), in which the coding sequences for the soluble forms (e.g. extracellular
domains) of the respective proteins are fused to an IgG1 constant region sequence containing the hinge, CH2
and CH3 domains, and/or as a poly-His tagged form.

Following PCR amplification, the respective DNAs are subcloned in a CHO expression vector using
standard techniques as described in Ausubel et al., Current Protocols of Molecular Biology, Unit 3.16, John
Wiley and Sons (1997). CHO expression vectors are constructed to have compatible restriction sites 5' and 3'
of the DNA of interest to allow the convenient shuttling of cDNAs. The vector used for expression in CHO cells
is as described in Lucas et al., Nucl. Acids Res. 24:9 (1774-1779) (1996), and uses the SV40 early
promoter/enhancer to drive expression of the cDNA of interest and dihydrofolate reductase (DHFR). DHFR
expression permits selection for stable maintenance of the plasmid following transfection.

Twelve micrograms of the desired plasmid DNA is introduced into approximately 10 million CHO cells
using commercially available transfection reagents Superfect® (Qiagen), Dosper® or Fugene® (Boehringer
Mannheim). The cells are grown as described in Lucas et al., supra. Approximately 3 x cells are frozen
in an ampule for further growth and production as described below.

The ampules containing the plasmid DNA are thawed by placement into a water bath and mixed by
vortexing. The contents are pipetted into a centrifuge tube containing 10 mL of media and centrifuged at 1000
rpm for 5 minutes. The supernatant is aspirated and the cells are resuspended in 10 mL of selective media (0.2
µm filtered PS20 with 5% 0.2 µm diafiltered fetal bovine serum). The cells are then aliquoted into a 100 mL
spinner containing 90 mL of selective media. After 1-2 days, the cells are transferred into a 250 mL spinner
filled with 150 mL selective growth medium and incubated at 37°C. After another 2-3 days, 250 mL, 500 mL
and 2000 mL spinners are seeded with 3 x 10^5 cells/mL. The cell media is exchanged with fresh media by
centrifugation and resuspension in production medium. Although any suitable CHO media may be employed,
a production medium described in U.S. Patent No. 5,122,469, issued June 16, 1992 may actually be used. A
3 L production spinner is seeded at 1.2 x 10^6 cells/mL. On day 0, the cell number and pH are determined. On day
1, the spinner is sampled and sparging with filtered air is commenced. On day 2, the spinner is sampled, the
temperature shifted to 33°C, and 30 mL of 500 g/L glucose and 0.6 mL of 10% antifoam (e.g., 35%
polydimethylsiloxane emulsion, Dow Corning 365 Medical Grade Emulsion) added. Throughout the production,
the pH is adjusted as necessary to keep it at around 7.2. After 10 days, or until the viability drops below
70%, the cell culture is harvested by centrifugation and filtering through a 0.22 µm filter. The filtrate is either
stored at 4°C or immediately loaded onto columns for purification.

For the poly-His tagged constructs, the proteins are purified using a Ni-NTA column (Qiagen). Before
purification, imidazole is added to the conditioned media to a concentration of 5 mM. The conditioned media
is pumped onto a 6 ml Ni-NTA column equilibrated in 20 mM Hepes, pH 7.4, buffer containing 0.3 M NaCl
and 5 mM imidazole at a flow rate of 4-5 ml/min. at 4°C. After loading, the column is washed with additional
equilibration buffer and the protein eluted with equilibration buffer containing 0.25 M imidazole. The highly
purified protein is subsequently desalted into a storage buffer containing 10 mM Hepes, 0.14 M NaCl and 4%
mannitol, pH 6.8, with a 25 ml G25 Superfine (Pharmacia) column and stored at -80°C.

Immunoadhesin (Fc-containing) constructs are purified from the conditioned media as follows. The
conditioned medium is pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibrated in 20
mM Na phosphate buffer, pH 6.8. After loading, the column is washed extensively with equilibration buffer
before elution with 100 mM citric acid, pH 3.5. The eluted protein is immediately neutralized by collecting 1
ml fractions into tubes containing 275 µL of 1 M Tris buffer, pH 9. The highly purified protein is subsequently
desalted into storage buffer as described above for the poly-His tagged proteins. The homogeneity is assessed
by SDS polyacrylamide gels and by N-terminal amino acid sequencing by Edman degradation.

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

EXAMPLE 8: Expression of PRO in Yeast

The following method describes recombinant expression of PRO in yeast. 

First, yeast expression vectors are constructed for intracellular production or secretion of PRO from
the ADH2/GAPDH promoter. DNA encoding PRO and the promoter is inserted into suitable restriction enzyme
sites in the selected plasmid to direct intracellular expression of PRO. For secretion, DNA encoding PRO can
be cloned into the selected plasmid, together with DNA encoding the ADH2/GAPDH promoter, a native PRO
signal peptide or other mammalian signal peptide, or, for example, a yeast alpha-factor or invertase secretory
signal/leader sequence, and linker sequences (if needed) for expression of PRO.

Yeast cells, such as yeast strain AB110, can then be transformed with the expression plasmids described
above and cultured in selected fermentation media. The transformed yeast supernatants can be analyzed by
precipitation with 10% trichloroacetic acid and separation by SDS-PAGE, followed by staining of the gels with
Coomassie Blue stain.

Recombinant PRO can subsequently be isolated and purified by removing the yeast cells from the 
fermentation medium by centrifugation and then concentrating the medium using selected cartridge filters. The 
concentrate containing PRO may further be purified using selected column chromatography resins. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

EXAMPLE 9: Expression of PRO in Baculovirus-Infected Insect Cells

The following method describes recombinant expression of PRO in Baculovirus-infected insect cells. 

The sequence coding for PRO is fused upstream of an epitope tag contained within a baculovirus
expression vector. Such epitope tags include poly-his tags and immunoglobulin tags (like Fc regions of IgG).
A variety of plasmids may be employed, including plasmids derived from commercially available plasmids such
as pVL1393 (Novagen). Briefly, the sequence encoding PRO or the desired portion of the coding sequence of
PRO, such as the sequence encoding the extracellular domain of a transmembrane protein or the sequence
encoding the mature protein if the protein is extracellular, is amplified by PCR with primers complementary to
the 5' and 3' regions. The 5' primer may incorporate flanking (selected) restriction enzyme sites. The product
is then digested with those selected restriction enzymes and subcloned into the expression vector.

Recombinant baculovirus is generated by co-transfecting the above plasmid and BaculoGold™ virus
DNA (Pharmingen) into Spodoptera frugiperda ("Sf9") cells (ATCC CRL 1711) using lipofectin (commercially
available from GIBCO-BRL). After 4-5 days of incubation at 28°C, the released viruses are harvested and used
for further amplifications. Viral infection and protein expression are performed as described by O'Reilley et
al., Baculovirus expression vectors: A Laboratory Manual, Oxford: Oxford University Press (1994).

Expressed poly-his tagged PRO can then be purified, for example, by Ni2+-chelate affinity
chromatography as follows. Extracts are prepared from recombinant virus-infected Sf9 cells as described by
Rupert et al., Nature, 362:175-179 (1993). Briefly, Sf9 cells are washed, resuspended in sonication buffer (25
mL Hepes, pH 7.9; 12.5 mM MgCl2; 0.1 mM EDTA; 10% glycerol; 0.1% NP-40; 0.4 M KCl), and sonicated
twice for 20 seconds on ice. The sonicates are cleared by centrifugation, and the supernatant is diluted 50-fold
in loading buffer (50 mM phosphate, 300 mM NaCl, 10% glycerol, pH 7.8) and filtered through a 0.45 μm
filter. A Ni2+-NTA agarose column (commercially available from Qiagen) is prepared with a bed volume of 5
mL, washed with 25 mL of water and equilibrated with 25 mL of loading buffer. The filtered cell extract is
loaded onto the column at 0.5 mL per minute. The column is washed to baseline A280 with loading buffer, at
which point fraction collection is started. Next, the column is washed with a secondary wash buffer (50 mM
phosphate; 300 mM NaCl, 10% glycerol, pH 6.0), which elutes nonspecifically bound protein. After reaching
A280 baseline again, the column is developed with a 0 to 500 mM Imidazole gradient in the secondary wash
buffer. One mL fractions are collected and analyzed by SDS-PAGE and silver staining or Western blot with
Ni2+-NTA-conjugated to alkaline phosphatase (Qiagen). Fractions containing the eluted His10-tagged PRO are
pooled and dialyzed against loading buffer.

Alternatively, purification of the IgG tagged (or Fc tagged) PRO can be performed using known
chromatography techniques, including, for instance, Protein A or protein G column chromatography.

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

EXAMPLE 10: Preparation of Antibodies that Bind PRO 

This example illustrates preparation of monoclonal antibodies which can specifically bind PRO. 

Techniques for producing the monoclonal antibodies are known in the art and are described, for
instance, in Goding, supra. Immunogens that may be employed include purified PRO, fusion proteins containing
PRO, and cells expressing recombinant PRO on the cell surface. Selection of the immunogen can be made by
the skilled artisan without undue experimentation.

Mice, such as Balb/c, are immunized with the PRO immunogen emulsified in complete Freund's
adjuvant and injected subcutaneously or intraperitoneally in an amount from 1-100 micrograms. Alternatively,
the immunogen is emulsified in MPL-TDM adjuvant (Ribi Immunochemical Research, Hamilton, MT) and
injected into the animal's hind foot pads. The immunized mice are then boosted 10 to 12 days later with
additional immunogen emulsified in the selected adjuvant. Thereafter, for several weeks, the mice may also be
boosted with additional immunization injections. Serum samples may be periodically obtained from the mice
by retro-orbital bleeding for testing in ELISA assays to detect anti-PRO antibodies.

After a suitable antibody titer has been detected, the animals "positive" for antibodies can be injected
with a final intravenous injection of PRO. Three to four days later, the mice are sacrificed and the spleen cells
are harvested. The spleen cells are then fused (using 35% polyethylene glycol) to a selected murine myeloma
cell line such as P3X63AgU.1, available from ATCC, No. CRL 1597. The fusions generate hybridoma cells
which can then be plated in 96 well tissue culture plates containing HAT (hypoxanthine, aminopterin, and
thymidine) medium to inhibit proliferation of non-fused cells, myeloma hybrids, and spleen cell hybrids.

The hybridoma cells will be screened in an ELISA for reactivity against PRO. Determination of
"positive" hybridoma cells secreting the desired monoclonal antibodies against PRO is within the skill in the art.

The positive hybridoma cells can be injected intraperitoneally into syngeneic Balb/c mice to produce
ascites containing the anti-PRO monoclonal antibodies. Alternatively, the hybridoma cells can be grown in tissue
culture flasks or roller bottles. Purification of the monoclonal antibodies produced in the ascites can be
accomplished using ammonium sulfate precipitation, followed by gel exclusion chromatography. Alternatively,
affinity chromatography based upon binding of antibody to protein A or protein G can be employed.

EXAMPLE 11: Purification of PRO Polypeptides Using Specific Antibodies

Native or recombinant PRO polypeptides may be purified by a variety of standard techniques in the art
of protein purification. For example, pro-PRO polypeptide, mature PRO polypeptide, or pre-PRO polypeptide
is purified by immunoaffinity chromatography using antibodies specific for the PRO polypeptide of interest. In
general, an immunoaffinity column is constructed by covalently coupling the anti-PRO polypeptide antibody to
an activated chromatographic resin.

Polyclonal immunoglobulins are prepared from immune sera either by precipitation with ammonium
sulfate or by purification on immobilized Protein A (Pharmacia LKB Biotechnology, Piscataway, N.J.).
Likewise, monoclonal antibodies are prepared from mouse ascites fluid by ammonium sulfate precipitation or
chromatography on immobilized Protein A. Partially purified immunoglobulin is covalently attached to a
chromatographic resin such as CnBr-activated SEPHAROSE™ (Pharmacia LKB Biotechnology). The antibody
is coupled to the resin, the resin is blocked, and the derivative resin is washed according to the manufacturer's
instructions.

Such an immunoaffinity column is utilized in the purification of PRO polypeptide by preparing a fraction
from cells containing PRO polypeptide in a soluble form. This preparation is derived by solubilization of the
whole cell or of a subcellular fraction obtained via differential centrifugation by the addition of detergent or by
other methods well known in the art. Alternatively, soluble PRO polypeptide containing a signal sequence may
be secreted in useful quantity into the medium in which the cells are grown.

A soluble PRO polypeptide-containing preparation is passed over the immunoaffinity column, and the
column is washed under conditions that allow the preferential absorbance of PRO polypeptide (e.g., high ionic
strength buffers in the presence of detergent). Then, the column is eluted under conditions that disrupt
antibody/PRO polypeptide binding (e.g., a low pH buffer such as approximately pH 2-3, or a high concentration
of a chaotrope such as urea or thiocyanate ion), and PRO polypeptide is collected.

EXAMPLE 12: Drug Screening 

This invention is particularly useful for screening compounds by using PRO polypeptides or binding
fragment thereof in any of a variety of drug screening techniques. The PRO polypeptide or fragment employed
in such a test may either be free in solution, affixed to a solid support, borne on a cell surface, or located
intracellularly. One method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably
transformed with recombinant nucleic acids expressing the PRO polypeptide or fragment. Drugs are screened
against such transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can be
used for standard binding assays. One may measure, for example, the formation of complexes between PRO
polypeptide or a fragment and the agent being tested. Alternatively, one can examine the diminution in complex
formation between the PRO polypeptide and its target cell or target receptors caused by the agent being tested.

Thus, the present invention provides methods of screening for drugs or any other agents which can
affect a PRO polypeptide-associated disease or disorder. These methods comprise contacting such an agent with
a PRO polypeptide or fragment thereof and assaying (i) for the presence of a complex between the agent and
the PRO polypeptide or fragment, or (ii) for the presence of a complex between the PRO polypeptide or fragment
and the cell, by methods well known in the art. In such competitive binding assays, the PRO polypeptide or
fragment is typically labeled. After suitable incubation, free PRO polypeptide or fragment is separated from that
present in bound form, and the amount of free or uncomplexed label is a measure of the ability of the particular
agent to bind to PRO polypeptide or to interfere with the PRO polypeptide/cell complex.

Another technique for drug screening provides high throughput screening for compounds having suitable
binding affinity to a polypeptide and is described in detail in WO 84/03564, published on September 13, 1984.
Briefly stated, large numbers of different small peptide test compounds are synthesized on a solid substrate, such
as plastic pins or some other surface. As applied to a PRO polypeptide, the peptide test compounds are reacted
with PRO polypeptide and washed. Bound PRO polypeptide is detected by methods well known in the art.
Purified PRO polypeptide can also be coated directly onto plates for use in the aforementioned drug screening
techniques. In addition, non-neutralizing antibodies can be used to capture the peptide and immobilize it on the
solid support.

This invention also contemplates the use of competitive drug screening assays in which neutralizing
antibodies capable of binding PRO polypeptide specifically compete with a test compound for binding to PRO
polypeptide or fragments thereof. In this manner, the antibodies can be used to detect the presence of any
peptide which shares one or more antigenic determinants with PRO polypeptide.

EXAMPLE 13: Rational Drug Design

The goal of rational drug design is to produce structural analogs of biologically active polypeptide of
interest (i.e., a PRO polypeptide) or of small molecules with which they interact, e.g., agonists, antagonists, or
inhibitors. Any of these examples can be used to fashion drugs which are more active or stable forms of the
PRO polypeptide or which enhance or interfere with the function of the PRO polypeptide in vivo (c.f., Hodgson,
Bio/Technology, 9:19-21 (1991)).

In one approach, the three-dimensional structure of the PRO polypeptide, or of a PRO
polypeptide-inhibitor complex, is determined by x-ray crystallography, by computer modeling or, most typically,
by a combination of the two approaches. Both the shape and charges of the PRO polypeptide must be ascertained
to elucidate the structure and to determine active site(s) of the molecule. Less often, useful information regarding
the structure of the PRO polypeptide may be gained by modeling based on the structure of homologous proteins.
In both cases, relevant structural information is used to design analogous PRO polypeptide-like molecules or to
identify efficient inhibitors. Useful examples of rational drug design may include molecules which have improved
activity or stability as shown by Braxton and Wells, Biochemistry, 31:7796-7801 (1992) or which act as
inhibitors, agonists, or antagonists of native peptides as shown by Athauda et al., J. Biochem., 113:742-746
(1993).

It is also possible to isolate a target-specific antibody, selected by functional assay, as described above,
and then to solve its crystal structure. This approach, in principle, yields a pharmacore upon which subsequent
drug design can be based. It is possible to bypass protein crystallography altogether by generating anti-idiotypic
antibodies (anti-ids) to a functional, pharmacologically active antibody. As a mirror image of a mirror image,
the binding site of the anti-ids would be expected to be an analog of the original receptor. The anti-id could then
be used to identify and isolate peptides from banks of chemically or biologically produced peptides. The isolated
peptides would then act as the pharmacore.

By virtue of the present invention, sufficient amounts of the PRO polypeptide may be made available 
to perform such analytical studies as X-ray crystallography. In addition, knowledge of the PRO polypeptide
amino acid sequence provided herein will provide guidance to those employing computer modeling techniques 
in place of or in addition to x-ray crystallography. 

EXAMPLE 14: Identification of PRO Polypeptides That Stimulate TNF-α Release In Human Blood (Assay 128)

This assay shows that certain PRO polypeptides of the present invention act to stimulate the release of
TNF-α in human blood. PRO polypeptides testing positive in this assay are useful for, among other things,
research purposes where stimulation of the release of TNF-α would be desired and for the therapeutic treatment
of conditions wherein enhanced TNF-α release would be beneficial. Specifically, 200 μl of human blood
supplemented with 50 mM Hepes buffer (pH 7.2) is aliquoted per well in a 96 well test plate. To each well is
then added 300 μl of either the test PRO polypeptide in 50 mM Hepes buffer (at various concentrations) or 50
mM Hepes buffer alone (negative control) and the plates are incubated at 37°C for 6 hours. The samples are
then centrifuged and 50 μl of plasma is collected from each well and tested for the presence of TNF-α by ELISA
assay. A positive in the assay is a higher amount of TNF-α in the PRO polypeptide treated samples as compared
to the negative control samples.

The following PRO polypeptides tested positive in this assay: PRO195, PRO202, PRO215, PRO221,
PRO217, PRO222, PRO198, PRO245, PRO172, PRO265, PRO266, PRO344, PRO337, PRO322, PRO1286,
PRO1279, PRO1338 and PRO1343.

EXAMPLE 15: Detection of Polypeptides That Affect Glucose or FFA Uptake in Skeletal Muscle (Assay 106)

This assay is designed to determine whether PRO polypeptides show the ability to affect glucose or FFA
uptake by skeletal muscle cells. PRO polypeptides testing positive in this assay would be expected to be useful
for the therapeutic treatment of disorders where either the stimulation or inhibition of glucose uptake by skeletal
muscle would be beneficial including, for example, diabetes or hyper- or hypo-insulinemia.

In a 96 well format, PRO polypeptides to be assayed are added to primary rat differentiated skeletal
muscle, and allowed to incubate overnight. Then fresh media with the PRO polypeptide and +/- insulin are
added to the wells. The sample media is then monitored to determine glucose and FFA uptake by the skeletal
muscle cells. The insulin will stimulate glucose and FFA uptake by the skeletal muscle, and insulin in media
without the PRO polypeptide is used as a positive control, and a limit for scoring. As the PRO polypeptide being
tested may either stimulate or inhibit glucose and FFA uptake, results are scored as positive in the assay if
greater than 1.5 times or less than 0.5 times the insulin control.
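
The two-sided scoring rule above can be written as a short function. This is a minimal sketch, not the assay's actual analysis software: the function name, the result labels, and the assumption that sample and control uptake are measured in the same units are all introduced here for illustration.

```python
def score_uptake(sample_uptake, insulin_control_uptake, upper=1.5, lower=0.5):
    """Score an uptake measurement against the insulin-only control.

    Positive if the sample is greater than `upper` times or less than
    `lower` times the insulin control, as stated in the text. The
    'stimulatory'/'inhibitory' labels are illustrative additions.
    """
    if insulin_control_uptake <= 0:
        raise ValueError("insulin control uptake must be positive")
    ratio = sample_uptake / insulin_control_uptake
    if ratio > upper:
        return "positive (stimulatory)"
    if ratio < lower:
        return "positive (inhibitory)"
    return "negative"


if __name__ == "__main__":
    # Hypothetical readings, in arbitrary but matching units.
    for sample in (16.0, 10.0, 4.0):
        print(sample, score_uptake(sample, insulin_control_uptake=10.0))
```

Because the thresholds are strict inequalities in the text, a sample at exactly 1.5 or 0.5 times the control is scored negative in this sketch.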

The following PRO polypeptides tested positive as being capable of affecting glucose and/or FFA uptake
by skeletal muscle in this assay: PRO182, PRO366, PRO198, PRO172 and PRO719.

This assay shows that certain polypeptides of the invention act to induce redifferentiation of
chondrocytes and, therefore, are expected to be useful for the treatment of various bone and/or cartilage disorders
such as, for example, sports injuries and arthritis. The assay is performed as follows. Porcine chondrocytes
are isolated by overnight collagenase digestion of articular cartilage of metacarpophalangeal joints of 4-6 month
old female pigs. The isolated cells are then seeded at 25,000 cells/cm² in Ham F-12 containing 10% FBS and
4 μg/ml gentamycin. The culture media is changed every third day and the cells are then seeded in 96 well
plates at 5,000 cells/well in 100 μl of the same media without serum and 100 μl of the test PRO polypeptide, 5
nM staurosporin (positive control) or medium alone (negative control) is added to give a final volume of 200
μl/well. After 5 days of incubation at 37°C, a picture of each well is taken and the differentiation state of the
chondrocytes is determined. A positive result in the assay occurs when the redifferentiation of the chondrocytes
is determined to be more similar to the positive control than the negative control.

The following polypeptides tested positive in this assay: PRO182, PRO366, PRO198 and PRO1868.

EXAMPLE 17: Chondrocyte Proliferation Assay (Assay 111)

This assay is designed to determine whether PRO polypeptides of the present invention show the ability
to induce the proliferation and/or redifferentiation of chondrocytes in culture. PRO polypeptides testing positive
in this assay would be expected to be useful for the therapeutic treatment of various bone and/or cartilage
disorders such as, for example, sports injuries and arthritis.

Porcine chondrocytes are isolated by overnight collagenase digestion of articular cartilage of the
metacarpophalangeal joint of 4-6 month old female pigs. The isolated cells are then seeded at 25,000 cells/cm²
in Ham F-12 containing 10% FBS and 4 μg/ml gentamycin. The culture media is changed every third day and
the cells are reseeded to 25,000 cells/cm² every five days. On day 12, the cells are seeded in 96 well plates at
5,000 cells/well in 100 μl of the same media without serum and 100 μl of either serum-free medium (negative
control), staurosporin (final concentration of 5 nM; positive control) or the test PRO polypeptide are added to
give a final volume of 200 μl/well. After 5 days at 37°C, 20 μl of Alamar blue is added to each well and the
plates are incubated for an additional 3 hours at 37°C. The fluorescence is then measured in each well (Ex: 530
nm; Em: 590 nm). The fluorescence of a plate containing 200 μl of the serum-free medium is measured to
obtain the background. A positive result in the assay is obtained when the fluorescence of the PRO polypeptide
treated sample is more like that of the positive control than the negative control.
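
The text does not define "more like the positive control" numerically. One possible reading, sketched below, subtracts the background from every reading and places the sample on a scale where the negative control is 0 and the positive control is 1; a score above 0.5 then means the sample lies closer to the positive control. The function names and the 0.5 midpoint are assumptions made for this illustration, not part of the described assay.

```python
def similarity_score(sample_rfu, positive_rfu, negative_rfu, background_rfu=0.0):
    """Position of the sample between the negative (0) and positive (1) controls.

    All readings are background-corrected first. The correction cancels in
    the ratio, but is kept explicit so corrected values can be reported.
    """
    sample = sample_rfu - background_rfu
    positive = positive_rfu - background_rfu
    negative = negative_rfu - background_rfu
    if positive == negative:
        raise ValueError("controls are indistinguishable; assay not interpretable")
    return (sample - negative) / (positive - negative)


def is_positive(score, midpoint=0.5):
    """True when the sample is closer to the positive than the negative control."""
    return score > midpoint


if __name__ == "__main__":
    # Hypothetical fluorescence readings (Ex 530 nm / Em 590 nm).
    score = similarity_score(900.0, positive_rfu=1000.0,
                             negative_rfu=200.0, background_rfu=100.0)
    print(score, is_positive(score))
```

The ratio form works whether the positive control reads higher or lower than the negative control, since only the relative position of the sample matters.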

The following PRO polypeptides tested positive in this assay: PRO202, PRO224, PRO172 and
PRO1312.



EXAMPLE 18: Detection of PRO Polypeptides That Affect Glucose or FFA Uptake by Primary Rat Adipocytes
(Assay 94)

This assay is designed to determine whether PRO polypeptides show the ability to affect glucose or FFA
uptake by adipocyte cells. PRO polypeptides testing positive in this assay would be expected to be useful for
the therapeutic treatment of disorders where either the stimulation or inhibition of glucose uptake by adipocytes
would be beneficial including, for example, obesity, diabetes or hyper- or hypo-insulinemia.

In a 96 well format, PRO polypeptides to be assayed are added to primary rat adipocytes, and allowed
to incubate overnight. Samples are taken at 4 and 16 hours and assayed for glycerol, glucose and FFA uptake.
After the 16 hour incubation, insulin is added to the media and allowed to incubate for 4 hours. At this time, a
sample is taken and glycerol, glucose and FFA uptake is measured. Media containing insulin without the PRO
polypeptide is used as a positive reference control. As the PRO polypeptide being tested may either stimulate
or inhibit glucose and FFA uptake, results are scored as positive in the assay if greater than 1.5 times or less
than 0.5 times the insulin control.

The following PRO polypeptides tested positive as being capable of affecting glucose and/or FFA uptake
in this assay: PRO202, PRO211, PRO344 and PRO1338.

EXAMPLE 19: Gene Expression in Bovine Pericytes (Assay 105)

This assay is designed to identify PRO polypeptides which activate gene expression in pericytes. Such
polypeptides would be expected to be useful as growth factors and/or for situations where the activation of gene
expression is desired or beneficial. Bovine pericytes are plated on 60 mm culture dishes in growth media for 1
week. On day 1, various PRO polypeptides are diluted (1%) and incubated with the pericytes for 1, 4 and 24
hr. timepoints. The cells are harvested and the RNA isolated using TRI-Reagent following the included
instructions. The RNA is then quantified by reading the 260/280 OD using a spectrophotometer. The gene
expression analysis is done by TaqMan reactions using Perkin Elmer reagents and specially designed bovine
probes and primers. Expression of the following genes is analyzed: GAPDH, beta-integrin, connective tissue
growth factor (CTGF), ICAM-1, monocyte chemoattractant protein-1 (MCP-1), osteopontin, transforming
growth factor-beta (TGF-beta), TGF-beta receptor, tissue inhibitor of metalloproteinase (TIMP), tissue factor
(TF), VEGF-α, thrombospondin, VEGF-β, angiopoietin-2, and collagenase. Replicates are then averaged and
the SD determined. The gene expression levels are then normalized to GAPDH. These are then normalized to
the expression levels obtained with a protein (PIN32) which does not significantly induce gene expression in
bovine pericytes when compared to untreated controls. Any PRO polypeptide that gives a gene expression level
2-fold or higher over the PIN32 control is considered a positive hit.
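
The two normalization steps and the 2-fold cutoff described above can be sketched as follows. This is an illustration under stated assumptions: the function names are hypothetical, and the inputs are taken to be linear relative expression quantities from replicate TaqMan reactions (not raw threshold-cycle values, which would need converting first).

```python
from statistics import mean, stdev


def replicate_summary(values):
    """Average replicates and report the sample standard deviation."""
    return mean(values), stdev(values)


def fold_over_control(gene, gapdh, control_gene, control_gapdh):
    """Fold expression of a gene relative to the PIN32 control.

    Step 1: normalize the averaged gene signal to averaged GAPDH.
    Step 2: divide by the same GAPDH-normalized value for the PIN32 control.
    """
    sample_norm = mean(gene) / mean(gapdh)
    control_norm = mean(control_gene) / mean(control_gapdh)
    return sample_norm / control_norm


def is_hit(fold, threshold=2.0):
    """A hit is 2-fold or higher over the PIN32 control."""
    return fold >= threshold


if __name__ == "__main__":
    # Hypothetical duplicate measurements in arbitrary linear units.
    fold = fold_over_control(gene=[8, 12], gapdh=[4, 6],
                             control_gene=[2, 2], control_gapdh=[4, 4])
    print(replicate_summary([8, 12]), fold, is_hit(fold))
```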

The following PRO polypeptides tested positive in this assay: PRO366.

EXAMPLE 20: Identification of PRO Polypeptides That Activate Pericytes (Assay 125) 

This assay shows that certain polypeptides of the invention act to activate proliferation of pericyte cells
and, therefore, are useful not only as diagnostic markers for particular types of pericyte-associated tumors but
also for giving rise to antagonists which would be expected to be useful for the therapeutic treatment of pericyte-
associated tumors. Such PRO polypeptides also would be expected to be useful as growth factors and/or for
situations where the induction of cell proliferation is desired or beneficial. Activation of pericyte proliferation
also correlates with the induction of angiogenesis and, as such, PRO polypeptides capable of inducing pericyte
proliferation would be expected to be useful for the treatment of conditions where induced angiogenesis would
be beneficial including, for example, wound healing, and the like. Specifically, on day 1, pericytes are received
from VEC Technologies, and all but 5 ml media is removed from the flask. On day 2, the pericytes are
trypsinized, washed, spun and plated on 96 well plates. On day 7, the media is removed and the pericytes are
treated with 100 μl of either the specific PRO polypeptide or control treatments (positive control =
DME+5% serum +/- PDGF @ 500 ng/ml; negative control = PIN32, a polypeptide determined to have no significant
effect on pericyte proliferation). C-fos and GAPDH gene expression levels are then determined and the
replicates are averaged and the SD is determined. The c-fos values are normalized to GAPDH and the results
are expressed as fold increase over PIN32. Anything providing at least a 2-fold or higher response as compared
to the negative control is considered positive for the assay.

The following polypeptides tested positive in this assay: PRO366.

EXAMPLE 21: Ability of PRO Polypeptides to Stimulate the Release of Proteoglycans from Cartilage (Assay
221)

The ability of various PRO polypeptides to stimulate the release of proteoglycans from cartilage tissue
was tested as follows.

The metacarpophalangeal joint of 4-6 month old pigs was aseptically dissected, and articular cartilage
was removed by free hand slicing being careful to avoid the underlying bone. The cartilage was minced and
cultured in bulk for 24 hours in a humidified atmosphere of 95% air, 5% CO2 in serum free (SF) media
(DME/F12 1:1) with 0.1% BSA and 100 U/ml penicillin and 100 μg/ml streptomycin. After washing three times,
approximately 100 mg of articular cartilage was aliquoted into micronics tubes and incubated for an additional
24 hours in the above SF media. PRO polypeptides were then added at 1% either alone or in combination with
18 ng/ml interleukin-1α, a known stimulator of proteoglycan release from cartilage tissue. The supernatant was
then harvested and assayed for the amount of proteoglycans using the 1,9-dimethyl-methylene blue (DMB)
colorimetric assay (Farndale and Buttle, Biochem. Biophys. Acta 883:173-177 (1985)). A positive result in this
assay indicates that the test polypeptide will find use, for example, in the treatment of sports-related joint
problems, articular cartilage defects, osteoarthritis or rheumatoid arthritis.

When various PRO polypeptides were tested in the above assay, the polypeptides demonstrated a marked
ability to stimulate release of proteoglycans from cartilage tissue both basally and after stimulation with
interleukin-1α and at 24 and 72 hours after treatment, thereby indicating that these PRO polypeptides are useful
for stimulating proteoglycan release from cartilage tissue. As such, these PRO polypeptides are useful for the
treatment of sports-related joint problems, articular cartilage defects, osteoarthritis or rheumatoid arthritis. The
polypeptides testing positive in this assay are: PRO216.

EXAMPLE 22: Proliferation of Rat Utricular Supporting Cells

This assay shows that certain polypeptides of the invention act as potent mitogens for inner ear
supporting cells which are auditory hair cell progenitors and, therefore, are useful for inducing the regeneration
of auditory hair cells and treating hearing loss in mammals. The assay is performed as follows. Rat UEC-4
utricular epithelial cells are aliquoted into 96 well plates with a density of 3000 cells/well in 200 μl of serum-
containing medium at 33°C. The cells are cultured overnight and are then switched to serum-free medium at
37°C. Various dilutions of PRO polypeptides (or nothing for a control) are then added to the cultures and the
cells are incubated for 24 hours. After the 24 hour incubation, 3H-thymidine (1 μCi/well) is added and the cells
are then cultured for an additional 24 hours. The cultures are then washed to remove unincorporated radiolabel,
the cells harvested and Cpm per well determined. A Cpm at least 30% greater in the PRO polypeptide treated
cultures than in the control cultures is considered a positive in the assay.

The following polypeptides tested positive in this assay: PRO172.
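
The positivity criterion for the thymidine incorporation readout in this example can be expressed as a percent increase over the untreated control. The sketch below is illustrative; the function names are hypothetical, and it assumes the Cpm values have already been averaged over replicate wells.

```python
def percent_increase(treated_cpm, control_cpm):
    """Percent increase in incorporated counts over the control culture."""
    if control_cpm <= 0:
        raise ValueError("control Cpm must be positive")
    return 100.0 * (treated_cpm - control_cpm) / control_cpm


def is_mitogenic(treated_cpm, control_cpm, threshold_pct=30.0):
    """Positive when treated Cpm is at least 30% above the control."""
    return percent_increase(treated_cpm, control_cpm) >= threshold_pct


if __name__ == "__main__":
    # Hypothetical counts per minute per well.
    print(percent_increase(1300, 1000), is_mitogenic(1300, 1000))
    print(percent_increase(1100, 1000), is_mitogenic(1100, 1000))
```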

EXAMPLE 23: Stimulatory Activity in Mixed Lymphocyte Reaction (MLR) Assay (Assay 24)

This example shows that certain polypeptides of the invention are active as a stimulator of the 
proliferation of stimulated T-lymphocytes. Compounds which stimulate proliferation of lymphocytes are useful
therapeutically where enhancement of an immune response is beneficial. A therapeutic agent may take the form 
of antagonists of the polypeptide of the invention, for example, murine-human chimeric, humanized or human 
antibodies against the polypeptide. 

The basic protocol for this assay is described in Current Protocols in Immunology, unit 3.12; edited
by J E Coligan, A M Kruisbeek, D H Margulies, E M Shevach, W Strober, National Institutes of Health,
Published by John Wiley & Sons, Inc.

More specifically, in one assay variant, peripheral blood mononuclear cells (PBMC) are isolated from
mammalian individuals, for example a human volunteer, by leukopheresis (one donor will supply stimulator
PBMCs, the other donor will supply responder PBMCs). If desired, the cells are frozen in fetal bovine serum
and DMSO after isolation. Frozen cells may be thawed overnight in assay media (37°C, 5% CO2) and then
washed and resuspended to 3x10* cells/ml of assay media (RPMI; 10% fetal bovine serum, 1%
penicillin/streptomycin, 1% glutamine, 1% HEPES, 1% non-essential amino acids, 1% pyruvate). The
stimulator PBMCs are prepared by irradiating the cells (about 3000 Rads).

The assay is prepared by plating in triplicate wells a mixture of:

100 μl of test sample diluted to 1% or to 0.1%,

50 μl of irradiated stimulator cells, and

50 μl of responder PBMC cells.

100 microliters of cell culture media or 100 microliters of CD4-IgG is used as the control. The wells are then
incubated at 37°C, 5% CO2 for 4 days. On day 5, each well is pulsed with tritiated thymidine (1.0 mC/well;
Amersham). After 6 hours the cells are washed 3 times and then the uptake of the label is evaluated.
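
The plate layout and readout just described lend themselves to a small calculation. The sketch below totals the per-well volumes and expresses label uptake in triplicate test wells as a percentage of the control wells; it then applies the scoring stated later in this example (any value above control is stimulatory, with 180% or more preferred). Reading "180%" as percent of control is an assumption, as are the function and variable names.

```python
from statistics import mean

# Per-well components in microliters, as listed in the text.
WELL_COMPONENTS_UL = {
    "test sample": 100,
    "irradiated stimulator cells": 50,
    "responder PBMCs": 50,
}


def percent_of_control(test_counts, control_counts):
    """Mean label uptake in test wells as a percentage of control wells."""
    return 100.0 * mean(test_counts) / mean(control_counts)


def classify_mlr(pct_of_control, preferred_pct=180.0):
    """Classify a result; thresholds interpreted as percent of control."""
    if pct_of_control >= preferred_pct:
        return "positive (preferred)"
    if pct_of_control > 100.0:
        return "positive"
    return "negative"


if __name__ == "__main__":
    print("volume per well (ul):", sum(WELL_COMPONENTS_UL.values()))
    # Hypothetical triplicate counts.
    pct = percent_of_control([1200, 1100, 1300], [1000, 1000, 1000])
    print(pct, classify_mlr(pct))
```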

In another variant of this assay, PBMCs are isolated from the spleens of Balb/c mice and C57B6 mice.
The cells are teased from freshly harvested spleens in assay media (RPMI; 10% fetal bovine serum, 1%
penicillin/streptomycin, 1% glutamine, 1% HEPES, 1% non-essential amino acids, 1% pyruvate) and the
PBMCs are isolated by overlaying these cells over Lympholyte M (Organon Teknika), centrifuging at 2000
rpm for 20 minutes, collecting and washing the mononuclear cell layer in assay media and resuspending the cells
to 1x10^ cells/ml of assay media. The assay is then conducted as described above.

Positive increases over control are considered positive with increases of greater than or equal to 180%
being preferred. However, any value greater than control indicates a stimulatory effect for the test protein.

The following PRO polypeptides tested positive in this assay: PRO344.

EXAMPLE 24: Pericyte c-Fos Induction (Assay 93)

This assay shows that certain polypeptides of the invention act to induce the expression of c-fos in
pericyte cells and, therefore, are useful not only as diagnostic markers for particular types of pericyte-associated
tumors but also for giving rise to antagonists which would be expected to be useful for the therapeutic treatment
of pericyte-associated tumors. Induction of c-fos expression in pericytes is also indicative of the induction of
angiogenesis and, as such, PRO polypeptides capable of inducing the expression of c-fos would be expected to
be useful for the treatment of conditions where induced angiogenesis would be beneficial including, for example,
wound healing, and the like. Specifically, on day 1, pericytes are received from VEC Technologies and all but
5 ml of media is removed from flask. On day 2, the pericytes are trypsinized, washed, spun and then plated onto
96 well plates. On day 7, the media is removed and the pericytes are treated with 100 μl of PRO polypeptide
test samples and controls (positive control = DME+5% serum +/- PDGF at 500 ng/ml; negative control =
Protein 32). Replicates are averaged and SD/CV are determined. Fold increase over Protein 32 (buffer control)
value indicated by chemiluminescence units (RLU) luminometer reading versus frequency is plotted on a
histogram. Two-fold above Protein 32 value is considered positive for the assay. ASY Matrix: Growth media
= low glucose DMEM + 20% FBS + 1X pen strep + 1X fungizone. Assay Media = low glucose DMEM
+ 5% FBS.

The following polypeptides tested positive in this assay: PRO301, PRO619, PRO1066 and PRO1265. 
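
The replicate statistics and the two-fold criterion described for this assay can be sketched as follows. This is an illustrative sketch only: the function names and RLU readings are hypothetical, and treating exactly two-fold as positive (`>=`) is an assumption, since the text says only "two-fold above".

```python
from statistics import mean, stdev


def summarize(replicates):
    """Return (mean, sample SD, CV in percent) for replicate RLU readings."""
    m = mean(replicates)
    sd = stdev(replicates)
    return m, sd, 100.0 * sd / m


def fold_over_buffer(sample_rlu, buffer_rlu):
    """Fold increase of the sample mean over the Protein 32 (buffer control) mean."""
    return mean(sample_rlu) / mean(buffer_rlu)


def is_positive(sample_rlu, buffer_rlu, cutoff=2.0):
    """Apply the two-fold criterion (assumed inclusive)."""
    return fold_over_buffer(sample_rlu, buffer_rlu) >= cutoff


if __name__ == "__main__":
    buffer_rlu = [100.0, 110.0, 90.0]   # hypothetical Protein 32 wells
    sample_rlu = [300.0, 320.0, 280.0]  # hypothetical PRO polypeptide wells
    print(summarize(sample_rlu))
    print(fold_over_buffer(sample_rlu, buffer_rlu), is_positive(sample_rlu, buffer_rlu))
```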

EXAMPLE 25: Cytokine Release Assay (Assay 1?ff> 

This assay is designed to determine whether PRO polypeptides of the present invention are capable of 
inducing the release of cytokines from peripheral blood mononuclear cells (PBMCs). PRO polypeptides capable 
of inducing the release of cytokines from PBMCs are useful for the treatment of conditions which would 
benefit from enhanced cytokine release and will be readily evident to those of ordinary skill in the art. 
Specifically, 1x10* cells/ml of peripheral blood mononuclear cells (PBMCs) are cultured with 1% of a PRO 
polypeptide for 3 days in complete RPMI media. The supernatant is then harvested and tested for increased 
concentrations of various cytokines by ELISA as compared to a human IgG treated control. A positive in the 
assay is a 10-fold or greater increase in cytokine concentration in the PRO polypeptide treated sample as 
compared to the human IgG treated control. 

The following polypeptides tested positive in this assay: PRO526 and PRO1343. 

EXAMPLE 26: Inhibition of A-Peptide Binding to Factor VIIA (Assay 1 H^> 

This assay is designed to identify PRO polypeptides which are capable of inhibiting the binding of 
A-peptide to factor VIIA, thereby affecting the blood coagulation cascade. PRO polypeptides testing positive 
in this assay are expected to be useful for the treatment of conditions where alteration of the blood coagulation 
cascade would be beneficial including, for example, stroke, heart attack and various coagulation disorders. 
These PRO polypeptides are also useful for the identification of agonist and antagonist molecules which would 

Specifically, 384 well plates are coated with soluble factor VIIA and are incubated overnight at 4°C. 
The wells are then decanted and are blocked by the addition of 0.5% BSA for 1 hour. The wells are then washed 
and 20 µl of biotinylated A-peptide and either various concentrations of the PRO polypeptide (test) or nothing 
(negative control) are added to each well. The plates are then incubated for 1 hour at room temperature. The 
wells are again washed and then 40 µl of streptavidin-europium is added to each well. The plates are then 
incubated for 30 minutes at room temperature and then washed. 40 µl of a fluorescence enhancement solution 
is then added to each well, the plates incubated for 5 minutes at room temperature and each well is then read on 
Wallac Victor reader under europium delayed fluorescence settings. Percent inhibition of binding of the 
A-peptide to the factor VIIA is then determined (as compared to the negative control), wherein a positive in the 
assay is a percent inhibition of 30% or greater. 

The following PRO polypeptides tested positive in this assay: PRO182. 
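
The percent-inhibition readout for this assay can be sketched as below. This is a minimal illustration under stated assumptions: the text does not give the formula, so the sketch takes the negative-control well (no PRO polypeptide) as 0% inhibition and applies no background subtraction; the function names and fluorescence values are hypothetical.

```python
def percent_inhibition(test_signal, negative_control_signal):
    """Percent reduction in binding signal relative to the negative control."""
    if negative_control_signal <= 0:
        raise ValueError("negative control signal must be positive")
    return 100.0 * (1.0 - test_signal / negative_control_signal)


def is_positive(test_signal, negative_control_signal, cutoff=30.0):
    """True when inhibition meets the '30% or greater' criterion from the text."""
    return percent_inhibition(test_signal, negative_control_signal) >= cutoff


if __name__ == "__main__":
    # Hypothetical europium delayed-fluorescence readings.
    print(percent_inhibition(5000.0, 10000.0), is_positive(5000.0, 10000.0))
```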

EXAMPLE 27: Inhibition of Adipocyte Differentiation Assay (Assay 66) 

This assay is designed to identify PRO polypeptides which are capable of inhibiting insulin-induced 
differentiation of adipocytes. PRO polypeptides testing positive in this assay would be expected to be useful for 
the treatment of conditions associated with obesity, diabetes, etc. 

Specifically, 3T3-L1 cells are seeded into the wells of 96 well plates at 6x10^ cells/well and allowed 
to grow to confluency for 7 days. At day 7, the cells are treated with various concentrations of the PRO 
polypeptide (or nothing for the negative control) in the presence of 1 µg/ml insulin, 0.25x10^ M dexamethasone 
and 0.5 mM IBMX. The samples are then incubated at 37°C in 7% CO2 for 2 days. After the incubation, the 
media is removed by aspiration and the cells are washed with PBS and re-exposed to the PRO polypeptide (or 
nothing for the negative control) and 1 µg/ml insulin. After 5 days, the media is removed and replaced with fresh 
PRO polypeptide (or nothing for the negative control) and insulin. After 5 days, the cells are lysed and the cell 
lysate is assayed using Sigma's Triglyceride [INT] kit (Sigma procedure #336). A positive in the assay is 20% 
greater inhibition of adipocyte differentiation in the PRO polypeptide treated samples as compared to the negative 
control. 

The following PRO polypeptides tested positive in this assay: PRO185 and PRO198. 

EXAMPLE 28: HUVEC Stimulation by PRO Polypeptides (Assay 13n 

This assay is designed to identify PRO polypeptides which are capable of stimulating the proliferation 
of HUVEC cells. PRO polypeptides testing positive in this assay would be expected to be useful for inducing 
angiogenesis for the treatment of conditions where angiogenesis would be beneficial including, for example, 
wound healing, and the like. Antagonists of these PRO polypeptides would be expected to be useful for 
inhibiting angiogenesis for the treatment of, for example, tumors, and the like. 

Specifically, COSTAR* flat bottom black plates are treated with fibronectin for 20 minutes and then 
washed twice with PBS. HUVEC cells are then plated at 2000 cells/well in an appropriate growth medium. The 
plates are then incubated overnight and then the PRO polypeptide (1% final concentration), nothing (negative 
control) or IL-1β (3.3 ng/ml final concentration; positive control) is added. The plates are again incubated 
overnight, stained with ICAM1-Cy5 and read on FMAT. A positive in the assay is a 2-fold or greater increase 
in fluorescence as compared to the positive control. 

The following PRO polypeptides tested positive in this assay: PRO222. 

EXAMPLE 29: Promotion of Chondrocyte Redifferentiation (Assay l?0> 

This assay is designed to determine whether PRO polypeptides of the present invention show the ability 
to induce the proliferation and/or redifferentiation of chondrocytes in culture. PRO polypeptides testing positive 
in this assay would be expected to be useful for the therapeutic treatment of various bone and/or cartilage 
disorders such as, for example, sports injuries and arthritis. 

Porcine chondrocytes are isolated by overnight collagenase digestion of articular cartilage of the 
metacarpophalangeal joint of 4^ month old female pigs. The isolated cells are then seeded at 25,000 cells/cm² 
in Ham F-12 containing 10% FBS and 4 µg/ml gentamycin. The culture media is changed every third day. On 
day 12, the cells are seeded in 96 well plates at 5,000 cells/well in 100 µl of the same media without serum and 
100 µl of either serum-free medium (negative control), staurosporin (final concentration of 5 nM; positive 
control) or the test PRO polypeptide are added to give a final volume of 200 µl/well. After 5 days at 37°C, 22 
µl of media containing 100 µg/ml Hoechst 33342 and 50 µg/ml 5-CFDA is added to each well and incubated 
for an additional 10 minutes at 37°C. A picture of the green fluorescence is taken for each well and the 
differentiation state of the chondrocytes is calculated by morphometric analysis. A positive result in the assay 
is obtained when >50% of the PRO polypeptide treated cells are differentiated (compared to the background 
obtained by the negative control). 

The following PRO polypeptides tested positive in this assay: PRO301. 

EXAMPLE 30: Microarray Analysis to Detect Overexpression of PRO Polypeptides in Cancerous Tumors 

Nucleic acid microarrays, often containing thousands of gene sequences, are useful for identifying 
differentially expressed genes in diseased tissues as compared to their normal counterparts. Using nucleic acid 
microarrays, test and control mRNA samples from test and control tissue samples are reverse transcribed and 
labeled to generate cDNA probes. The cDNA probes are then hybridized to an array of nucleic acids 
immobilized on a solid support. The array is configured such that the sequence and position of each member of 
the array is known. For example, a selection of genes known to be expressed in certain disease states may be 
arrayed on a solid support. Hybridization of a labeled probe with a particular array member indicates that the 
sample from which the probe was derived expresses that gene. If the hybridization signal of a probe from a test 
(disease tissue) sample is greater than the hybridization signal of a probe from a control (normal tissue) sample, the 
gene or genes overexpressed in the disease tissue are identified. The implication of this result is that an 
overexpressed protein in a diseased tissue is useful not only as a diagnostic marker for the presence of the disease 
condition, but also as a therapeutic target for treatment of the disease condition. 

The methodology of hybridization of nucleic acids and microarray technology is well known in the art. 
In the present example, the specific preparation of nucleic acids for hybridization and probes, slides, and 
hybridization conditions are all detailed in U.S. Provisional Patent Application Serial No. 60/193,767, filed on 
March 31, 2000 and which is herein incorporated by reference. 

In the present example, cancerous tumors derived from various human tissues were studied for PRO 
polypeptide-encoding gene expression relative to non-cancerous human tissue in an attempt to identify those PRO 
polypeptides which are overexpressed in cancerous tumors. Two sets of experimental data were generated. In 
one set, cancerous human colon tumor tissue and matched non-cancerous human colon tumor tissue from the 
same patient ("matched colon control") were obtained and analyzed for PRO polypeptide expression using the 
above described microarray technology. In the second set of data, cancerous human tumor tissue from any of 
a variety of different human tumors was obtained and compared to a "universal" epithelial control sample which 
was prepared by pooling non-cancerous human tissues of epithelial origin, including liver, kidney, and lung. 
mRNA isolated from the pooled tissues represents a mixture of expressed gene products from these different 
tissues. Microarray hybridization experiments using the pooled control samples generated a linear plot in a 
2-color analysis. The slope of the line generated in a 2-color analysis was then used to normalize the ratios of 
(test:control detection) within each experiment. The normalized ratios from various experiments were then 
compared and used to identify clustering of gene expression. Thus, the pooled "universal control" sample not 
only allowed effective relative gene expression determinations in a simple 2-sample comparison, it also allowed 
multi-sample comparisons across several experiments. 

In the present experiments, nucleic acid probes derived from the herein described PRO 
polypeptide-encoding nucleic acid sequences were used in the creation of the microarray and RNA from the tumor tissues 
listed above were used for the hybridization thereto. A value based upon the normalized ratio:experimental ratio 
was designated as a "cutoff ratio". Only values that were above this cutoff ratio were determined to be 
significant. Table 8 below shows the results of these experiments, demonstrating that various PRO polypeptides 
of the present invention are significantly overexpressed in various human tumor tissues as compared to a 
non-cancerous human tissue control. As described above, these data demonstrate that the PRO polypeptides of the 
present invention are useful not only as diagnostic markers for the presence of one or more cancerous tumors, 
but also serve as therapeutic targets for the treatment of those tumors. 
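
The slope-based normalization and cutoff described in this example can be sketched in Python. This is a hedged illustration, not the method of the incorporated provisional application: it assumes the line from the 2-color analysis is a least-squares fit through the origin of test intensity against control intensity, and the function names, gene labels, intensities and cutoff value are all hypothetical.

```python
def slope_through_origin(control, test):
    """Least-squares slope of test vs. control intensities, forced through the origin."""
    numerator = sum(c * t for c, t in zip(control, test))
    denominator = sum(c * c for c in control)
    return numerator / denominator


def normalized_ratios(control, test):
    """Divide each raw test:control ratio by the fitted slope for the experiment."""
    k = slope_through_origin(control, test)
    return [(t / c) / k for c, t in zip(control, test)]


def overexpressed(names, control, test, cutoff):
    """Return the array members whose normalized ratio exceeds the cutoff ratio."""
    ratios = normalized_ratios(control, test)
    return [name for name, ratio in zip(names, ratios) if ratio > cutoff]


if __name__ == "__main__":
    names = ["g1", "g2", "g3", "g4"]      # hypothetical array members
    control = [10.0, 10.0, 10.0, 10.0]    # universal control channel
    test = [10.0, 10.0, 10.0, 50.0]       # tumor channel
    print(normalized_ratios(control, test))
    print(overexpressed(names, control, test, cutoff=2.0))
```

Because every experiment is scaled by its own slope against the same pooled control, normalized ratios from different experiments become comparable, which is the multi-sample benefit the text attributes to the universal control.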



Table 8 

Molecule    is overexpressed in:    as compared to: 
PRO177      prostate tumor          universal normal control 
PRO177      breast tumor            universal normal control 
PRO177      liver tumor             universal normal control 
PRO3574     lung tumor              universal normal control 
PRO3574     breast tumor            universal normal control 
PRO1280     colon tumor             universal normal control 
PRO1280     breast tumor            universal normal control 
PRO4984     lung tumor              universal normal control 
PRO4988     lung tumor              universal normal control 
PRO4988     colon tumor             universal normal control 
PRO305      lung tumor              universal normal control 
PRO305      lung tumor              universal normal control 
PRO1866     colon tumor             matched normal colon control 



Table 8 (cont') 

Molecule    is overexpressed in:    as compared to: 
PRO1866     lung tumor              universal normal control 
PRO1866     colon tumor             universal normal control 
PRO4996     breast tumor            universal normal control 
PRO4996     lung tumor              universal normal control 
PRO4406     lung tumor              universal normal control 
PRO4406     colon tumor             universal normal control 
PRO1120     colon tumor             universal normal control 
PRO1120     breast tumor            universal normal control 
PRO1120     rectal tumor            universal normal control 
PRO4990     lung tumor              universal normal control 
PRO738      cervical tumor          universal normal control 
PRO738      lung tumor              universal normal control 
PRO738      breast tumor            universal normal control 
PRO3577     lung tumor              universal normal control 
PRO1879     breast tumor            universal normal control 
PRO1879     lung tumor              universal normal control 
PRO1879     colon tumor             universal normal control 
PRO1471     lung tumor              universal normal control 
PRO1076     prostate tumor          universal normal control 
PRO1483     lung tumor              universal normal control 
PRO4985     rectal tumor            universal normal control 
PRO4985     colon tumor             universal normal control 
PRO4985     breast tumor            universal normal control 
PRO4985     lung tumor              universal normal control 
PRO5000     lung tumor              universal normal control 
PRO1881     liver tumor             universal normal control 
PRO1881     lung tumor              universal normal control 
PRO1881     breast tumor            universal normal control 
PRO4314     lung tumor              universal normal control 
PRO4314     breast tumor            universal normal control 
PRO4987     lung tumor              universal normal control 
PRO4313     lung tumor              universal normal control 
PRO4313     breast tumor            universal normal control 
PRO4799     colon tumor             universal normal control 
PRO4995     liver tumor             universal normal control 
PRO4995     colon tumor             universal normal control 
PRO4995     colon tumor             matched normal colon control 
PRO1341                             universal normal control 
PRO1341     lung tumor              universal normal control 
PRO1341     colon tumor             universal normal control 
PRO1341     colon tumor             matched normal colon control 
PRO1777     lung tumor              universal normal control 
PRO1777     colon tumor             matched normal colon control 
PRO3580     lung tumor              universal normal control 
PRO3580     prostate tumor          universal normal control 
PRO1779     lung tumor              universal normal control 
PRO1779     colon tumor             universal normal control 
PRO1779     cervical tumor          universal normal control 
PRO1754     breast tumor            universal normal control 
PRO1754     lung tumor              universal normal control 
PRO1906     breast tumor            universal normal control 
PRO1906     colon tumor             universal normal control 
PRO1906     prostate tumor          universal normal control 
PRO1870     breast tumor            universal normal control 

Table 8 (cont') 



Molecule    is overexpressed in:    as compared to: 
PRO4329     lung tumor              universal normal control 
PRO4979     colon tumor             universal normal control 
PRO1885     rectal tumor            universal normal control 
PRO1885     colon tumor             universal normal control 
PRO1885     colon tumor             matched normal colon control 
PRO1882     prostate tumor          universal normal control 
PRO1882     lung tumor              universal normal control 
PRO1882     colon tumor             universal normal control 
PRO1882     breast tumor            universal normal control 
PRO1882     cervical tumor          universal normal control 
PRO4989     rectal tumor            universal normal control 
PRO4989     breast tumor            universal normal control 
PRO4989     colon tumor             matched normal colon control 
PRO4989     colon tumor             universal normal control 
PRO4323     lung tumor              universal normal control 
PRO4323     liver tumor             universal normal control 
PRO1886     breast tumor            universal normal control 
PRO1886     lung tumor              universal normal control 
PRO1886     rectal tumor            universal normal control 
PRO4395     colon tumor             universal normal control 
PRO4395     prostate tumor          universal normal control 
PRO4395     lung tumor              universal normal control 
PRO4395     cervical tumor          universal normal control 
PRO1782                             universal normal control 
PRO1782     lung tumor              universal normal control 
PRO4388     lung tumor              universal normal control 
PRO4341     breast tumor            universal normal control 
PRO4341     lung tumor              universal normal control 
PRO3438     lung tumor              universal normal control 
PRO4321     breast tumor            universal normal control 
PRO4321     lung tumor              universal normal control 
PRO4321     colon tumor             universal normal control 
PRO4304                             universal normal control 
PRO4304     lung tumor              universal normal control 
PRO4403     colon tumor             universal normal control 
PRO4403     breast tumor            universal normal control 
PRO4403     lung tumor              universal normal control 
PRO4324     lung tumor              universal normal control 
PRO4324     breast tumor            universal normal control 
PRO4303     cervical tumor          universal normal control 
PRO4303     lung tumor 
PRO4303     breast tumor            universal normal control 
PRO4303     colon tumor             universal normal control 
PRO4303     prostate tumor          universal normal control 
PRO4305     breast tumor            universal normal control 
PRO4305     lung tumor              universal normal control 
PRO4305     colon tumor             universal normal control 
PRO4365     liver tumor             universal normal control 
PRO4404     lung tumor              universal normal control 
PRO4404     breast tumor            universal normal control 
PRO4404     rectal tumor            universal normal control 
PRO1884     lung tumor              universal normal control 
PRO4349     colon tumor             universal normal control 
PRO4349     lung tumor              universal normal control 
Molecule    is overexpressed in:    as compared to: 
PRO4401     colon tumor             universal normal control 
PRO4401     lung tumor              universal normal control 
PRO1867     lung tumor              universal normal control 
PRO1867     liver tumor             universal normal control 
PRO4319     breast tumor            universal normal control 
PRO4319     lung tumor              universal normal control 
PRO4991     lung tumor              universal normal control 
PRO4991     colon tumor             universal normal control 
PRO4398     lung tumor              universal normal control 
PRO4346     lung tumor              universal normal control 
PRO4350     colon tumor             universal normal control 
PRO4350     prostate tumor          universal normal control 
PRO4350     lung tumor              universal normal control 
PRO4318     prostate tumor          universal normal control 
PRO4318     lung tumor              universal normal control 
PRO4340     breast tumor            universal normal control 
PRO4340     lung tumor              universal normal control 
PRO4400     breast tumor            universal normal control 
PRO4400     lung tumor              universal normal control 
PRO4320     lung tumor              universal normal control 
PRO4409     lung tumor              universal normal control 
PRO4409     cervical tumor          universal normal control 
PRO4409     colon tumor             universal normal control 
PRO4399     lung tumor              universal normal control 
PRO4399     breast tumor            universal normal control 
PRO4418     lung tumor              universal normal control 
PRO4418     breast tumor            universal normal control 
PRO4330     cervical tumor          universal normal control 
PRO4330     colon tumor             matched normal colon control 
PRO4339     breast tumor            universal normal control 
PRO4339     colon tumor             universal normal control 
PRO4326     lung tumor              universal normal control 
PRO4326     colon tumor             universal normal control 
PRO6014     breast tumor            universal normal control 
PRO3446     colon tumor             universal normal control 
PRO3446     lung tumor              universal normal control 
PRO4322     lung tumor              universal normal control 
PRO4322     rectal tumor            universal normal control 
PRO4322     colon tumor             matched normal colon control 
PRO4381     breast tumor            universal normal control 
PRO4381     lung tumor              universal normal control 
PRO4381     colon tumor             universal normal control 
PRO4348     lung tumor              universal normal control 
PRO4348     prostate tumor          universal normal control 
PRO4371     breast tumor            universal normal control 
PRO3742     colon tumor             universal normal control 
PRO3742     lung tumor              universal normal control 
PRO5773     lung tumor              universal normal control 
PRO5773     colon tumor             universal normal control 
PRO5773     prostate tumor          universal normal control 
PRO5774     colon tumor             universal normal control 
PRO4343     colon tumor             universal normal control 
PRO4325     lung tumor              universal normal control 
PRO4347     lung tumor              universal normal control 

Table 8 (cont') 



Molecule    is overexpressed in:    as compared to: 
PRO4347     colon tumor             universal normal control 
PRO4347     rectal tumor            universal normal control 
PRO3743     colon tumor             universal normal control 
PRO3743     lung tumor              universal normal control 
PRO3743     prostate tumor          universal normal control 
PROWzo      colon tumor             universal normal control 
PRO4500     colon tumor             universal normal control 
PRO4389     breast tumor            universal normal control 
PRO4389     lung tumor              universal normal control 
PRO4337     colon tumor             universal normal control 
PRO4337     breast tumor            universal normal control 
PRO4337     lung tumor              universal normal control 
PRO4992     lung tumor              universal normal control 
PRO5996     lung tumor              universal normal control 
PRO4345     lung tumor              universal normal control 
PRO4345     colon tumor             universal normal control 
PRO5780     lung tumor              universal normal control 
PRO5780     breast tumor            universal normal control 
PRO5992     lung tumor              universal normal control 
PRO5992     colon tumor             universal normal control 
PRO5992     breast tumor            universal normal control 
PRO4428     prostate tumor          universal normal control 
PRO4994     lung tumor              universal normal control 
PRO5995     lung tumor              universal normal control 
PRO5995     colon tumor             universal normal control 
PRO6094     lung tumor              universal normal control 
PRO6094     colon tumor             universal normal control 
PRO4317     lung tumor              universal normal control 
PRO4317     colon tumor             universal normal control 
PRO4317     liver tumor             universal normal control 
PRO4317     colon tumor             matched normal colon control 
PRO5997     colon tumor             universal normal control 
PRO5997     lung tumor              universal normal control 
PRO5005     lung tumor              universal normal control 
PRO5005     colon tumor             universal normal control 
PRO5004     colon tumor             universal normal control 
PRO6001     breast tumor            universal normal control 
PRO6013     colon tumor             universal normal control 
PRO4502     lung tumor              universal normal control 
PRO4502     colon tumor             universal normal control 
PRO6007     breast tumor            universal normal control 
PRO6028     breast tumor            universal normal control 
PRO6028     colon tumor             universal normal control 
PRO4327     prostate tumor          universal normal control 
PRO4315     colon tumor             universal normal control 
PRO5993     lung tumor              universal normal control 
PRO5993     colon tumor             universal normal control 
PRO4503     colon tumor             universal normal control 
PRO4976     lung tumor              universal normal control 
PRO5798     lung tumor              universal normal control 
PRO5798     colon tumor             universal normal control 
PRO6242     colon tumor             universal normal control 
PRO6242     colon tumor             matched normal colon control 
PRO6242     breast tumor            universal normal control 



Table 8 (cont') 

Molecule    is overexpressed in:    as compared to: 
PRO6242     liver tumor             universal normal control 
PRO6242     rectal tumor            universal normal control 
PRO6095     breast tumor            universal normal control 
PRO6095     lung tumor              universal normal control 
PRO6093     colon tumor             universal normal control 
PRO6093     breast tumor            universal normal control 
PRO6093     lung tumor              universal normal control 
PRO6093     colon tumor             matched normal colon control 
PRO6012     colon tumor             universal normal control 
PRO6027     lung tumor              universal normal control 
PRO6027     colon tumor             universal normal control 
PRO6027     rectal tumor            universal normal control 
PRO6181     prostate tumor          universal normal control 
PRO6181     lung tumor              universal normal control 
PRO6181     colon tumor             universal normal control 
PRO6097     colon tumor             universal normal control 
PRO6097     lung tumor              universal normal control 
PRO6090     lung tumor              universal normal control 
PRO7171     lung tumor              universal normal control 
PRO7171     colon tumor             universal normal control 
PRO7171     breast tumor            universal normal control 
PRO6258     prostate tumor          universal normal control 
PRO6258     breast tumor            universal normal control 
PRO6258     cervical tumor          universal normal control 
PRO6258     liver tumor             universal normal control 
PRO6258     colon tumor             universal normal control 
PRO9820     prostate tumor          universal normal control 
PRO6243     lung tumor              universal normal control 
PRO6182     lung tumor              universal normal control 
PRO6079     lung tumor              universal normal control 
PRO6079     colon tumor             universal normal control 
PRO6079     breast tumor            universal normal control 
PRO6079     prostate tumor          universal normal control 
PRO7434     lung tumor              universal normal control 
PRO9865     colon tumor             universal normal control 
PRO9828     colon tumor             universal normal control 
PRO196      colon tumor             universal normal control 
PRO196      lung tumor              universal normal control 
PRO196      breast tumor            universal normal control 
PRO197      colon tumor             universal normal control 
PRO197      lung tumor              universal normal control 
PRO197      breast tumor            universal normal control 
PRO195      colon tumor             universal normal control 
PRO195      lung tumor              universal normal control 
PRO195      breast tumor            universal normal control 
PRO187      lung tumor              universal normal control 
PRO187      liver tumor             universal normal control 
PRO182      colon tumor             universal normal control 
PRO182      lung tumor              universal normal control 
PRO182      breast tumor            universal normal control 
PRO188      rectal tumor            universal normal control 
PRO183      colon tumor             universal normal control 
PRO183      lung tumor              universal normal control 
PRO183      breast tumor            universal normal control 

Table 8 (cont') 



Molecule    is overexpressed in:    as compared to: 
PRO183      rectal tumor            universal normal control 
PRO184      lung tumor              universal normal control 
PRO184      breast tumor            universal normal control 
PRO185      lung tumor              universal normal control 
PRO200      colon tumor             universal normal control 
PRO200      lung tumor              universal normal control 
PRO200      breast tumor            universal normal control 
PRO200      rectal tumor            universal normal control 
PRO202      colon tumor             universal normal control 
PRO202      lung tumor              universal normal control 
PRO202      breast tumor            universal normal control 
PRO202      rectal tumor            universal normal control 
PRO202      liver tumor             universal normal control 
PRO214                              universal normal control 
PRO214      lung tumor              universal normal control 
PRO215                              universal normal control 
PRO215      lung tumor              universal normal control 
PRO215                              universal normal control 
PRO219      colon tumor             universal normal control 
PRO219      lung tumor              universal normal control 
PRO219      breast tumor            universal normal control 
PRO219      liver tumor             universal normal control 
PRO211      lung tumor              universal normal control 
PRO211      breast tumor            universal normal control 
PRO220      colon tumor             universal normal control 
PRO220      lung tumor              universal normal control 
PRO220      breast tumor            universal normal control 
PRO366      colon tumor             universal normal control 
PRO366      lung tumor              universal normal control 
PRO366      breast tumor            universal normal control 
PRO216      lung tumor              universal normal control 
PRO221      colon tumor             universal normal control 
PRO221      lung tumor              universal normal control 
PRO221      breast tumor            universal normal control 
PRO228      lung tumor              universal normal control 
PRO228      breast tumor            universal normal control 
PRO217      lung tumor              universal normal control 
PRO217      breast tumor            universal normal control 
PRO222      colon tumor             universal normal control 
PRO222      lung tumor              universal normal control 
PRO222      breast tumor            universal normal control 
PRO224      colon tumor             universal normal control 
PRO224      lung tumor              universal normal control 
PRO224      breast tumor            universal normal control 
PRO224      prostate tumor          universal normal control 
PRO224      rectal tumor            universal normal control 
PRO230      colon tumor             universal normal control 
PRO230      lung tumor              universal normal control 
PRO230      breast tumor            universal normal control 
PRO230      prostate tumor          universal normal control 
PRO198      colon tumor             universal normal control 
PRO198      lung tumor              universal normal control 
PRO198      breast tumor            universal normal control 
PRO198      liver tumor             universal normal control 

Molecule 
Table 8 (cont')

Molecule     is overexpressed in:  as compared to:
PRO226       lung tumor            universal normal control
PRO226       breast tumor          universal normal control
PRO261       lung tumor            universal normal control
PRO242       colon tumor           universal normal control
PRO242       lung tumor            universal normal control
PRO242       breast tumor          universal normal control
PRO227       colon tumor           universal normal control
PRO227       lung tumor            universal normal control
PRO237       colon tumor           universal normal control
PRO237       lung tumor            universal normal control
PRO237       breast tumor          universal normal control
PRO237       prostate tumor        universal normal control
PRO241       colon tumor           universal normal control
PRO241       lung tumor            universal normal control
PRO241       breast tumor          universal normal control
PRO231       colon tumor           universal normal control
PRO231       lung tumor            universal normal control
PRO231       breast tumor          universal normal control
PRO231       rectal tumor          universal normal control
PRO235       colon tumor           universal normal control
PRO235       lung tumor            universal normal control
PRO235       breast tumor          universal normal control
PRO235       liver tumor           universal normal control
PRO323       lung tumor            universal normal control
PRO323       breast tumor          universal normal control
PRO323       rectal tumor          universal normal control
PRO245       colon tumor           universal normal control
PRO245       lung tumor            universal normal control
PRO245       breast tumor          universal normal control
PRO245       cervical tumor        universal normal control
PRO245       liver tumor           universal normal control
PRO246       colon tumor           universal normal control
PRO246       lung tumor            universal normal control
PRO246       breast tumor          universal normal control
PRO288       lung tumor            universal normal control
PRO288       breast tumor          universal normal control
PRO248       lung tumor            universal normal control
PRO248       rectal tumor          universal normal control
PRO257       colon tumor           universal normal control
PRO257       lung tumor            universal normal control
PRO257       prostate tumor        universal normal control
PRO172       colon tumor           universal normal control
PRO172       lung tumor            universal normal control
PRO172       breast tumor          universal normal control
PRO258       colon tumor           universal normal control
PRO258       lung tumor            universal normal control
PRO258       breast tumor          universal normal control
PRO265       lung tumor            universal normal control
PRO265       breast tumor          universal normal control
PRO265       rectal tumor          universal normal control
PRO326       colon tumor           universal normal control
PRO326       lung tumor            universal normal control
PRO326       breast tumor          universal normal control
PRO326       liver tumor           universal normal control
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Table 8 (conV) 



Molecule     is overexpressed in:  as compared to:
PRO266       colon tumor           universal normal control
PRO266       lung tumor            universal normal control
PRO266       breast tumor          universal normal control
PRO269       lung tumor            universal normal control
PRO269       rectal tumor          universal normal control
PRO285       colon tumor           universal normal control
PRO285       lung tumor            universal normal control
PRO285       breast tumor          universal normal control
PRO328       colon tumor           universal normal control
PRO328       lung tumor            universal normal control
PRO328       breast tumor          universal normal control
PRO344       breast tumor          universal normal control
PRO272       lung tumor            universal normal control
PRO301       colon tumor           universal normal control
PRO301       lung tumor            universal normal control
PRO301       breast tumor          universal normal control
PRO331       colon tumor           universal normal control
PRO331       lung tumor            universal normal control
PRO331       breast tumor          universal normal control
PRO332       colon tumor           universal normal control
PRO332       lung tumor            universal normal control
PRO332       breast tumor          universal normal control
PRO353       colon tumor           universal normal control
PRO353       lung tumor            universal normal control
PRO353       [illegible] tumor     universal normal control
PRO310       [illegible] tumor     universal normal control
PRO310       lung tumor            universal normal control
PRO310       breast tumor          universal normal control
PRO310       [illegible] tumor     universal normal control
PRO337       colon tumor           universal normal control
PRO337       lung tumor            universal normal control
PRO337       breast tumor          universal normal control
PRO346       lung tumor            universal normal control
PRO350       lung tumor            universal normal control
PRO350       breast tumor          universal normal control
PRO526       colon tumor           universal normal control
PRO526       lung tumor            universal normal control
PRO526       breast tumor          universal normal control
PRO381       colon tumor           universal normal control
PRO381       lung tumor            universal normal control
PRO381       breast tumor          universal normal control
PRO381       prostate tumor        universal normal control
PRO846       colon tumor           universal normal control
PRO846       lung tumor            universal normal control
PRO363       colon tumor           universal normal control
PRO363       lung tumor            universal normal control
PRO365       lung tumor            universal normal control
PRO365       breast tumor          universal normal control
PRO1310      breast tumor          universal normal control
PRO731       colon tumor           universal normal control
PRO731       lung tumor            universal normal control
PRO731       breast tumor          universal normal control
PRO322       colon tumor           universal normal control
PRO322       lung tumor            universal normal control
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Molecule 


Molecule     is overexpressed in:  as compared to:
PRO322       breast tumor          universal normal control
PRO322       rectal tumor          universal normal control
PRO322       liver tumor           universal normal control
PRO536       lung tumor            universal normal control
PRO536       breast tumor          universal normal control
PRO536       liver tumor           universal normal control
PRO719       colon tumor           universal normal control
PRO719       lung tumor            universal normal control
PRO719       breast tumor          universal normal control
PRO619       colon tumor           universal normal control
PRO619       lung tumor            universal normal control
PRO619       breast tumor          universal normal control
PRO771       colon tumor           universal normal control
PRO771       lung tumor            universal normal control
PRO771       breast tumor          universal normal control
PRO1083      colon tumor           universal normal control
PRO1083      lung tumor            universal normal control
PRO1083      breast tumor          universal normal control
PRO1083      prostate tumor        universal normal control
PRO862       [illegible] tumor     universal normal control
PRO862       lung tumor            universal normal control
PRO862       breast tumor          universal normal control
PRO733       colon tumor           universal normal control
PRO733       lung tumor            universal normal control
PRO733       breast tumor          universal normal control
PRO733       liver tumor           universal normal control
PRO1188      lung tumor            universal normal control
PRO1188      breast tumor          universal normal control
PRO1188      [illegible] tumor     universal normal control
PRO770       lung tumor            universal normal control
PRO770       [illegible] tumor     universal normal control
PRO1080      [illegible] tumor     universal normal control
PRO1080      lung tumor            universal normal control
PRO1080      breast tumor          universal normal control
PRO1017      [illegible] tumor     universal normal control
PRO1017      lung tumor            universal normal control
PRO1017      breast tumor          universal normal control
PRO1016      colon tumor           universal normal control
PRO1016      lung tumor            universal normal control
PRO1016      breast tumor          universal normal control
PRO1016      rectal tumor          universal normal control
PRO792       lung tumor            universal normal control
PRO938       colon tumor           universal normal control
PRO938       lung tumor            universal normal control
PRO938       breast tumor          universal normal control
PRO1012      colon tumor           universal normal control
PRO1012      lung tumor            universal normal control
PRO1012      rectal tumor          universal normal control
PRO1012      liver tumor           universal normal control
PRO1008      lung tumor            universal normal control
PRO1075      colon tumor           universal normal control
PRO1075      lung tumor            universal normal control
PRO1007      colon tumor           universal normal control
PRO1007      lung tumor            universal normal control
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Table 8 (cont't 



Molecule     is overexpressed in:  as compared to:
[illegible]  breast tumor          universal normal control
PRO1007      rectal tumor          universal normal control
[illegible]  colon tumor           universal normal control
[illegible]  lung tumor            universal normal control
[illegible]  breast tumor          universal normal control
[illegible]  colon tumor           universal normal control
[illegible]  lung tumor            universal normal control
[illegible]  breast tumor          universal normal control
[illegible]  rectal tumor          universal normal control
PRO1111      colon tumor           universal normal control
PRO1111      lung tumor            universal normal control
PRO1111      breast tumor          universal normal control
PRO812       lung tumor            universal normal control
PRO812       breast tumor          universal normal control
PRO812       rectal tumor          universal normal control
PRO1066      lung tumor            universal normal control
PRO1185      colon tumor           universal normal control
PRO1185      lung tumor            universal normal control
PRO1185      breast tumor          universal normal control
PRO1031      lung tumor            universal normal control
PRO1360      lung tumor            universal normal control
PRO1360      breast tumor          universal normal control
PRO1309      lung tumor            universal normal control
PRO1309      breast tumor          universal normal control
PRO1107      lung tumor            universal normal control
PRO1107      breast tumor          universal normal control
PRO836       colon tumor           universal normal control
PRO836       lung tumor            universal normal control
PRO1132      lung tumor            universal normal control
PRO1132      breast tumor          universal normal control
PRO1131      colon tumor           universal normal control
PRO1131      lung tumor            universal normal control
PRO1131      breast tumor          universal normal control
PRO1131      liver tumor           universal normal control
PRO1130      colon tumor           universal normal control
PRO1130      lung tumor            universal normal control
PRO1130      breast tumor          universal normal control
PRO844       colon tumor           universal normal control
[illegible]  lung tumor            universal normal control
PRO844       breast tumor          universal normal control
[illegible]  rectal tumor          universal normal control
[illegible]  colon tumor           universal normal control
[illegible]  lung tumor            universal normal control
[illegible]  rectal tumor          universal normal control
[illegible]  liver tumor           universal normal control
[illegible]  lung tumor            universal normal control
PRO1181      breast tumor          universal normal control
PRO1126      colon tumor           universal normal control
PRO1126      lung tumor            universal normal control
PRO1126      breast tumor          universal normal control
PRO1126      adrenal tumor         universal normal control
PRO1186      colon tumor           universal normal control
PRO1186      lung tumor            universal normal control
PRO1186      breast tumor          universal normal control
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Molecule 
Table 8 (cont')

Molecule     is overexpressed in:  as compared to:
PRO1186      liver tumor           universal normal control
PRO1198      colon tumor           universal normal control
PRO1198      lung tumor            universal normal control
PRO1159      lung tumor            universal normal control
PRO1159      breast tumor          universal normal control
PRO1159      liver tumor           universal normal control
PRO1265      colon tumor           universal normal control
PRO1265      breast tumor          universal normal control
PRO1250      colon tumor           universal normal control
PRO1250      lung tumor            universal normal control
PRO1250      breast tumor          universal normal control
PRO1475      colon tumor           universal normal control
PRO1475      breast tumor          universal normal control
PRO1312      colon tumor           universal normal control
PRO1312      lung tumor            universal normal control
PRO1312      breast tumor          universal normal control
PRO1308      colon tumor           universal normal control
PRO1308      lung tumor            universal normal control
PRO1308      liver tumor           universal normal control
PRO1326      colon tumor           universal normal control
PRO1325      lung tumor            universal normal control
PRO1326      breast tumor          universal normal control
PRO1192      colon tumor           universal normal control
PRO1192      lung tumor            universal normal control
PRO1192      breast tumor          universal normal control
PRO1246      colon tumor           universal normal control
PRO1246      lung tumor            universal normal control
PRO1246      breast tumor          universal normal control
PRO1246      prostate tumor        universal normal control
PRO1356      colon tumor           universal normal control
PRO1356      lung tumor            universal normal control
PRO1356      breast tumor          universal normal control
PRO1275      lung tumor            universal normal control
PRO1275      breast tumor          universal normal control
PRO1274      lung tumor            universal normal control
PRO1358      colon tumor           universal normal control
PRO1358      lung tumor            universal normal control
PRO1358      prostate tumor        universal normal control
PRO1286      colon tumor           universal normal control
PRO1286      lung tumor            universal normal control
PRO1286      prostate tumor        universal normal control
PRO1286      rectal tumor          universal normal control
PRO1294      colon tumor           universal normal control
PRO1294      lung tumor            universal normal control
PRO1294      breast tumor          universal normal control
PRO1294      rectal tumor          universal normal control
PRO1273      lung tumor            universal normal control
PRO1273      rectal tumor          universal normal control
PRO1279      colon tumor           universal normal control
PRO1279      lung tumor            universal normal control
PRO1195      lung tumor            universal normal control
PRO1195      breast tumor          universal normal control
PRO1271      lung tumor            universal normal control
PRO1271      breast tumor          universal normal control
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TabieSfcopn 



Molecule     is overexpressed in:  as compared to:
[illegible]  liver tumor           universal normal control
[illegible]  colon tumor           universal normal control
[illegible]  lung tumor            universal normal control
[illegible]  breast tumor          universal normal control
[illegible]  colon tumor           universal normal control
[illegible]  lung tumor            universal normal control
PRO1343      breast tumor          universal normal control
PRO1343      rectal tumor          universal normal control
PRO1434      lung tumor            universal normal control
PRO1418      lung tumor            universal normal control
PRO1418      liver tumor           universal normal control
PRO1387      colon tumor           universal normal control
PRO1387      lung tumor            universal normal control
PRO1387      prostate tumor        universal normal control
PRO1387      rectal tumor          universal normal control
PRO1384      colon tumor           universal normal control
PRO1384      lung tumor            universal normal control
PRO1565      colon tumor           universal normal control
PRO1565      lung tumor            universal normal control
PRO1565      prostate tumor        universal normal control
PRO1474      colon tumor           universal normal control
PRO1474      lung tumor            universal normal control
PRO1474      breast tumor          universal normal control
PRO1474      rectal tumor          universal normal control
PRO1917      colon tumor           universal normal control
PRO1917      lung tumor            universal normal control
PRO1917      breast tumor          universal normal control
PRO1787      colon tumor           universal normal control
PRO1787      lung tumor            universal normal control
PRO1787      breast tumor          universal normal control
PRO1556      lung tumor            universal normal control
PRO1556      breast tumor          universal normal control
PRO1561      colon tumor           universal normal control
PRO1561      lung tumor            universal normal control
PRO1561      rectal tumor          universal normal control
PRO1693      colon tumor           universal normal control
PRO1693      lung tumor            universal normal control
PRO1693      breast tumor          universal normal control
[illegible]  lung tumor            universal normal control
[illegible]  breast tumor          universal normal control
PRO1890      colon tumor           universal normal control
PRO1890      lung tumor            universal normal control
PRO1890      breast tumor          universal normal control
PRO1890      prostate tumor        universal normal control
PRO1887      colon tumor           universal normal control
PRO1887      breast tumor          universal normal control
PRO4353      lung tumor            universal normal control
PRO4353      breast tumor          universal normal control
PRO1801      colon tumor           universal normal control
PRO1801      lung tumor            universal normal control
PRO4357      lung tumor            universal normal control
PRO4357      breast tumor          universal normal control
PRO4302      colon tumor           universal normal control
PRO4302      lung tumor            universal normal control
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Table 8 fcont') 

Molecule     is overexpressed in:  as compared to:
PRO4302      breast tumor          universal normal control
PRO4302      prostate tumor        universal normal control
PRO5990      colon tumor           universal normal control
PRO5990      lung tumor            universal normal control
PRO5990      breast tumor          universal normal control

EXAMPLE [illegible]: Identification of Receptor/Ligand Interactions

In this assay, various PRO polypeptides are tested for ability to bind to a panel of potential receptor or
ligand molecules for the purpose of identifying receptor/ligand interactions. The identification of a ligand for
a known receptor, a receptor for a known ligand or a novel receptor/ligand pair is useful for a variety of
indications including, for example, targeting bioactive molecules (linked to the ligand or receptor) to a cell
known to express the receptor or ligand, use of the receptor or ligand as a reagent to detect the presence of the
ligand or receptor in a composition suspected of containing the same, wherein the composition may comprise
cells suspected of expressing the ligand or receptor, modulating the growth of or another biological or
immunological activity of a cell known to express or respond to the receptor or ligand, modulating the immune
response of cells or toward cells that express the receptor or ligand, allowing the preparation of agonists,
antagonists and/or antibodies directed against the receptor or ligand which will modulate the growth of or a
biological or immunological activity of a cell expressing the receptor or ligand, and various other indications
which will be readily apparent to the ordinarily skilled artisan.

The assay is performed as follows. A PRO polypeptide of the present invention suspected of being a
ligand for a receptor is expressed as a fusion protein containing the Fc domain of human IgG (an
immunoadhesin). Receptor-ligand binding is detected by allowing interaction of the immunoadhesin polypeptide
with cells (e.g. Cos cells) expressing candidate PRO polypeptide receptors and visualization of bound
immunoadhesin with fluorescent reagents directed toward the Fc fusion domain and examination by microscope.
Cells expressing candidate receptors are produced by transient transfection, in parallel, of defined subsets of a
library of cDNA expression vectors encoding PRO polypeptides that may function as receptor molecules. Cells
are then incubated for 1 hour in the presence of the PRO polypeptide immunoadhesin being tested for possible
receptor binding. The cells are then washed and fixed with paraformaldehyde. The cells are then incubated with
fluorescent conjugated antibody directed against the Fc portion of the PRO polypeptide immunoadhesin (e.g.
FITC conjugated goat anti-human-Fc antibody). The cells are then washed again and examined by microscope.
A positive interaction is judged by the presence of fluorescent labeling of cells transfected with cDNA encoding
a particular PRO polypeptide receptor or pool of receptors and an absence of similar fluorescent labeling of
similarly prepared cells that have been transfected with other cDNA or pools of cDNA. If a defined pool of
cDNA expression vectors is judged to be positive for interaction with a PRO polypeptide immunoadhesin, the
individual cDNA species that comprise the pool are tested individually (the pool is "broken down") to determine
the specific cDNA that encodes a receptor able to interact with the PRO polypeptide immunoadhesin.
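
The pool "break down" step described above can be illustrated with a minimal sketch. The function name, the clone and pool labels, and the `is_positive` callback are hypothetical and are not part of the specification; the callback merely stands in for the wet-lab readout (fluorescent labeling of transfected cells).

```python
from typing import Callable, Dict, List


def break_down_pools(
    pools: Dict[str, List[str]],
    is_positive: Callable[[List[str]], bool],
) -> List[str]:
    """Return the individual cDNA clones that score positive.

    `is_positive` is a hypothetical stand-in for the microscope readout.
    It is called first on each whole pool and then, for positive pools
    only, on each member clone by itself.
    """
    hits: List[str] = []
    for clones in pools.values():
        if not is_positive(clones):
            continue  # negative pool: its members are not tested further
        for clone in clones:
            if is_positive([clone]):  # the pool is "broken down"
                hits.append(clone)
    return hits


# Hypothetical example: only clone "cDNA-7" encodes a binding receptor.
binders = {"cDNA-7"}
pools = {"pool-A": ["cDNA-1", "cDNA-2"], "pool-B": ["cDNA-7", "cDNA-8"]}
hits = break_down_pools(pools, lambda clones: any(c in binders for c in clones))
print(hits)
```

Screening whole pools first means that individual clones are tested only for pools that gave a positive signal, which is the economy the pooled transfection is meant to provide.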

In another embodiment of this assay, an epitope-tagged potential ligand PRO polypeptide (e.g. 8
histidine "His" tag) is allowed to interact with a panel of potential receptor PRO polypeptide molecules that have
been expressed as fusions with the Fc domain of human IgG (immunoadhesins). Following a 1 hour
co-incubation with the epitope tagged PRO polypeptide, the candidate receptors are each immunoprecipitated
with protein A beads and the beads are washed. Potential ligand interaction is determined by western blot
analysis of the immunoprecipitated complexes with antibody directed towards the epitope tag. An interaction
is judged to occur if a band of the anticipated molecular weight of the epitope tagged protein is observed in the
western blot analysis with a candidate receptor, but is not observed to occur with the other members of the panel
of potential receptors.
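
The scoring rule for the immunoprecipitation format can be sketched as follows. This is only an illustration under a stated assumption: a band is treated as evidence of interaction only when at least one other member of the panel shows no band, so that a band in every lane is read as nonspecific. The function name and the receptor labels are hypothetical.

```python
from typing import Dict, List


def specific_binders(band_seen: Dict[str, bool]) -> List[str]:
    """Return panel members judged to interact with the tagged ligand.

    Assumption (not stated in this form in the text): a band counts only
    if at least one other panel member shows no band, so a band in every
    lane is treated as nonspecific background.
    """
    if all(band_seen.values()):
        return []  # band in every lane (or empty panel): nothing specific
    return [receptor for receptor, seen in band_seen.items() if seen]


# Hypothetical western blot readout for a three-member receptor panel.
panel = {"receptor-A": False, "receptor-B": True, "receptor-C": False}
print(specific_binders(panel))
```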

Using these assays, the following receptor/ligand interactions have been herein identified:

(1) PRO1801 binds to PRO1114 and PRO4978.

(2) PRO100 binds to PRO1114.

The foregoing written specification is considered to be sufficient to enable one skilled in the art to
practice the invention. The present invention is not to be limited in scope by the construct deposited, since the
deposited embodiment is intended as a single illustration of certain aspects of the invention and any constructs
that are functionally equivalent are within the scope of this invention. The deposit of material herein does not
constitute an admission that the written description herein contained is inadequate to enable the practice of any
aspect of the invention, including the best mode thereof, nor is it to be construed as limiting the scope of the
claims to the specific illustrations that it represents. Indeed, various modifications of the invention in addition
to those shown and described herein will become apparent to those skilled in the art from the foregoing
description and fall within the scope of the appended claims.
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PCT 

Original (for SUBMISSION) - printed on 01.12.2000 02:57:35 PM



0-1 
0-1-1 


1 Form - PCT/RO/134 (EASY) 
Indications Relating to Deposited 
Microorganism(s) or Other Biological
Material (PCT Rule 13bis)
Prepared using 


PCT-EASY Version 2.91 
(updated 10.10.2000) 


u-z 
0^3 


International Application No. 
Applicant's or agent's file reference


P3330R1 


1 

1-1 

1-* 

1-3 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


98 
34 


1-3-1 
1-3-2 

1-3-3 
1-3-4 
1-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of
America 

14 April 1998 (14.04.1998) 
ATCC 209771 


1-5 


Additional Indications 


NONE 


1-6 


Designated States for Which 
Indications are Made 


all designated States 


2 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE 


2-1 
2-3 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line 


98 

35 


2-3-1 
2-3-2 

2-3-3 
2-3-4 , 
2-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

09 February 1999 (09.02.1999) 
ATCC 203654 




Additional Indications ^ 


NONE 


2-5 

1 


Designated States for Which
Indications are Made


all designated States 


2-6 i 

1 
t 

3 1 


Separate Furnishing of Indications ' 

These indications will be submitted to
the International Bureau later


NONE


t 

< 
t 

3-1 p 
3-2 li 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:

page
line


98
36
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PCT 






3-3 

3-3-1 

3-3-2 

3-3-3 
3-3-4 


Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

25 May 1999 (25.05.1999) 
ATCC PTA-127 




Additional Indications 


NONE 


3t5 


Designated States for Which 
Indications are Made


all designated States 


3^ 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


4 . 

4-1 
4-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


98 
37 


4-3-1 
4-3-2 

4-3-3 
4-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of
America 

27 July 1999 (27.07.1999) 
ATCC PTA-429 


4^ 


Additional Indications 


NONE 




Designated States for Which 
Indications are Made 


all designated States 


Te 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


5-1 
5-2 


The indications made below relate to
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line


98 
38 


6-3 
5-3-1 

5-3-2 

5-3-3 
5-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

27 July 1999 (27.07.1999) 
ATCC PTA-432 


5-4 


Additional Indications 


NONE 




Designated States for Which 
Indications are Made 


all designated States 


si 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 
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PCT 



6-1 
6-2 

6-3-1 
6-3-2 



6-3-3 
6-3-4 



&4 



6-5 



7-1 

7-2 



7^ 

7-3-1 

7-3-2 



7-3-3 
7-3-4 



7^ 



7-6 



6-1 

8-2 



8-3 
8-3-1 

8-3-2 



8-3-3 
8-3^ 



The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 

page 

line 

Identification of Deposit

Name of depositary institution 
Address of depositary Institution 






98 
39 



Date of deposit 
Accession Number 
Additional Indications 



Designated States for Which 
Indications are Made 
Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 
The Indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line 



American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

10 December 1997 (10.12.1997) 
ATCC 209525 



NONE



all designated States 
NONE 



Identification of Deposit
Name of depositary institution 
Address of depositary institution 



Date of deposit 
Accession Number 



Additional Indications 



Designated states for Which 
Indications are Made 



Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later 



The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 

line 



8-5 



Identification of Deposit 
Name of depositary institution
Address of depositary institution 



Date of deposit 
Accession Number 
Additional Indications 



Designated States for Which
Indications are Made 



99 



American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

12 January 1999 (12.01.1999) 
ATCC 203577 



NONE 



all designated States 



NONE 



99 



American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

27 July 1999 (27.07.1999) 
ATCC PTA-430 
NONE 



all designated States 
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P3330R1 
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8-6 


Separate Furnishing of Indications 


NONE




These indications will be submitted to
the International Bureau later




9 

9-1 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


• 

99 
4 


93 
9-3-1 

9-3-2 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of


9-3-3 
9-3-4 


Date of deposit 
Accession Number 


America
08 June 1999 (08.06.1999) 
ATCC PTA-203 


95 


Additional Indications 


NONE 


9-5 


Designated States for Which 
Indications are Made


all designated States 




Separate Furnishing of Indications 


NONE 




These indications will be submitted to 
the International Bureau later 


■■ . 


10 

10-1 
10-2 


The Indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


99 
5 


10-3-1 
10-3-2 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of


10-3-3 
10-3-4 


Date of deposit 
Accession Number 


America 

01 July 1998 (01.07.1998) 
ATCC 203()40 


155 


Additional indications 


NONE 


10-5 


Designated States for Which 
Indications are Made 


all designated States 


10-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


11 

11-1 
11-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


99 
6 


11-3 
11-3-1 

11-3-2 


Identification of Deposit
Name of depositary institution 

Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 


11-3-3 
11-3-4 


Date of deposit 
Accession Number 


America 

31 August 1999 (31.08.1999) 
ATCC PTA-611 
11-4
11-5 

11-6 


Additional Indications

Designated States for Which 
Indications are Made 


NONE
all designated States 


12 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE


12-1 
12-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on: 

page 

line


1 

99 
7 


12-3-1 
12-3-2 

12-3-3 
12-3-4 
12-4 


Identification of Deposit 
Name of depositary institution
Address of depositary institution 

Date of deposit 
Accession Number
Additional Indications


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

21 January 1998 (21.01.1998) 
ATCC 20d^O^ 

NONE


12-5 
12-6


Designated States for Which
Indications are Made


all designated States 


13 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE


13-1 
13-2 

13-3


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 

line 


99 
8 


13-3-1 
13-3-2 

13-3-3 
13-3-4

13-4


Identification of Deposit 

Name of depositary institution 
Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

09 February 1999 (09.02.1999) 
ATCC 203649 

NONE


13-5 

13-6


Designated States for Which
Indications are Made


all designated States 



14 


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE


14-1
14-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:

page
line


99
9
14-3

14-3-1 

14-3-2 

14-3-3 
14-3-4 


Identification of Deposit 

Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

12 January 1999 (12.01.1999) 
ATCC 203574 


14-4


Additional Indications 


NONE


14-5


Designated States for Which
Indications are Made 


all designated States 




Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


15 

15-1 

15-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 

line 


99 
10 


15-3

15-3-1 

15-3-2 

15-3-3 
15-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 
Date of deposit 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America

25 May 1999 (25.05.1999) 
ATCC PTA-129 




Additional Indications 


NONE 


15-5


Designated States for Which 
Indications are Made 


all designated States 


15-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE 


16 

16-1 
16-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on:

page 

line 


99 
11 


16-3

16-3-1 
16-3-2 

16-3-3 
16-3-4 


Identification of Deposit 

Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

27 May 1998 (27.05.1998) 
ATCC 209905 


16-4 


Additional Indications 


NONE 


16-5


Designated States for Which 
Indications are Made 


all designated States 


16-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 
17

17-1 
17-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on: 

page 

line 



99 
12 


17-3 

17-3-1 

17-3-2 

17-3-3 
17-3-4


Identification of Deposit

Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

12 January 1999 (12.01.1999) 
ATCC 203585 


17-4 


Additional Indications


NONE


17-5 


Designated States for Which 
Indications are Made


all designated States 


17-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE 


18 

18-1 
18-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 

line 


99 
13 


18-3 

18-3-1 

18-3-2 

18-3-3 
18-3-4
18-4


Identification of Deposit 
Name of depositary institution 
Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

09 February 1999 (09.02.1999) 
ATCC 203665 




Additional Indications 


NONE 




Designated States for Which 
Indications are Made 


all designated States 




Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


19 

19-1 
19-2 

19-3


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 

line 


99 
14 


19-3-1 
19-3-2 

19-3-3 
19-3-4
19-4


Identification of Deposit

Name of depositary institution 
Address of depositary institution 


Date of deposit 

Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

27 July 1999 (27.07.1999) 
ATCC PTA-427




Additional Indications


NONE


19-5


Designated States for Which
Indications are Made


all designated States 
19-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


20 

20-1 
20-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


99 
15 


20-3
20-3-1 

20-3-2 

20-3-3 
20-3-4 


Identification of Deposit

Name of depositary institution

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

31 August 1999 (31.08.1999) 
ATCC PTA-615 


20-4


Additional Indications 


NONE


20-5


Designated States for Which 
Indications are Made 


all designated States 


20-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


21 

21-1 
21-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


99 
16 


21-3 

21-3-1 

21-3-2 

21-3-3 
21-3-4 


Identification of Deposit 

Name of depositary institution

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

12 January 1999 (12.01.1999) 
ATCC 203582 


21-4


Additional Indications 


NONE 


21-5


Designated States for Which 
Indications are Made 


all designated States 


21-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


22 

22-1 

22-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


99 
17 


22-3 
22-3-1 

22-3-2 

22-3-3 
22-3-4 


Name of depositary institution 
Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

09 March 1999 (09.03.1999)
ATCC 203838 
22-4
22-5 


Additional Indications 

Designated States for Which 
Indications are Made 


NONE

all designated States 


22-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


23-1 
23-2 
23-3


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on:
page 

line 


99 
18 


23-3-1 
23-3-2 

23-3-3 
23-3-4 
23-4 


Identification of Deposit
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

27 July 1999 (27.07.1999) 

ATCC PTA-428 

NONE 


23-6 


Designated States for Which
Indications are Made 


all designated States 


24 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE 


24-1 
24-2 
24-3


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:

page 
line 


99 
19 


24-3-1 

24-3-3 
24-3-4
24-4


Identification of Deposit
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

09 March 1999 (09.03.1999) 

ATCC 203836 

NONE




Designated States for Which
Indications are Made


all designated States 


24-6 



Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE


25

25-1
25-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:

page


99 
10 
25-3

25-3-1

25-3-2


Identification of Deposit

Name of depositary institution 

Address of depositary institution 


10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 


25-3-3
25-3-4 


Date of deposit 

Accession Number


America 

08 June 1999 (08.06.1999) 
ATCC PTA-205 


25-4 


Additional Indications 


NONE 


25-5 


Designated States for Which 
Indications are Made


all designated States 


25-6 


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later 




26 

26-1 
26-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line 


99 


26-3
26-3-1 

26-3-2 


Identification of Deposit 

Name of depositary institution

Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 


26-3-3 
26-3-4 


Date of deposit 

Accession Number


America 

27 July 1999 (27.07.1999) 
ATCC PTA-431 




Additional Indications 


NONE 


26-5 


Designated States for Which 
Indications are Made 


all designated States 


26-6


Separate Furnishing of Indications 


NONE




These indications will be submitted to 
the International Bureau later 




27 

27-1 
27-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 

page 

line 


99 

22 


27-3 

27-3-1 

27-3-2 


Identification of Deposit 

Name of depositary institution 
Address of depositary institution 


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 


27-3-3 
27-3-4 


Date of deposit 
Accession Number 


America 

09 February 1999 (09.02.1999) 
ATCC 203659 


27-4 


Additional Indications 


NONE


27-5 


Designated States for Which 
Indications are Made


all designated States 


27-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE 
28

28-1 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


99 
23 


28-3-1 
28-3-2 

28-3-3 
28-3-4


Identification of Deposit

Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

12 January 1999 (12.01.1999) 
ATCC 203584 


28-4 


Additional Indications 


NONE


28-5


Designated States for Which 
Indications are Made 


all designated States 




Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


29 

29-1 
29-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line 


99 
24 


29-3 
29-3-1 

29-3-2 

29-3-3 
29-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

25 May 1999 (25.05.1999) 
ATCC PTA-126 


29-4


Additional Indications 


NONE 


29-5


Designated States for Which 
Indications are Made


all designated States 


29-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE 


30 

30-1 

30-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


99 

25 


30-3-1 
30-3-2 

30-3-3 
30-3-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

25 May 1999 (25.05.1999) 
ATCC PTA-128 


30-4


Additional Indications 


NONE 


30-5


Designated States for Which
Indications are Made


all designated States 
30-6


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


31 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 




31-1 


page 


99 


31-2 


line 


26 


31-3 


Identification of Deposit 




31-3-1 
31-3-2 


Name of depositary institution 
Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 


31-3-3 


Date of deposit 
Accession Number 


09 February 1999 (09.02.1999) 
ATCC 203664 


31-4 


Additional Indications 


NONE 


31-5 


Designated States for Which 
Indications are Made 


all designated States 


31-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 




32 

32-1 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 


99 


32-2 


line 


27 


32-3 
32-3-1 

32-3-2 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 


32-3-3 
32-3-4 


Date of deposit 

Accession Number


America 

12 January 1999 (12.01.1999) 
ATCC 203578 


32-4


Additional Indications 


NONE 


32-5 


Designated States for Which 
Indications are Made 


all designated States 


32-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE 


33 

33-1 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 


99 


33-2 


line 


28 


33-3 
33-3-1 

33-3-2 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 


33-3-3 
33-3-4 


Date of deposit 
Accession Number 


22 December 1998 (22.12.1998) 
ATCC 203554 
33-5
33-6 


Additional Indications 

Designated States for Which 
Indications are Made 


NONE

all designated States 


34 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE


34-1 
34-2 
34-3 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


99 
29 


34-3-1 

34-3-3 
34-3-4 
34-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 March 1999 (16.03.1999) 

ATCC 203850 

NONE 


34-5
34-6 


Designated States for Which
Indications are Made 


all designated States 


35 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE


35-1 
35-2
35-3


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on: 

page 

line 


99 
30 




35-3-1 
35-3-2 

35-3-3 
35-3-4
35-4


Identification of Deposit 
Name of depositary institution 
Address of depositary institution

Date of deposit 

Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

11 May 1999 (11.05.1999) 
ATCC PTA-45 


35-5


Additional Indications 


NONE


35-6 


Designated States for Which
Indications are Made 


all designated States 


36


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE 


36-1
36-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:

page
line


99
31



148 




WO 01/40466 PCT/US00/32678



P3330R1 

Original (for SUBMISSION) - printed on 01.12.2000 02:57:35 PM



36-3 


Identification of Deposit 




36-3-1 
36-3-2 


Name of depositary institution
Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 


36-3-3 
36-3-4 


Date of deposit 
Accession Number 


22 December 1998 (22.12.1998) 
ATCC 203545 


36-4 


Additional Indications


NONE 


36-5 


Designated States for Which 
Indications are Made 


all designated States 


36-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


37 
37-1 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 


99


37-2 


line 


32 


37-3 


Identification of Deposit 




37-3-1 
37-3-2 


Name of depositary institution 
Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 


37-3-3 
37-3-4 


Date of deposit 

Accession Number 


22 December 1998 (22.12.1998) 
ATCC 203544 


37-4 


Additional Indications 


NONE 


37-5 


Designated States for Which 
Indications are Made 


all designated States 


37-6 


These indications will be submitted to
the International Bureau later 


NONE 


38 

38-1 
38-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


99 
33 


38-3


Identification of Deposit 




38-3-1 
38-3-2 


Name of depositary institution 
Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 


38-3-3 
38-3-4 


Date of deposit 
Accession Number 


America 

15 June 1999 (15.06.1999) 
ATCC PTA-234 


38-4 


Additional Indications 


NONE 


38-5 


Designated States for Which 
Indications are Made 


all designated States 


38-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 
39-2


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 

line 



99 


39-3 
39-3-1 

39-3-2 

39-3-3 
39-3-4 
39-4 


Name of depositary institution 
Address of depositary institution

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 March 1999 (16.03.1999) 
ATCC 203848 


39-5 


Additional Indications 


NONE 


39-6


Designated States for Which
Indications are Made


all designated States 




Separate Furnishing of Indications 


NONE 




These indications will be submitted to 
the International Bureau later 


40 

40-1 
40-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


99 

35 


40-3-1 

40-3-3 
40-3-4 


Identification of Deposit
Name of depositary institution 
Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

22 December 1998 (22.12.1998) 
ATCC 203555 


40-4
40-5


Additional Indications 


NONE 


40-6 


Designated States for Which 
Indications are Made 


all designated States 




Separate Furnishing of Indications 


NONE 




These indications will be submitted to
the International Bureau later


41 

41-1
41-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page

line


99 
36 


41-3
41-3-1

41-3-2

41-3-3
41-3-4


Identification of Deposit

Name of depositary institution

Address of depositary institution

Date of deposit
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America

20 April 1999 (20.04.1999) 
ATCC 203949


41-4
41-5


Additional Indications


NONE




Designated States for Which
Indications are Made


all designated States
41-6


Separate Furnishing of Indications

These indications will be submitted to 
the International Bureau later 


NONE


42 

42-1 
42-2 


The indications made below relate to
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line 



99 

37 


42-3 

42-3-1 

42-3-2 
42-3-3 


Identification of Deposit 

Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

15 December 1998 (15.12.1998)
ATCC 203539 


42-4 


Additional Indications 


NONE 


42-5 


Designated States for Which 
Indications are Made


all designated States 


42-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


43 

43-1 
43-2 


The indications made below relate to 

the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


99 
38 


43-3 

43-3-1

43-3-2 

43-3-3 
43-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 
Date of deposit 

Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

23 March 1999 (23.03.1999) 
ATCC 203871 


43-4 


Additional Indications 


NONE 


43-5


Designated States for Which 
Indications are Made


all designated States 


43-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


44 

44-1 
44-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


99 
39 


44-3 
44-3-1 

44-3-2 

44-3-3 
44-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

23 March 1999 (23.03.1999) 
ATCC 203862 
44-4


Additional Indications


NONE 


44-5 


Designated States for Which 
Indications are Made 


all designated States 


44-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


45 

45-1 
45-2 
45-3


The indications made below relate to 
the deposited microorganism(s) or 

other biological material referred to in

the description on: 
page 

line 


99 
40 


45-3-1 
45-3-2 

45-3-3 
45-3-4
455 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

10 August 1999 (10.08.1999) 
ATCC PTA-510 


455 


Additional Indications 


NONE 




Designated States for Which 
Indications are Made 


all designated States 




Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


46 

46-1 
46-2 
46-3


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in

the description on: 

page 

line 


99 
41 


46-3-1 
46-3-2 

46-3-3
46-3-4 
"465 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

20 January 1999 (20.01.1999) 
ATCC 203603 


465 




NONE 


465 


Designated States for Which 
Indications are Made 


all designated States 




Separate Furnishing of Indications

These indications will be submitted to 
the International Bureau later 


NONE


47 

47-1
47-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


99 
42 
47-3

47-3-1 

47-3-2 

47-3-3 
47-3-4 


Identification of Deposit 

Name of depositary institution

Address of depositary institution 

Date of deposit 
Accession Number 


10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

02 March 1999 (02.03.1999) 
ATCC 203813 


47-4 


Additional Indications 


NONE 


47-5 


Designated States for Which 
Indications are Made 


all designated States 


47-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


48 

48-1 
48-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


99 
43 


48-3

48-3-1 

48-3-2 

48-3-3 
48-3-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution

Date of deposit 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

02 March 1999 (02.03.1999) 
ATCC 203812


48-4


Additional Indications 


NONE 


48-5 


Designated States for Which 
Indications are Made 


all designated States 


48-6


Separate Furnishing of Indications

These indications will be submitted to 
the International Bureau later 


NONE 


49 

49-1 
49-2


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 

line 


99 
44 


49-3-1 
49-3-2

49-3-3 
49-3-4


Identification of Deposit
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

29 October 1998 (29.10.1998) 
ATCC 203391 


49-4


Additional Indications 


NONE 


49-5 


Designated States for Which 
Indications are Made 


all designated States 


49-6 



Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 
50

50-1 
50-2 
50-3 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line



99 
45 


50-3-1 
50-3-2 

50-3-3 
50-3-4 
50-4 


Identification of Deposit
Name of depositary institution
Address of depositary institution

Date of deposit
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

27 April 1999 (27.04.1999)
ATCC 203965 

NONE 


155 


Designated States for Which 
Indications are Made


all designated States 


51 


Separate Furnishing of Indications

These indications will be submitted to 
the International Bureau later 


NONE


51-1 
51-2 
51-3 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on:
page 

line 


99 
46 


51-3-1 
51-3-2 

51-3-3 
51-3-4
51-4


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

02 March 1999 (02.03.1999) 
ATCC 203816 

NONE


51-5 
51-6 


Designated States for Which 
Indications are Made 


all designated States 


52 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE


52-1
52-2

52-3


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 

page 

line 


99 
47 


52-3-1
52-3-2

52-3-3
52-3-4
52-4


Identification of Deposit
Name of depositary institution
Address of depositary institution

Date of deposit
Accession Number
Additional Indications


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of
America

02 March 1999 (02.03.1999)
ATCC 203814

NONE


52-5


Designated States for Which
Indications are Made


all designated States
52-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE


53 

53-1 
53-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on:
page 

line 



99 
48 


53-3

53-3-1 
53-3-2 

53-3-3 
53-3-4 


Identification of Deposit 

Name of depositary institution

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

02 March 1999 (02.03.1999) 
ATCC 203810 


53-4 


Additional Indications 


NONE 




Designated States for Which 
Indications are Made 


all designated States 


53-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


54 

54-1 
54-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on:
page 

line 


99 
49 


54-3
54-3-1 

54-3-2 

54-3-3 
54-3-4 
54-4


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

04 May 1999 (04.05.1999) 

AX irXA~~ ^ Z 




Additional Indications


NONE 


54-5


Designated States for Which
Indications are Made


all designated States 


54-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE 


55 

55-1 
55-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 

line 


99 
50 


55-3
55-3-1

55-3-2

55-3-3
55-3-4


Identification of Deposit
Name of depositary institution

Address of depositary institution
Date of deposit

Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

12 January 1999 (12.01.1999) 
ATCC 203580 



ISS
56



56-1 
56-2 



56-3 

56-3-1 

56-3-2 



56-3-3 
56-3-4 



56-5 
56-6



Additional Indications 

Designated States for Which
Indications are Made
Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later
all designated States
NONE 



The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 



Identification of Deposit 
Name of depositary institution 

Address of depositary institution 



Date of deposit 
Accession Number 
Additional Indications 



99 
51 



57-1 
57-2 



57-3-2 



57-3-3 
57-3-4 
57-4



57-5 



57-6 



58-1 
58-2 



Designated States for Which

Indications are Made 

Separate Furnishing of Indications 

These indications will be submitted to 

the International Bureau later

The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 

Identification of Deposit 
Name of depositary institution 
Address of depositary institution 



Date of deposit 
Accession Number 
Additional Indications 



Designated States for Which 
Indications are Made 



Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later



American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

30 March 1999 (30.03.1999) 
ATCC 203889 
NONE 



all designated States 
NONE 



99 
52 

American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

27 April 1999 (27.04.1999) 
ATCC 203964 



NONE 



all designated States 



NONE 



The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 



99 
53
58-3


Identification of Deposit 




58-3-1 
58-3-2 


Name of depositary institution 
Address of depositary institution 


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 


58-3-3 


Date of deposit 
Accession Number


22 December 1998 (22.12.1998) 
ATCC 203548 


58-4 


Additional Indications 


NONE 


58-5 


Designated States for Which 
Indications are Made


all designated States 


58-6


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later




59
59-1 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 


99 


59-2 


line 


54 


59-3
59-3-1 

59-3-2 


Identification of Deposit 
Name of depositary institution

Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 


59-3-3 

59-3-4 


Date of deposit 

Accession Number


America 

02 March 1999 (02.03.1999) 
ATCC 203817 


59-4 


Additional Indications


NONE 


59-5 


Designated States for Which 
Indications are Made 


all designated States 


59-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 




60 

60-1 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 


99 


60-2 


line 


55 


60-3 


Identification of Deposit 




60-3-1 
60-3-2 


Name of depositary institution 
Address of depositary institution 


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 


60-3-3 
60-3-4


Date of deposit 
Accession Number 


America 

15 June 1999 (15.06.1999) 
ATCC PTA-235 


60-4 


Additional Indications 


NONE 


60-5 


Designated States for Which 
Indications are Made 


all designated States 


60-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE
61-1
61-2 

61-3-1 
61-3-2 



61-3-3 
61-3-4 



The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 

Identification of Deposit 
Name of depositary institution 
Address of depositary institution 



61-4 



61-5 



61-6 



Date of deposit 
Accession Number 



Additional Indications 



100 
2

American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of
America

27 April 1999 (27.04.1999)
ATCC 203968



62 



62-1 
62-2 



62-3 

62-3-1 

62-3-2 



62-3-3 
62-3-4 
62-4



62-5
62-6



63-1 
63-2 



63-3 

63-3-1 

63-3-2 



63-3-3 
63-3-4 



63-4



Designated States for Which 

Indications are Made 

Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 



NONE



all designated States
NONE 



The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on:
page



line



Identification of Deposit 
Name of depositary institution 
Address of depositary institution 



Date of deposit 
Accession Number 
Additional Indications 



100 
3 



Designated States for Which

Indications are Made

Separate Furnishing of Indications

These indications will be submitted to 
the International Bureau later 



American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of
America

30 March 1999 (30.03.1999) 
ATCC 203894 
NONE 



all designated States
NONE 



The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 



line 



Identification of Deposit 
Name of depositary institution 
Address of depositary institution 



Date of deposit 
Accession Number
Additional Indications 



100 
4 



Designated States for Which
Indications are Made



American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

30 March 1999 (30.03.1999) 
ATCC 203893 
NONE 



all designated States
63-6


Separate Furnishing of Indications

These indications will be submitted to 
the International Bureau later 


NONE


64 

64-1 
64-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on:

page 

line 



100 
5 


64-3

64-3-1 

64-3-2 

64-3-3 
64-3-4 


Identification of Deposit 

Name of depositary institution 

Address of depositary institution 
Date of deposit 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

02 March 1999 (02.03.1999) 
ATCC 203811 


64-4


Additional Indications 


NONE 


64-5 


Designated States for Which 
Indications are Made 


all designated States 


64-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE 


65 

65-1 
65-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


100 
6 


65-3 
65-3-1 

65-3-2 

65-3-3 
65-3-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

23 March 1999 (23.03.1999) 
ATCC 203867 


65-4 


Additional Indications 


NONE 


65-5 


Designated States for Which 
Indications are Made


all designated States 


65-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


66 

66-1 
66-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line 


100 
7 


66-3 

66-3-1 

66-3-2 

66-3-3 
66-3-4


Identification of Deposit
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

27 April 1999 (27.04.1999)
ATCC 203963
66-4


Additional Indications 


NONE 


66-5 


Designated States for Which 
Indications are Made 


all designated States 


66-6 
67 


Separate Furnishing of Indications

These indications will be submitted to 
the International Bureau later 


NONE 


67-1 
67-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


100 
8 


67-3 
67-3-1 

67-3-2 

67-3-3 
67-3-4 
67-4 


Identification of Deposit
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

02 March 1999 (02.03.1999) 
ATCC 203fil>> 




Additional Indications 


NONE 


67-5 


Designated States for Which 
Indications are Made 


all designated States 


67-6 


Separate Furnishing of Indications 


NONE 




These indications will be submitted to 
the International Bureau later 


68-1
68-2

68-3


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on:
page

line


100 
9 


68-3-1
68-3-2

68-3-3
68-3-4


Identification of Deposit 
Name of depositary institution 
Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

30 March 1999 (30.03.1999) 
ATCC 203890 




Additional Indications


NONE 


68-5 
68-6 


Designated States for Which 
Indications are Made 


all designated States 




Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE


69-1
69-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


100
10
69-3


Identification of Deposit 




69-3-1 
69-3-2 


Name of depositary institution 
Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 


69-3-3 
69-3-4 


Date of deposit 
Accession Number


25 May 1999 (25.05.1999) 
ATCC PTA-130 


69-4 


Additional Indications 


NONE


69-5 


Designated States for Which 
Indications are Made 


all designated States 


69-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


70 
70-1 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 


100 


70-2 


line 


11 


70-3


Identification of Deposit 




70-3-2 


Name of depositary institution 
Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 


70-3-3 

70-3-4 


Date of deposit 
Accession Number 


27 April 1999 (27.04.1999)
ATCC 203970 


70-4 


Additional Indications 


NONE 


70-5 


Designated States for Which 
Indications are Made 


all designated States 


70-6 


Separate Furnishing of Indications 

These indications will be submitted to

the International Bureau later 


NONE 


71 

71-1 
71-2 


The indications made below relate to 

the deposited microorganism(s) or

other biological material referred to in

the description on: 

page 

line 


100 
12 


71-3 


Identification of Deposit 




71-3-1 
71-3-2 


Name of depositary institution 
Address of depositary institution 


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 


71-3-3 
71-3-4 


Date of deposit 
Accession Number 


16 March 1999 (16.03.1999) 
ATCC 203845 


71-4 


Additional Indications 


NONE 


71-5


Designated States for Which 
Indications are Made 


all designated States 


71-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE
72

72-1 
72-2 
72-3


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line


100 
13 


72-3-1 
72-3-2 

72-3-3 
72-3-4 
72-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

23 March 1999 (23.03.1999)
ATCC 203861 

NONE


72-5 
72-6 


Designated States for Which
Indications are Made 


all designated States 


73 


Separate Furnishing of Indications

These indications will be submitted to 
the International Bureau later 


NONE 


73-1 

73-2 
73-3 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


100 
14 


73-3-1 
73-3-2 

73-3-3 
73-3-4 
73-4


Identification of Deposit 
Name of depositary institution 
Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 March 1999 (16.03.1999) 
ATCC 203844 

NONE


73-5
73-6


Designated States for Which
Indications are Made


all designated States 


74 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


74-1 
74-2 

74-3


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 

page 

line


100 

15 


74-3-1
74-3-2

74-3-3
74-3-4
74-4


Identification of Deposit
Name of depositary institution
Address of depositary institution

Date of deposit
Accession Number
Additional Indications


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of
America

10 August 1999 (10.08.1999)
ATCC PTA-513

NONE


74-5


Designated States for Which
Indications are Made


all designated States
74-6


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later


NONE 


75 

75-1 
75-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


100 
16 


75-3

75-3-1 

75-3-2 
75-3-3 


Identification of Deposit 

Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

09 February 1999 (09.02.1999) 
ATCC 203663 


75-4


Additional Indications 


NONE 


75-5 


Designated States for Which 
Indications are Made


all designated States 


75-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


76 

76-1 

76-2 


The indications made below relate to

the deposited microorganism(s) or

other biological material referred to in 
the description on: 

page 

line 


100 
17 


76-3 
76-3-1 

76-3-2 

76-3-3 
76-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution
Date of deposit 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

16 March 1999 (16.03.1999) 
ATCC 203851 


76-4 


Additional Indications 


NONE


76-5


Designated States for Which 
Indications are Made 


all designated States 


76-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


77 

77-1 
77-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


100 
18 


77-3
77-3-1 

77-3-2 

77-3-3 
77-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

20 April 1999 (20.04.1999)
ATCC 203950
77-4


Additional Indications 


NONE 


77-5


Designated States for Which
Indications are Made


all designated States 


77-6 


Separate Furnishing of Indications 


NONE 


78 


These indications will be submitted to
the International Bureau later 


78-1 
78-2 
78-3


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on:

page

line 


100 
19 


78-3-1 
78-3-2 

78-3-3 
78-3-4 


Identification of Deposit
Name of depositary institution

Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

30 March 1999 (30.03.1999) 
ATCC 203895 




Additional Indications 


NONE 


78-5
78-6 


Designated States for Which 
Indications are Made 


all designated States 




Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE


79 

79-1 
79-2 
79-3 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 


100 


line 


20 


79-3-1 
79-3-2 

79-3-3 
79-3-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

25 May 1999 (25.05.1999) 
ATCC PTA-134 


79-4


Additional Indications 


NONE


79-5


Designated States for Which 
Indications are Made 


all designated States 




Separate Furnishing of Indications 


NONE




These indications will be submitted to 
the International Bureau later 


80

80-1
80-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


100
21
80-3
80-3-1 

80-3-2 

80-3-3 
80-3-4 


Identification of Deposit 

Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

16 March 1999 (16.03.1999) 
ATCC 203852 


80-4 


Additional Indications


NONE 


80-5 


Designated States for Which 
Indications are Made 


all designated States 


80-6


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


81 

81-1 
81-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


100 
22 


81-3 
81-3-1 

81-3-2 

81-3-3 
81-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 

Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of
America 

22 June 1999 (22.06.1999) 
ATCC PTA-258 


81-4 


Additional Indications 


NONE 


81-5 


Designated States for Which 
Indications are Made 


all designated States 


81-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


82 

82-1 
82-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on: 
page 

line 


100 
23 


82-3 

82-3-1 

82-3-2 

82-3-3 
82-3-4 


Identification of Deposit 

Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

22 June 1999 (22.06.1999) 
ATCC PTA-259 


82-4


Additional Indications 


NONE 


82-5 


Designated States for Which 
Indications are Made 


all designated States 


82-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE
83

83-1 
83-2

83-3


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on: 
page 

line 



100 
24 


83-3-1 
83-3-2

83-3-3
83-3-4 
83-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

23 March 1999 (23.03.1999) 

ATCC 203866 

NONE 


83-5
83-6


Designated States for Which
Indications are Made


all designated States 




Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


84 

84-1
84-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


100 


84-3
84-3-1

84-3-2

84-3-3
84-3-4


Identification of Deposit 
Name of depositary institution 
Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

16 March 1999 (16.03.1999) 
ATCC 203853 




Additional Indications 


NONE 


84-5
84-6


Designated States for Which 
Indications are Made 


all designated States 




Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


85 

85-1 
85-2 
85-3


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 

line 


100 


85-3-1 
85-3-2

85-3-3
85-3-4
85-4


Identification of Deposit
Name of depositary institution
Address of depositary institution

Date of deposit

Accession Number
Additional Indications


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

30 March 1999 (30.03.1999) 

ATCC 203892 

NONE


85-5


Designated States for Which
Indications are Made


all designated States
85-6


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE 


86-1
86-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


100 
27 


86-3 

86-3-1 

86-3-2 
86-3-3 


Identification of Deposit 

Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 March 1999 (16.03.1999) 
ATCC 203847 


86-4


Additional Indications 


NONE 


86-5 


Designated States for Which 
Indications are Made 


all designated States 




Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


87 

87-1 
87-2 


The indications made below relate to

the deposited microorganism(s) or

other biological material referred to in

the description on: 

page 

line 


100 
28 


87-3 
87-3-1 

87-3-2 

87-3-3 
87-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 
Date of deposit 

Accession Number


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

04 May 1999 (04.05.1999) 
ATCC PTA-21 


87-4 


Additional Indications 


NONE 


87-5 


Designated States for Which 
Indications are Made 


all designated States 


87-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


88 

88-1 
88-2 


The indications made below relate to
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line 


100 
29 


88-3 
88-3-1 

88-3-2 

88-3-3 
88-3-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of
America

25 May 1999 (25.05.1999)
ATCC PTA-121
88-4
88-5 

88-6 


Additional Indications 

Designated States for Which
Indications are Made


NONE

all designated States 


89 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later


NONE


89-1 
89-2 
89-3


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


100 
30 


89-3-1 
89-3-2 

89-3-3 
89-3-4 


Identification of Deposit
Name of depositary institution
Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

20 April 1999 (20.04.1999) 
ATCC 203951 

NONE


89-5

89-6


Designated States for Which
Indications are Made


all designated States 


90 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


90-1 
90-2 
90-3


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line


100 
31 


90-3-1 
90-3-2 

90-3-3 
90-3-4 
90-4


Identification of Deposit
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

23 March 1999 (23.03.1999) 
ATCC 203869 
NONE


90-5

90-6


Designated States for Which
Indications are Made


all designated States 


91


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE


91-1
91-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


100
32
91-3
91-3-1 

91-3-2 

91-3-3 
91-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 

Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

15 June 1999 (15.06.1999) 


91-4


Additional Indications


NONE 


91-5 


Designated States for Which 
Indications are Made


all designated States 


91-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


92 

92-1 
92-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page 

line 


100 
33 


92-3 
92-3-1 

92-3-2 

92-3-3 
92-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of
America

20 July 1999 (20.07.1999)


92-4 


Additional Indications


NONE 


92-5


Designated States for Which 
Indications are Made 


all designated States 


92-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


93 

93-1 
93-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


100 
34 


93-3 
93-3-1 

93-3-2 

93-3-3 
93-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

23 March 1999 (23.03.1999) 
ATCC 203864 


93-4 


Additional Indications 


NONE 


93-5 


Designated States for Which 
indications are Made 


all designated States 


93-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE
94

94-1 
94-2 
94-3 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


100 
35 


94-3-1 
94-3-2 

94-3-3 
94-3-4 
94-4 


Identification of Deposit 
Name of depositary institution
Address of depositary institution 

Date of deposit 
Accession Number
Additional Indications


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

22 June 1999 (22.06.1999) 
ATCC PTA-262 

NONE


94-5
94-6


Designated States for Which 
Indications are Made


all designated States 


95 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE


95-1 
95-2 
95-3 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


100 
36 


95-3-1 
95-3-2 

95-3-3
95-3-4
95-4


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

20 July 1999 (20.07.1999) 

ATCC PTA-381 

NONE 


95-5 
95-6


Designated States for Which 
Indications are Made 


all designated States 




Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


96-1 

96-2
96-3 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on:

page

line


100 
37 


96-3-1 
96-3-2

96-3-3
96-3-4
96-4


Identification of Deposit

Name of depositary institution

Address of depositary institution

Date of deposit
Accession Number
Additional Indications


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of
America

04 May 1999 (04.05.1999)
ATCC PTA-15

NONE


96-5


Designated States for Which
Indications are Made


all designated States
96-6


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE 


97 

97-1 
97-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on:
page 

line 


100 
38 


97-3 

97-3-1 

97-3-2 

97-3-3
97-3-4 


Identification of Deposit

Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

15 June 1999 (15.06.1999) 
ATCC PTA-239 


97-4


Additional Indications 


NONE 


97-5 


Designated States for Which 
Indications are Made 


all designated States 


97-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


98 

98-1
98-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on: 

page 

line 


100 
39 


98-3
98-3-1 

98-3-2 

98-3-3
98-3-4


Identification of Deposit 
Name of depositary institution 
Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection

10801 University Blvd., Manassas,

Virginia 20110-2209United States of

America

20 July 1999 (20.07.1999) 

ATCC PTA~3o4 


98-4


Additional Indications 


NONE 


98-5 


Designated States for Which 
Indications are Made 


all designated States 


98-6 


Separate Furnishing of indications 

These indications will be submitted to 
the International Bureau later 


NONE


99

99-1 
99-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 

line 


100 
40 


99-3
99-3-1 

99-3-2 

99-3-3 
99-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

03 August 1999 (03.08.1999) 
ATCC PTA-475
99-4


Additional Indications


NONE 


99-5 


Designated States for Which 
Indications are Made 


all designated States 


99-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the international Bureau later 


NONE 


100 

100-1 
100-2 
100-3 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


100 
41 


100-3-1 
100-3-2 

100-3-3 
100-3-4 
100-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 March 1999 (16.03.1999) 

ATCC 203854 

NONE 


100-5 


Designated States for Which
Indications are Made 


all designated States 


100-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


101 

101-1 
101-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 

line 


100 
42 


101-3 
101-3-1 

101-3-2 

101-3-3 
101-3-4 
101-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

20 July 1999 (20.07.1999) 

ATCC PTA-378 

NONE 


101-5 
101-6


Designated States for Which
Indications are Made 


all designated States 


102 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


102-1
102-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


100
43
102-3
102-3-1 

102-3-2 

102-3-3 
102-3-4 


Identification of Deposit 
Name of depositary institution

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection

10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

22 June 1999 (22.06.1999) 
ATCC PTA-257 


102-4 


Additional Indications 


NONE 


102-5 


Designated States for Which 
Indications are Made 


all designated States 


102-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


103 

103-1 
103-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


100 
44 


103-3 

103-3-1 

103-3-2

103-3-3 
103-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 
Date of deposit 

Accession Number


American Type Culture Collection 
10801 University Blvd. , Manassas, 
Virginia 20110-2209United States of 
America 

15 June 1999 (15.06.1999) 
ATCC PTA-231 


103-4 


Additional Indications 


NONE 


103-5 


Designated States for Which 
Indications are Made 


all designated States 


103-6


Separate Furnishing of Indications 

These indications will be submitted to 
the international Bureau later 


NONE 


104 

104-1 
104-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:

page 

line 


100 
45 


104-3 

104-3-1 

104-3-2 

104-3-3 
104-3-4 


Identification of Deposit 

Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of
America 

20 July 1999 (20.07.1999) 
ATCC PTA-388 


104-4 


Additional Indications 


NONE 


104-5 


Designated States for Which 
Indications are Made 


all designated States 


104-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE
105

105-1
105-2
105-3 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


100 


105-3-1 
105-3-2 

105-3-3 
105-3-4 
105-4 


Identification of Deposit
Name of depositary institution

Address of depositary institution

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

31 August 1999 (31.08.1999) 
ATCC PTA-620 

NONE 


105-5 
105-6


Designated States for Which 
Indications are Made


all designated States 




Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later


NONE 


106-1 
106-2 
106-3 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on:
page 

line 


100 


106-3-1 
106-3-2 

106-3-3 
106-3-4


Identification of Deposit 
Name of depositary institution 
Address of depositary institution

Date of deposit 
Accession Number 
Additional Indications


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

25 May 1999 (25.05.1999) 

ATCC PTA-118 

NONE


106-5 
106-6 


Designated States for Which 
Indications are Made 


all designated States


107 


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE 


107-1 
107-2 


The Indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line


100 
48 


107-3
107-3-1

107-3-2

107-3-3
107-3-4
107-4


Identification of Deposit

Name of depositary institution

Address of depositary institution

Date of deposit
Accession Number
Additional Indications


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of
America

03 August 1999 (03.08.1999)
ATCC PTA-477

NONE


107-5


Designated States for Which
Indications are Made


all designated States
107-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


108 

108-1 
108-2 


The Indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line 


100 
49 


108-3 

108-3-1 

108-3-2 

108-3-3 
108-3-4 


Identification of Deposit 
Name of depositary institution

Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of
America 

03 August 1999 (03.08.1999) 
ATCC PTA-488 


108-4 


Additional Indications


NONE 


108-5 


Designated States for Which 
Indications are Made 


all designated States 


108-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later


NONE 


109 

109-1 
109-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


100 
50 


109-3 
109-3-1 

109-3-2 

109-3-3 
109-3-4 


Identification of Deposit 
Name of depositary institution

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

16 March 1999 (16.03.1999) 
ATCC 203849 


109-4 


Additional Indications 


NONE 


109-5 


Designated States for Which 
Indications are Made 


all designated States 


109-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


110 

110-1 
110-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line 


100 
51 


110-3 
110-3-1 

110-3-2 

110-3-3 
110-3-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

09 March 1999 (09.03.1999) 
ATCC 203837
110-4


Additional Indications


NONE 


110-5


Designated States for Which
Indications are Made 


all designated States 


110-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


111 

111-1 

111-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on: 
page 

line 


100 
52 


111-3 
111-3-1 

111-3-2 

111-3-3 
111-3-4 


Identification of Deposit 
Name of depositary institution

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

20 July 1999 (20.07.1999) 
ATCC PTA-380 


111-4 


Additional Indications 


NONE 


111-5 


Designated States for Which 
indications are Made 


all designated States 


111-6 


Separate Furnishing of indications 

These indications will be submitted to
the International Bureau later 


NONE 


112 

112-1 
112-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:

page

line


100 
53 


112-3 
112-3-1 

112-3-2 

112-3-3 
112-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

11 May 1999 (11.05.1999) 
ATCC PTA-44 


112-4 


Additional Indications 


NONE 


112-5 


Designated States for Which 
Indications are Made 


all designated States 


112-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


113 

113-1 
113-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


100 
54
113-3
113-3-1 

113-3-2 

113-3-3 
113-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

11 May 1999 (11.05.1999) 
ATCC PTA-42


113-4 


Additional Indications 


NONE 


113-5 


Designated States for Which 
Indications are Made 


all designated States 


113-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE


114 

114-1 
114-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


100 
55 


114-3 
114-3-1 

114-3-2 

114-3-3 
114-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd. , Manassas, 
Virginia 20110-2209United States of
America 

25 May 1999 (25.05.1999) 
ATCC PTA-123 


114-4 


Additional Indications 


NONE


114-5 


Designated States for Which 
Indications are Made 


all designated States 


114-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later


NONE 


115 

115-1 
115-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:

page 

line 


101 
2 


115-3 

115-3-1 

115-3-2 

115-3-3 
115-3-4


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

03 August 1999 (03.08.1999) 
ATCC PTA-482 


115-4 


Additional Indications 


NONE 


115-5 


Designated States for Which 
Indications are Made


all designated States 


115-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE
116

116-1 
116-2 
116-3 


The Indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


101 
3 


116-3-1 
116-3-2 

116-3-3 
116-3-4 
116-4


Identification of Deposit
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

03 August 1999 (03.08.1999) 
ATCC PTA-483 

NONE 


116-5 
116-6 


Designated States for Which 
Indications are Made 


all designated States 


117 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE


117-1 
117-2 
117-3 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line 


101 
4 


117-3-1 
117-3-2 

117-3-3 
117-3-4
117-4 


Identification of Deposit 
Name of depositary institution
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

03 August 1999 (03.08.1999) 
ATCC PTA-485 
NONE


117-5 
117-6 


Designated States for Which
Indications are Made 


all designated States 


118 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE


118-1 
118-2 
118-3 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on:

page

line


101 

5 


118-3-1 
118-3-2

118-3-3
118-3-4
118-4


Identification of Deposit

Name of depositary institution
Address of depositary institution

Date of deposit
Accession Number
Additional Indications


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of
America

03 August 1999 (03.08.1999)
ATCC PTA-480

NONE


118-5


Designated States for Which
Indications are Made


all designated States
118-6


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


119 

119-1 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 


101 


119-2 


line 


6 


119-3 


Identification of Deposit 




119-3-1 
119-3-2 


Name of depositary institution 
Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 


119-3-3 
119-3-4


Date of deposit 
Accession Number 


03 August 1999 (03.08.1999) 
ATCC PTA-476 


119-4


Additional Indications 


NONE 


119-5 


Designated States for Which 
Indications are Made 


all designated States 


119-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


120 

120-1 
120-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


101 
7 


120-3 


Identification of Deposit 




120-3-1
120-3-2


Name of depositary institution

Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America


120-3-3 


Date of deposit 

Accession Number


03 August 1999 (03.08.1999) 
ATCC PTA-472 


120-4 


Additional Indications 


NONE 


120-5


Designated States for Which 
indications are Made 


all designated States 


120-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


121 

121-1 


The Indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 


101 


121-2 


line 


8 


121-3 


Identification of Deposit 




121-3-1 
121-3-2 


Name of depositary institution 
Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 


121-3-3 
121-3-4


Date of deposit 
Accession Number 


03 August 1999 (03.08.1999) 
ATCC PTA-487
121-4
121-5

121-6


Additional Indications

Designated States for Which
Indications are Made


NONE

all designated States


122


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE


122-1 
122-2
122-3 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


101 
9 


122-3-1 
122-3-2 

122-3-3 
122-3-4 
122-4


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of
America 

03 August 1999 (03.08.1999) 
ATCC PTA-484 

NONE


122-5 
122-6 


Designated States for Which 
Indications are Made 


all designated States 


123 


Separate Furnishing of Indications 

These Indications will be submitted to 
the International Bureau later 


NONE


123-1 
123-2 
123-3 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on: 
page 

line 


101 
10 


123-3-1 
123-3-2 

123-3-3 
123-3-4 

123-4


Identification of Deposit
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 

Additional Indications


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

17 August 1999 (17.08.1999) 

NONE


123-5
123-6


Designated States for Which
Indications are Made


all designated States


124


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE


124-1
124-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:

page
line


101
11
124-3


Identification of Deposit 




124-3-1 
124-3-2 


Name of depositary institution 
Address of depositary institution 


10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 


124-3-3
124-3-4 


Date of deposit 
Accession Number


10 August 1999 (10.08.1999) 
ATCC PTA-515 


124-4


Additional Indications 


NONE 


124-5 


Designated States for Which 
Indications are Made 


all designated States 


124-6


Separate Furnishing of Indications

These indications will be submitted to 
the International Bureau later 


NONE 


125 
125-1 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 


101 


125-2 


line 


12 


125-3 


Identification of Deposit 




125-3-1 
125-3-2 


Name of depositary institution
Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 


125-3-3 
125-3-4 


Date of deposit 
Accession Number


19 October 1999 (19.10.1999) 
ATCC PTA-861 


125-4


Additional Indications 


NONE 


125-5 


Designated States for Which 
Indications are Made 


all designated States 


125-6 


Separate Furnishing of Indications 

These Indications will be submitted to 
the International Bureau later 


NONE 


126 

126-1

126-2


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 

line 


101 
13 


126-3 


Identification of Deposit 




126-3-1 
126-3-2 


Name of depositary institution 
Address of depositary institution 


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 


126-3-3 
126-3-4 


Date of deposit 
Accession Number 


10 August 1999 (10.08.1999) 
ATCC PTA-518 


126-4


Additional Indications 


NONE 


126-5 


Designated States for Which 
Indications are Made 


all designated States 


126-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 



181 



WO 01/40466

PCT/US00/32678



PCT 

Original (for SUBMISSION) - printed on 01.12.2000 02:57:35 PM 



127 

127-1 

127-2

127-3


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page
line


101 
14 


127-3-1 
127-3-2 

127-3-3 
127-3-4 
127-4 


Identification of Deposit
Name of depositary institution
Address of depositary institution

Date of deposit
Accession Number

Additional Indications


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

10 August 1999 (10.08.1999) 

NONE


127-5 
127-6


Designated States for Which
Indications are Made


all designated States


128 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE


128-1 
128-2 
128-3 


The Indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line 


101 

15 


128-3-1 
128-3-2 

128-3-3 
128-3-4 
128-4


Identification of Deposit 
Name of depositary institution 
Address of depositary institution

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of
America 

03 August 1999 (03.08.1999) 

^ ffif^^ rnnn iiAn 

NONE 


128-5 
128-6 


Designated States for Which 
Indications are Made


all designated States 


129 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE


129-1
129-2
129-3


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


101

16


129-3-1
129-3-2

129-3-3
129-3-4
129-4


Identification of Deposit
Name of depositary institution
Address of depositary institution

Date of deposit
Accession Number
Additional Indications


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of
America

31 August 1999 (31.08.1999)
ATCC PTA-614

NONE


129-5


Designated States for Which
Indications are Made


all designated States
129-6
130


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


130-1 
130-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:

page

line


101
17


130-3 
130-3-1 

130-3-2 

130-3-3 
130-3-4


Identification of Deposit 
Name of depositary institution

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 November 1999 (16.11.1999) 
ATCC PTA-957 


130-4 


Additional Indications 


NONE 


130-5 


Designated States for Which 
Indications are Made 


all designated States 


130-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


131 

131-1 
131-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


101

18 


131-3 
131-3-1 

131-3-2 
131-3-3 


Identification of Deposit 
Name of depositary institution

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd. , Manassas, 
Virginia 20110-2209United States of 
America 

05 October 1999 (05.10.1999) 
ATCC PTA-819 


131-4 


Additional Indications


NONE 


131-5 


Designated States for Which 
Indications are Made 


all designated States 


131-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


132 

132-1 
132-2
132-3


the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


101 
19 


132-3-1
132-3-2

132-3-3
132-3-4


Identification of Deposit

Name of depositary institution

Address of depositary institution

Date of deposit

Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of
America

18 September 1997 (18.09.1997)
ATCC 209280
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132-4 


Additional Indications 


NONE 


132-5


Designated States for Which 
Indications are Made


all designated States 


132-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


133 

133-1 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 

page 

line 


101 
20 


133-3 
133-3-1 

133-3-2 

133-3-3 
133-3-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

14 April 1998 (14.04.1998) 


133-4 


Additional Indications 


NONE 


133-5 


Designated States for Which 
Indications are Made 


all designated States 


133-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


134 

134-1 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 

page 

line 


101 
21 


134-3 

134-3-1 

134-3-2 

134-3-3 
134-3-4 


Identification of Deposit

Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 October 1997 (16.10.1997) 
ATCC 209375 


134-4 


Additional Indications 


NONE 


134-5 


Designated States for Which 
Indications are Made 


all designated States 


134-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


135 

135-1 
135-2 


The indications made below relate to
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line


101 
22 
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135-3 

135-3-1 

135-3-2 

135-3-3 
135-3-4 


Identification of Deposit

Name of depositary institution 

Address of depositary institution 
Date of deposit 

Accession Number


10801 University Blvd., Manassas, 
Virginia 20110-2209United States of
America 

23 September 1997 (23.09.1997) 
ATCC 209296 


135-4 


Additional Indications 


NONE 


135-5 


Designated States for Which 
Indications are Made 


all designated States 


135-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


136 

136-1 
136-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 

page 

line 


101 
23 


136-3 
136-3-1 

136-3-2 
136-3-3


Identification of Deposit
Name of depositary institution

Address of depositary institution 

Date of deposit 

Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America

18 September 1997 (18.09.1997) 
ATCC 209279 


136-4 


Additional Indications 


NONE 


136-5 


Designated States for Which 
Indications are Made 


all designated States 


136-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


137 

137-1 
137-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


101 
24 


137-3 

137-3-1 

137-3-2 

137-3-3 
137-3-4 


Identification of Deposit 

Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

05 March 1998 (05.03.1998) 
ATCC 209653 


137-4 


Additional Indications 


NONE 


137-5 


Designated States for Which 
Indications are Made 


all designated States 


137-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 
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138 

138-1 
138-2 
138-3 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page 

line 



101 

25 


138-3-1 
138-3-2

138-3-3 
138-3-4
138-4 


Identification of Deposit 
Name of depositary institution
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

16 October 1997 (16.10.1997)
ATCC 209385

NONE


138-5
138-6 


Designated States for Which 
Indications are Made 


all designated States 


139 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE


139-1 
139-2 
139-3 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line


101 
26 


139-3-1
139-3-2 

139-3-3 
139-3-4 
139-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

16 September 1997 (16.09.1997) 
ATCC 209261 

NONE


139-5
139-6


Designated States for Which
Indications are Made 


all designated States 


140


Separate Furnishing of Indications 

These indications will be submitted to

the International Bureau later

The indications made below relate to


NONE


140-1

140-2
140-3


the deposited microorganism(s) or
other biological material referred to in
the description on:

page

line


101 
27 


140-3-1
140-3-2

140-3-3
140-3-4
140-4


Identification of Deposit
Name of depositary institution
Address of depositary institution

Date of deposit
Accession Number
Additional Indications


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of
America

16 October 1997 (16.10.1997)
ATCC 209384

NONE


140-5


Designated States for Which
Indications are Made


all designated States
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140-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


141 

141-1 
141-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line


101 
28 


141-3 

141-3-1 

141-3-2 

141-3-3 
141-3-4 


Identification of Deposit

Name of depositary institution

Address of depositary institution

Date of deposit
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 September 1997 (16.09.1997) 
ATCC 209258 


141-4 


Additional Indications


NONE 


141-5 


Designated States for Which
Indications are Made


all designated States 


141-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


142 

142-1 
142-2 


The indications made below relate to 

the deposited microorganism(s) or

other biological material referred to in

the description on: 

page 

line 


101 
29 


142-3 
142-3-1 

142-3-2 
142-3-3 


Identification of Deposit 

Name of depositary institution

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of
America 

16 September 1997 (16.09.1997) 
ATCC 209257 


142-4 


Additional Indications 


NONE 


142-5 


Designated States for Which 
Indications are Made 


all designated States 


142-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


143 

143-1 
143-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line 


101 
30 


143-3 

143-3-1 

143-3-2 

143-3-3 
143-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

30 May 1997 (30.05.1997) 
ATCC 209087 
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143-4 
143-5 

143-6 


Additional Indications 

Designated States for Which 
Indications are Made


NONE

all designated States 


144 


Separate Furnishing of Indications

These indications will be submitted to 
the International Bureau later 


NONE 


144-1 
144-2 
144-3 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on: 
page 

line 


101 
31 


144-3-1 
144-3-2 

144-3-3 
144-4 


Identification of Deposit 

Name of depositary institution

Address of depositary institution 

Date of deposit 
Accession Number
Additional Indications 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 October 1997 (16.10.1997) 
ATCC 209381 

NONE


144-5
144-6


Designated States for Which
Indications are Made


all designated States 




Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


145 

145-1 

145-2 
145-3 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page 

line 


101 
32 


145-3-1 
145-3-2 

145-3-3 
145-3-4
145-4


Identification of Deposit

Name of depositary institution

Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 September 1997 (16.09.1997) 
NONE


145-5 
145-6 


Designated States for Which
Indications are Made


all designated States


146


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE 


146-1
146-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


101
33 
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146-3 
146-3-1 

146-3-2 

146-3-3 
146-3-4 


Identification of Deposit
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

28 October 1997 (28.10.1997)


146-4


Additional Indications 


NONE 


146-5 


Designated States for Which 
Indications are Made 


all designated States 


146-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


147 

147-1 
147-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


101 
34 


147-3 
147-3-1 

147-3-2 

147-3-3 
147-3-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of
America 

16 September 1997 (16.09.1997) 
ATCC 209256 


147-4 


Additional Indications 


NONE 


147-5 


Designated States for Which 
Indications are Made 


all designated States 


147-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


148 

148-1 
148-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in

the description on:

page 
line 


101 
35 


148-3 
148-3-1 

148-3-2 

148-3-3 
148-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 September 1997 (16.09.1997) 
ATCC 209251 


148-4 


Additional Indications 


NONE 


148-5 


Designated States for Which 
Indications are Made 


all designated States 


148-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 
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149 

149-1 
149-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


101 
36 


149-3 
149-3-1 

149-3-2 

149-3-3 
149-3-4 
149-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications 


10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 September 1997 (16.09.1997) 

ATCC 209263
NONE 


149-5 


Designated States for Which 
Indications are Made 


all designated States 


149-6 

150


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE 


150-1 
150-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in

the description on:

page 
line 


101 
37 


150-3 
150-3-1 

150-3-2 

150-3-3 
150-3-4


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 September 1997 (16.09.1997) 
AlwC ZU72o4 


150-4 


Additional Indications 


NONE 


150-5 


Designated States for Which
Indications are Made 


all designated States 


150-6 


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later 


NONE 


151 

151-1 
151-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


101 
38 


151-3 

151-3-1 

151-3-2

151-3-3
151-3-4
151-4


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number

Additional Indications


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America

16 October 1997 (16.10.1997)

ATCC 209376

NONE


151-5


Designated States for Which
Indications are Made


all designated States
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151-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


152 

152-1 
152-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 



101 
39 


152-3
152-3-1 

152-3-2 

152-3-3 
152-3-4 


Identification of Deposit 
Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of
America 

17 October 1997 (17.10.1997) 
ATCC 209391 


152-4 


Additional Indications 


NONE 


152-5 


Designated States for Which 
Indications are Made 


all designated States 


152-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


153 

153-1 
153-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


101 
40 


153-3 
153-3-1 

153-3-2 
153-3-3 


Identification of Deposit 

Name of depositary institution

Address of depositary institution 

Date of deposit 

Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

28 October 1997 (28.10.1997) 
ATCC 209417 


153-4 


Additional Indications 


NONE 


153-5 


Designated States for Which 
Indications are Made 


all designated States 


153-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


154

154-1 
154-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


101 
41 


154-3 
154-3-1 

154-3-2 

154-3-3 
154-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 September 1997 (16.09.1997) 
ATCC 209253 
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154<4 


Additional Indications 


NONE 


154-5 


Designated States for Which
Indications are Made 


all designated States 


154-6


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


155 

155-1 

155-2 


The indications made below relate to 
the deposited microorganism(s) or

other biological material referred to in

the description on: 
page 

line 


101 
42 


155-3 
155-3-1 

155-3-2 

155-3-3 
155-3-4 


Identification of Deposit 
Name of depositary institution

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

12 May 1998 (12.05.1998) 
ATCC 209855 


155-4


Additional Indications 


NONE 


155-5


Designated States for Which 
Indications are Made 


all designated States 


155-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


156 

156-1 
156-2 


The indications made below relate to 
the deposited microorganism(s) or

other biological material referred to in

the description on: 

page 

line 


101 
43 


156-3 
156-3-1

156-3-2 

156-3-3 
156-3-4 


Identification of Deposit
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

10 December 1997 (10.12.1997) 
ATCC 209526 


156-4 


Additional Indications


NONE 


156-5 


Designated States for Which 
Indications are Made 


all designated States 


156-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


157 

157-1
157-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


101
44
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157-3 

157-3-1 

157-3-3 
157-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 September 1997 (16.09.1997) 
ATCC 209252 


157-4 


Additional Indications 


NONE 


157-5 


Designated States for Which 
Indications are Made 


all designated States 


157-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


158 

158-1 
158-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 

line 


101 


158-3 

158-3-1 

158-3-2 

158-3-3 
158-3-4 


Identification of Deposit 

Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of
America 

16 October 1997 (16.10.1997) 
ATCC 209374 


158-4


Additional Indications 


NONE 


158-5 


Designated States for Which 
Indications are Made 


all designated States 


158-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


159 

159-1 
159-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


101 

* o . 


159-3 
159-3-1 

159-3-2 

159-3-3 
159-3-4


Identification of Deposit 
Name of depositary institution 
Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

10 December 1997 (10.12.1997) 
ATCC 209528 


159-4


Additional Indications 


NONE 


159-5 


Designated States for Which 
Indications are Made 


all designated States 


159-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 



193 



WO 01/40466

PCT/US00/32678



PCT 



Original (for SUBMISSION) - printed on 01.12.2000 02:57:35 PM



160 

160-1 
160-2 
160-3 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 



101 
47 


160-3-1 
160-3-2 

160-3-3 
160-3-4 
160-4


Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

16 September 1997 (16.09.1997) 
ATCC 209265 

NONE


160-5 
160-6 


Designated States for Which
Indications are Made 


all designated States


161 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE


161-1 
161-2 
161-3 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 

line 


101 
48 


161-3-1 
161-3-2 

161-3-3 
161-3-4 
161-4


Name of depositary institution
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

17 October 1997 (17.10.1997) 

ATCC 209396

NONE


161-5 
161-6 


Designated States for Which 
Indications are Made 


all designated States 


162 


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE 


162-1 
162-2 
162-3


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line


101 
49 


162-3-1
162-3-2

162-3-3
162-3-4
162-4


Identification of Deposit

Name of depositary institution

Address of depositary institution

Date of deposit

Accession Number
Additional Indications


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of
America

18 August 1997 (18.08.1997)

ATCC 209201

NONE


162-5


Designated States for Which
Indications are Made


all designated States
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162-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


163 

163-1 
163-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


101 
50 


163-3
163-3-1 

163-3-2 

163-3-3 
163-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

28 October 1997 (28.10.1997)
ATCC 209416 


163-4


Additional Indications 


NONE 


163-5 


Designated States for Which 
Indications are Made 


all designated States 


163-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


164 

164-1 

164-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


101 
51 


164-3 
164-3-1 

164-3-2 

164-3-3 
164-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

17 October 1997 (17.10.1997) 
ATCC 209403 


164-4 


Additional Indications 


NONE 


164-5 


Designated States for Which 
Indications are Made 


all designated States 


164-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


165 

165-1 
165-2 


The indications made below relate to
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line 


101 
52 


165-3 
165-3-1 

165-3-2 

165-3-3 
165-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

28 October 1997 (28.10.1997) 
ATCC 209419 






165-4 


Additional Indications


NONE 


165-5 


Designated States for Which 
Indications are Made 


all designated States 


165-6
166


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


166-1 
166-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


101 
53 


166-3 

166-3-1 

166-3-2 

166-3-3 
166-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

17 October 1997 (17.10.1997) 
ATCC 209402 




Additional Indications 


NONE 


166-5 


Designated States for Which 
Indications are Made 


all designated States 


166-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


167 

167-1 
167-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 

line 


101 
54 


167-3 
167-3-1 

167-3-2 

167-3-3 
167-3-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 October 1997 (16.10.1997) 
ATCC 209378 


167-4 


Additional Indications


NONE


167-5 


Designated States for Which 
Indications are Made 


all designated States 


167-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


168 

168-1
168-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


101 
55 
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168^ 
168-3-1 

168-3-2 

168-3-3 
168-3-4 


Identification of Deposit 
Name of depositary institution
Address of depositary institution

Date of deposit
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

21 November 1997 (21.11.1997) 
ATCC 209489 


168-4 


Additional Indications 


NONE 


168-5


Designated States for Which 
Indications are Made 


all designated States 


168-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


169

169-1 
169-2 


The indications made below relate to
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line 


102 

2 


169-3 
169-3-1 

169-3-2 

169-3-3 
169-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

17 October 1997 (17.10.1997) 
ATCC 209401 


169-4 


Additional Indications


NONE 


169-5 


Designated States for Which 
Indications are Made 


all designated States 


169-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


170 

170-1 
170-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 

page 

line 


102 
3


170-3
170-3-1

170-3-2

170-3-3
170-3-4


Identification of Deposit 
Name of depositary institution
Address of depositary institution 

Date of deposit 

Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

17 October 1997 (17.10.1997) 
ATCC 209397 


170-4 


Additional Indications


NONE 


170-5 


Designated States for Which
Indications are Made


all designated States 


170-6 



Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later


NONE 
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171 

171-1
171-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on: 
page 

line 



102 
4 


171-3 
171-3-1 

171-3-2 

171-3-3 
171-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

17 October 1997 (17.10.1997) 
ATCC 209389 


171-4 


Additional Indications 


NONE 


171-5


Designated States for Which 
Indications are Made 


all designated States 


171-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE 


172 

172-1 
172-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 

page 

line 


102 
5 


172-3 

172-3-1 

172-3-2 

172-3-3 

172-3-4 


Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

07 November 1997 (07.11.1997) 
ATCC 209438 


172-4 


Additional Indications 


NONE 


172-5 


Designated States for Which 
Indications are Made 


all designated States 


172-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


173 

173-1 
173-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


102 
6 


173-3 
173-3-1 

173-3-2 

173-3-3 
173-3-4


Identification of Deposit 
Name of depositary institution
Address of depositary institution

Date of deposit 

Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

21 November 1997 (21.11.1997) 
ATCC 209492 


173-4
173-5


Additional Indications


NONE




Designated States for Which
Indications are Made


all designated States
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173^ 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


174 

174-1 

174-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line 



102 
7 


174-3 
174-3-1 

174-3-2 

174-3-3 
174-3-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of
America 

17 October 1997 (17.10.1997) 
ATCC 209388 




Additional Indications


NONE 


174-5 


Designated States for Which 
Indications are Made 


all designated States 


174-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


175 

175-1

175-2


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on: 

line 


102 
8 


175-3 

175-3-1 

175-3-2 

175-3-3 
175-3-4
175-4


Identification of Deposit
Name of depositary institution

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

07 November 1997 (07.11.1997) 
ATCC 209432 




Additional Indications


NONE 


175-5 


Designated States for Which
Indications are Made 


all designated States 


175-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


176 

176-1 

176-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on:

page

line


102 
9 


176-3 

176-3-1 

176-3-2

176-3-3
176-3-4


Identification of Deposit 
Name of depositary institution 
Address of depositary institution

Date of deposit 

Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

07 November 1997 (07.11.1997) 
ATCC 209439 
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176^ 


Additional Indications


NONE 


176-5 


Designated States for Which 
Indications are Made 


all designated States 


176-6 


Separate Furnishing of Indications

These indications will be submitted to 
the International Bureau later


NONE 


177 

177-1 
177-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


102 
10 


177-3 
177-3-1 

177-3-2 

177-3-3 
177-3-4


Name of depositary institution 
Address of depositary institution 

Date of deposit 

Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

07 November 1997 (07.11.1997)
ATCC 209433 


177-4 


Additional Indications 


NONE 


177-5 


Designated States for Which 
Indications are Made 


all designated States 


177-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


178 

178-1 

178-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line 


102 
11 


178-3 
178-3-1 

178-3-2 

178-3-3 
178-3-4


Name of depositary institution
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

05 February 1998 (05.02.1998)
ATCC 209618 


178-4 


Additional Indications 


NONE 


178-5


Designated States for Which
Indications are Made


all designated States 


178-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


179 


179-1
179-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


102 
12 
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17M 

179-3-1

179-3-2

179-3-3 
179-3-4 


Identification of Deposit
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

21 November 1997 (21.11.1997)
ATCC 209484 


179-4


Additional Indications 


NONE 


179-5


Designated States for Which
Indications are Made 


all designated States 


179-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE 


180 
180-1 


The indications made below relate to
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line 


102 
13 


180-3 
180-3-1 

180-3-2 

180-3-3 
180-3-4 


Identification of Deposit
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

21 November 1997 (21.11.1997) 
ATCC 209487 


180-4 


Additional Indications


NONE 


180-5 


Designated States for Which 
Indications are Made 


all designated States 


180-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


181 

181-1 
181-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


102 
14 


181-3 
181-3-1 

181-3-3
181-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 

Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

07 November 1997 (07.11.1997) 
ATCC 209434 


181-4 


Additional Indications 


NONE 


181-5 


Designated States for Which 
Indications are Made 


all designated States 


181-6 


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later 


NONE 
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182 

182-1 
182-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 



102 
15 


182-3 
182-3-1 

182-3-2 

182-3-3 
182-3-4 


Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

26 March 1998 (26.03.1998) 
ATCC 209704 


182-4 


Additional Indications 


NONE 


182-5 


Designated States for Which 
Indications are Made 


all designated States 


182-6
183


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


183-1 
183-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on: 

page 

line 


102 
16 


183-3 
183-3-1 

183-3-2 

183-3-3 
183-3-4 


Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

28 April 1998 (28.04.1998) 
ATCC 209808 


183-4 


Additional Indications 


NONE 


183-5 


Designated States for Which 
Indications are Made 


all designated States 


183-6 
184


Separate Furnishing of Indications

These indications will be submitted to 
the International Bureau later


NONE 


184-1 
184-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


102 
17 


184-3 

184-3-1 

184-3-2 

184-3-3 
184-3-4


Identification of Deposit
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

06 May 1998 (06.05.1998) 
ATCC 209847 


184-4


Additional Indications 


NONE


184-5


Designated States for Which 
Indications are Made


all designated States 
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184-6 


Separate Furnishing of Indications

These indications will be submitted to 
the International Bureau later


NONE 


185 

185-1 
185-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 

page 

line 



102 
18 


185-3 

185-3-1 

185-3-2 

185-3-3 
185-3-4 


Identification of Deposit 

Name of depositary institution 

Address of depositary institution 

Date of deposit
Accession Number 


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of
America 

05 February 1998 (05.02.1998) 


185-4 


Additional Indications 


NONE 


185-5 


Designated States for Which 
Indications are Made 


all designated States 


185-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE


186

186-1 
186-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:

page 

line 


102 

19 


186-3 
186-3-1 

186-3-2 

186-3-3 
186-3-4 


Identification of Deposit
Name of depositary institution

Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of
America 

05 February 1998 (05.02.1998)


186-4 


Additional Indications 


NONE 


186-5


Designated States for Which 
Indications are Made 


all designated States 


186-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE 


187 

187-1 
187-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 

line 


102 
20 


187-3

187-3-1 

187-3-2 

187-3-3 
187-3-4


Identification of Deposit

Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of
America 

11 August 1998 (11.08.1998) 
ATCC 203109 
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187-4 
187-5 


Additional Indications

Designated States for Which
Indications are Made


NONE 

all designated States 


187-6

188


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


188-1 
188-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:

page 
line 


102 
21 


188-3 

188-3-1 

188-3-2

188-3-3 
188-3-4 
188-4


Identification of Deposit
Name of depositary institution 

Address of depositary institution

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

31 March 1998 (31.03.1998)

ATCC 209715 

NONE 


188-5 


Designated States for Which 
Indications are Made 


all designated States 


188-6 

189


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


189-1 
189-2


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


102 
22 


189-3 
189-3-1 

189-3-3 
189-3-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

11 March 1998 (11.03.1998) 
ATCC 209669 


189-4 


Additional Indications


NONE 


189-5 


Designated States for Which 
Indications are Made


all designated States 


189-6 

190


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE



190-1
190-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


102
23 
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190*3 


Identification of Deposit 




190-3-1 
190-3-2 


Name of depositary institution 
Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of
America 


190-3-3 
190-3-4 


Date of deposit 

Accession Number 


23 June 1998 (23.06.1998) 




Additional Indications 


NONE


190-5 


Designated States for Which 
Indications are Made 


all designated States 


190-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


191 

191-1 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 

page 


102 


191-2 


line 


24 


191-3 


Identification of Deposit




191-3-1 
191-3-2 


Name of depositary institution 
Address of depositary institution 


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of
America 


191-3-3 
191-3-4 


Accession Number 


26 March 1998 (26.03.1998) 


191-4 


Additional Indications 


NONE 


191-5


Designated States for Which 
Indications are Made 


all designated States 


191-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


192 
192-1 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 



102 


192-2 


line 


25 


192-3 
192-3-1 

192-3-2 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of


192-3-3 

192-3-4 


Date of deposit 
Accession Number 


America 

16 June 1998 (16.06.1998) 
ATCC 209981 


192-4 


Additional Indications 


NONE


192-5 


Designated States for Which 
Indications are Made 


all designated States 


192-6


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE 
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193 

193-1 
193-2 



The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 

line 



102 
26 


193-3

193-3-1 
193-3-2 

193-3-3 
193-3-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of
America 

07 April 1998 (07.04.1998)
ATCC 209749 


193-4 
193-5 


Additional Indications




193-6 


Designated States for Which
Indications are Made 


all designated States 


194 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


194-1
194-2 
194-3 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on:

page 
line 


102
27 


194-3-1 
194-3-2 

194-3-3 
194-3-4 
194-4 


Identification of Deposit

Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 

10801 University Blvd., Manassas,
Virginia 20110-2209, United States of
America 

12 May 1998 (12.05.1998) 
ATCC 209859 




Additional Indications 




194-5 
194-6


Designated States for Which 
Indications are Made 


all designated States 


195 


Separate Furnishing of Indications

These indications will be submitted to 
the International Bureau later 


NONE 


195-1 
195-2 
195-3


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:

page

line


102 
28 


195-3-1
195-3-2

195-3-3
195-3-4
195-4


Identification of Deposit

Name of depositary institution

Address of depositary institution

Date of deposit
Accession Number
Additional Indications


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America

06 May 1998 (06.05.1998)
ATCC 209845

NONE


195-5


Designated States for Which
Indications are Made


all designated States
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195^ 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


196 

196-1 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 


102 


196-2 


line 


29 


196-3 


Identification of Deposit 




196-3-1 
196-3-2 


Name of depositary institution 
Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of


196-3-3 

196-3-4 


Date of deposit 
Accession Number 


America 

07 April 1998 (07.04.1998) 




Additional Indications 


NONE 


196-5 


Designated States for Which 
Indications are Made 


all designated States 


196-6 


Separate Furnishing of Indications

These indications will be submitted to 
the International Bureau later


NONE 


197 

197-1 
197-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on:

page 
line 


102 
30 


197-3 


Identification of Deposit 




197-3-1 
197-3-2 


Name of depositary institution 
Address of depositary institution


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of


197-3-3 
197-3-4


Date of deposit 
Accession Number


America 

11 August 1998 (11.08.1998) 
ATCC 203107 


197-4 


Additional Indications 


NONE 


197-5 


Designated States for Which 
Indications are Made


all designated States 


197-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE 


198 

198-1 
198-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 



102 
31 


198-3 
198-3-1 

198-3-2 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 


198-3-3 
198-3-4


Date of deposit 
Accession Number 


23 April 1998 (23.04.1998) 
ATCC 209801 
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198-4 
198-5 


Additional Indications

Designated States for Which 
Indications are Made 


all designated States 


198-6 
199 


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE


199-1 
199-2 
199-3 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on: 

page 

line



102 
32 


199-3-1 
199-3-2 

199-3-3 
199-3-4 
199-4 


Identification of Deposit 

Name of depositary institution
Address of depositary institution

Date of deposit
Accession Number
Additional Indications


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of
America 

09 June 1998 (09.06.1998) 
ATCC 209948 

NONE


199-5 
199-6 


Designated States for Which 
Indications are Made


all designated States


Separate Furnishing of Indications

These indications will be submitted to 
the International Bureau later 


NONE


200-1 
200-2

200-3


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 

page 

line


102 
33 


200-3-1
200-3-2 

200-3-3 
200-3-4 
200-4 


Identification of Deposit
Name of depositary institution

Address of depositary institution

Date of deposit
Accession Number
Additional Indications


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of
America 

20 May 1998 (20.05.1998) 
ATCC 209883 

NONE


200-5
200-6


Designated States for Which 
Indications are Made


all designated States 


201


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE



201-1
201-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:

page

line


102
34
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201 >3 
201-3-1 

201-3-2 

201-3-3 
201-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

01 July 1998 (01.07.1998) 
ATCC 203049 


201-4


Additional Indications 


NONE 


201-5 


Designated States for Which 
Indications are Made 


all designated States 


201-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


202 

202-1 
202-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line


102 
35 


202-3 
202-3-1 

202-3-2 

202-3-3 
202-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

06 May 1998 (06.05.1998) 
ATCC 209846 


202-4 


Additional Indications 


NONE 


202-5 


Designated States for Which 
Indications are Made 


all designated States 


202-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


203 

203-1 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line


102 
36 


203-3 

203-3-1 

203-3-2 

203-3-3 
203-3-4


Identification of Deposit 
Name of depositary institution 

Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

12 May 1998 (12.05.1998) 
ATCC 209857 


203-4 


Additional Indications 


NONE 


203-5 


Designated States for Which 
Indications are Made 


all designated States


203-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 
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204 

204-1 
204-2 
204-3


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 



102 
37 


204-3-1 
204-3-2 

204-3-3 
204-3-4 
204-4 


Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of
America 

14 May 1998 (14.05.1998) 
ATCC 209864 

NONE


204-5 
204-6 


Designated States for Which 
Indications are Made 


all designated States


205 


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later 


NONE


205-1 
205-2 
205-3 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 

line 


102 
38 


205-3-1 

205-3-2

205-3-3 
205-3-4 
205-4


Identification of Deposit
Name of depositary institution
Address of depositary institution 

Date of deposit 
Accession Number 

Additional Indications


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of
America 

20 May 1998 (20.05.1998) 
ATCC 209880 

NONE


205-5 
205-6 


Designated States for Which 
Indications are Made


all designated States 


206 


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE


206-1 
206-2
206-3


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:

page

line


102 
39 


206-3-1
206-3-2

206-3-3
206-3-4
206-4


Identification of Deposit
Name of depositary institution

Address of depositary institution

Date of deposit
Accession Number
Additional Indications


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of
America

14 May 1998 (14.05.1998)
ATCC 209869

NONE


206-5


Designated States for Which
Indications are Made


all designated States
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206-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


207 

207-1 


The indications made below relate to 
the deposited mlcroorganism(s) or 
other biological material referred to in
the description on: 
page 

line


102 
40 


207-3 
207-3-1 

207-3-2 

207-3-3 

207-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of
America 

09 June 1998 (09.06.1998) 
ATCC 209950 


207-4 


Additional Indications 


NONE 


207-5 


Designated States for Which 
Indications are Made 


all designated States 


207-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


208 

208-1 

208-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on: 
page 

line 


102 
41 


208-3 
208-3-1 

208-3-2 

208-3-3 
208-3-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

23 June 1998 (23.06.1998) 
ATCC 203008 


208-4 


Additional Indications 


NONE 


208-5 


Designated States for Which 
Indications are Made 


all designated States 


208-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


209 

209-1 
209-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


102 
42 


209-3 
209-3-1

209-3-2 

209-3-3 
209-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

23 June 1998 (23.06.1998) 
ATCC 203014 
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The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 



line



Identification of Deposit 
Name of depositary institution
Address of depositary institution 



211-3-3 
211-3-4 



Date of deposit 
Accession Number 



212-1 
212-2 



The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page 

line 



102 
43 



American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

11 August 1998 (11.08.1998) 
ATCC 203110 

NONE 

all designated States 




American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

23 June 1998 (23.06.1998) 
ATCC 203009 




102 
45 



212 



WO 01/40466 PCT/US00/32678



P3330R1 

Original (for SUBMISSION) - printed on 01.12.2000 02:57:35 PM



212-3 
212-3-1 

212-3-2 

212-3-3 

212-3-4 


Identification of Deposit 
Name of depositary institution

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

09 June 1998 (09.06.1998)
ATCC 209961 


212-4 


Additional Indications 


NONE 


212-5 


Designated States for Which 
Indications are Made 


all designated States 


212-6


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


213 

213-1 
213-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


102 
46 


213-3 

213-3-1 

213-3-2 

213-3-3 
213-3-4


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

09 June 1998 (09.06.1998) 
ATCC 209962 


213-4 


Additional Indications 


NONE 


213-5 


Designated States for Which 
Indications are Made 


all designated States 


213-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


214 

214-1 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to In 
the description on: 
page 

line 


102 
47 


214-3 
214-3-1 

214-3-2 

214-3-3 
214-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

14 May 1998 (14.05.1998) 
ATCC 209866 


214-4 


Additional Indications 


NONE 


214-5 


Designated States for Which
Indications are Made 


all designated States 


214-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 
215

215-1 
215-2 
215-3 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line



102 
48 


215-3-1
215-3-2 

215-3-3 
215-3-4 
215-4 


Identification of Deposit
Name of depositary institution
Address of depositary institution

Date of deposit
Accession Number
Additional Indications


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

25 August 1998 (25.08.1998) 
ATCC 203157 

NONE


215-5 
215-6 

216 


Designated States for Which 

Indications are Made 

Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


all designated States 

NONE 


216-1 
216-2 
216-3 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line


102 
49 


216-3-1 
216-3-2

216-3-3 
216-3-4 
216-4 


Identification of Deposit 
Name of depositary institution
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications


American Type Culture Collection 

10801 University Blvd., Manassas,

Virginia 20110-2209United States of 
America 

11 August 1998 (11.08.1998) 
ATCC 203106 

NONE


216-5 
216-6 


Designated States for Which

Indications are Made 

Separate Furnishing of Indications 


all designated States 


217 


These indications will be submitted to 
the International Bureau later 


NONE 


217-1
217-2
217-3


The indications made below relate to
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page

line


102 
50 


217-3-1

217-3-2

217-3-3
217-3-4
217-4


Identification of Deposit

Name of depositary institution

Address of depositary institution

Date of deposit
Accession Number
Additional Indications


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209 United States of
America

09 June 1998 (09.06.1998)
ATCC 209945

NONE


217-5


Designated States for Which
Indications are Made


all designated States
217-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


218 

218-1 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on: 
page 

line


102 
51 


218-3 
218-3-1 

218-3-2 

218-3-3 

218-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 June 1998 (16.06.1998) 
ATCC 209989 


218-4 


Additional Indications 


NONE 


218-5 


Designated States for Which
Indications are Made 


all designated States 


218-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


219 

219-1 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


102 
52 


219-3 
219-3-1 

219-3-2 

219-3-3 
219-3-4 


Identification of Deposit 
Name of depositary institution

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

11 August 1998 (11.08.1998) 
ATCC 203108 


219-4 


Additional Indications 


NONE 


219-5 


Designated States for Which
Indications are Made 


all designated States 


219-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE 


220 

220-1 
220-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to In 
the description on: 
page 

line 


102 
53 


220-3 
220-3-1 

220-3-2 

220-3-3 
220-3-4 


Identification of Deposit
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

11 August 1998 (11.08.1998) 
ATCC 203111 
221-1
221-2 



The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on:
page

line



221-3 Identification of Deposit

Name of depositary institution 
Address of depositary institution 



221-3-1 
221-3-2 



221-3-3 
221-3-4 



Date of deposit 
Accession Number 



221-4 Additional Indications 



221-5 
221-6 



Designated States for Which 

Indications are Made 

Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later
The indications made below relate to
the deposited microorganism(s) or

other biological material referred to in
the description on:

page

line

Identification of Deposit 
Name of depositary institution 
Address of depositary institution



222-3-3 
222-3-4



Date of deposit 
Accession Number



222-4 Additional Indications



223 



223-1 
223-2 



Designated States for Which
Indications are Made 
Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later
The indications made below relate to



102 
54 



American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

20 October 1998 (20.10.1998) 
ATCC 203359 



NONE 



all designated States 
NONE 



102 
55 

American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 June 1998 (16.06.1998) 
ATCC 209988 



NONE 



all designated States 
NONE 



the deposited microorganism(s) or
other biological material referred to in
the description on:
page 

line 



103 
2 
223-3
223-3-1

223-3-2 

223-3-3 
223-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 June 1998 (16.06.1998) 
ATCC 209978 


223-4 


Additional Indications 


NONE 


223-5 


Designated States for Which 
Indications are Made 


all designated States


223-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


224 

224-1 
224-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 

the description on:

page 

line 


103 
3 


224-3 
224-3-1 

224-3-2 

224-3-3 
224-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

04 August 1998 (04.08.1998) 


224-4 


Additional Indications 


NONE 


224-5 


Designated States for Which
Indications are Made 


all designated States 


224-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


225 

225-1 
225-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to In 
the description on: 
page 

line 


103 
4 


225-3 
225-3-1 

225-3-2 

225-3-3 
225-3-4 


Identification of Deposit 
Name of depositary institution

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

16 June 1998 (16.06.1998) 
ATCC 209980 


225-4 


Additional Indications 


NONE 


225-5 


Designated States for Which 
Indications are Made 


all designated States 


225-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 
226

226-1 
226-2 
226-3


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 

line 


103 
5 


226-3-1 
226-3-2 

226-3-3 
226-3-4 
226-4


Identification of Deposit
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number
Additional Indications 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

04 August 1998 (04.08.1998)
ATCC 203091 

NONE


226-5 
226-6


Designated States for Which 
Indications are Made 


all designated States 


227 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


227-1 
227-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


103 
6 


227-3-1 
227-3-2 

227-3-3 
227-3-4
227-4 


Identification of Deposit 

Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications


American Type Culture Collection 

10801 University Blvd., Manassas,

Virginia 20110-2209United States of 
America 

04 August 1998 (04.08.1998) 
ATCC 203090 

NONE


227-5 
227-6 


Designated States for Which 
Indications are Made


all designated States 




Separate Furnishing of Indications

These indications will be submitted to 
the International Bureau later 


NONE 


228-1 
228-2 
228-3 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 

line


103 
7 


228-3-1
228-3-2

228-3-3
228-3-4
228-4


Identification of Deposit

Name of depositary institution

Address of depositary institution

Date of deposit
Accession Number

Additional Indications


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209 United States of
America

04 August 1998 (04.08.1998)
ATCC 203092

NONE


228-5


Designated States for Which
Indications are Made


all designated States
228-6


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


229 

229-1 
229-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


103 

8


229-3 

229-3-1 

229-3-2 

229-3-3 

229-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 

Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

10 November 1998 (10.11.1998) 
ATCC 203452 


229-4 


Additional Indications 


NONE 


229-5 


Designated States for Which 
Indications are Made 


all designated States 


229-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE 


230 

230-1 
230-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in

the description on: 
page 

line 


103 
9 


230-3 

230-3-1 

230-3-2 

230-3-3 
230-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

01 September 1998 (01.09.1998) 
ATCC 203173 


230-4


Additional Indications 


NONE 


230-5 


Designated States for Which 
Indications are Made 


all designated States 


230-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


231 

231-1 
231-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 

page 

line 


103 
10 


231-3 
231-3-1 

231-3-2 

231-3-3 
231-3-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

17 November 1998 (17.11.1998) 
ATCC 203464 
231-4
231-5 

231-6


Additional Indications 

Designated States for Which
Indications are Made 


NONE

all designated States 




Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later 


NONE


232-1 
232-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 

line 


103 
11 


232-3-1
232-3-2 

232-3-3 
232-3-4 
232-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection 

10801 University Blvd., Manassas,

Virginia 20110-2209United States of 
America 

18 August 1998 (18.08.1998) 
ATCC 203132 


232-5 
232-6 


Designated States for Which 
Indications are Made 


all designated States 




Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


233-1 

233-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on:
page 

line 


103 
12 


233-3-1 
233-3-2 

233-3-3 
233-3-4 
233-4


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection 
10801 University Blvd., Manassas,

Virginia 20110-2209United States of 
America 

09 September 1998 (09.09.1998) 
ATCC 203254 

NONE


233-5 
233-6


Designated States for Which
Indications are Made


all designated States 


234


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE 


234-1
234-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:

page

line


103
13
234-3
234-3-1 

234-3-2 

234-3-3 

234-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

20 October 1998 (20.10.1998) 
ATCC 203358 


234-4


Additional Indications


NONE 


234-5


Designated States for Which 
Indications are Made 


all designated States 


234-6 


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later 


NONE 


235 

235-1 


The indications made below relate to
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line 


103 
14 


235-3 

235-3-1 

235-3-2 

235-3-3 
235-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary Institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209 United States of
America 

04 August 1998 (04.08.1998) 
ATCC 203093 


235-4 


Additional Indications 


NONE 


235-5 


Designated States for Which
Indications are Made 


all designated States 


235-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE 


236 

236-1 
236-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


103 
15 


236-3 
236-3-1 

236-3-3 
236-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

03 November 1998 (03.11.1998) 
ATCC 203457 


236-4 


Additional Indications 


NONE 


236-5 


Designated States for Which 
Indications are Made 


all designated States 


236-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later


NONE 
237-1
237-2 
237-3 



The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 

page 

line 



103 
16 



237-3-1 
237-3-2 



237-3-3 
237-3-4 



Identification of Deposit 
Name of depositary institution 
Address of depositary institution 



Date of deposit 
Accession Number 



American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

09 September 1998 (09.09.1998) 
ATCC 203241 




These indications will be submitted to 
the International Bureau later 



NONE



238-1 
238-2 
238-3



The indications made below relate to
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 

page 

line 



103 
17 



238-3-1 
238-3-2 



238-3-3 
238-3-4 



Identification of Deposit 
Name of depositary institution 
Address of depositary institution 



Date of deposit 
Accession Number 



238-4 



Additional Indications 



American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209 United States of
America 

09 September 1998 (09.09.1998)
ATCC 203249 



238-5 



Designated States for Which 
Indications are Made 



NONE 



all designated States 



239 



239-1 
239-2 



Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 



NONE 



The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 



103 
18 



239-3 
239-3-1 

239-3-2



239-3-3 
239-3-4 



Identification of Deposit 
Name of depositary institution

Address of depositary institution



Date of deposit 
Accession Number 



239-4
239-5 



Additional Indications 

Designated States for Which
Indications are Made



American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

09 September 1998 (09.09.1998) 
ATCC 203250 



NONE 

all designated States 
239-6


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


240 

240-1 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 

page 


103 


240-2 


line 


19 


240-3 


Identification of Deposit 




240-3-1 
240-3-2 


Name of depositary institution 
Address of depositary institution 


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 


240-3-3 
240-3-4 


Date of deposit 
Accession Number 


America 

18 August 1998 (18.08.1998) 


240-4 


Additional Indications 


NONE


240-5 


Designated States for Which 
Indications are Made 


all designated States


240-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later




241 

241-1 

241-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 

the description on:

page 

line 


103 
20 


241-3 


Identification of Deposit 




241-3-1 
241-3-2 


Name of depositary institution 
Address of depositary institution 


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 


241-3-3 
241-3-4 


Date of deposit 
Accession Number 


15 September 1998 (15.09.1998)


241-4 


Additional Indications 


NONE 


241-5 


Designated States for Which 
Indications are Made 


all designated States 


241-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later


NONE 


242 

242-1 
242-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to In 
the description on: 
page 

line 


103 
21 


242-3 
242-3-1 

242-3-2 


Identification of Deposit 

Name of depositary institution 

Address of depositary institution 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209 United States of
America 


242-3-3 
242-3-4 


Date of deposit 
Accession Number 


15 September 1998 (15.09.1998) 
ATCC 203233 
242-4


Additional Indications


NONE 


242-5


Designated States for Which
Indications are Made 


all designated States 


242-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


243 

243-1 
243-2 


The indications made below relate to 

the deposited microorganism(s) or

other biological material referred to in

the description on: 

page 

line 


103 
22 


243-3-1 
243-3-2 

243-3-3 
243-3-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

09 September 1998 (09.09.1998) 
ATCC 203252 


243-4 


Additional Indications 


NONE


243-5 


Designated States for Which 
Indications are Made 


all designated States 


243-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later


NONE 


244 

244-1 

244-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 

line 


103 
23 


244-3-1 
244-3-2 

244-3-3 
244-3-4 
244-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 
Additional Indications 


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

17 November 1998 (17.11.1998) 

ATCC 203476 

NONE


244-5 


Designated States for Which 
Indications are Made 


all designated States 


244-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


245 

245-1
245-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:

page

line


103 
24 
245-3
245-3-1 

245-3-2

245-3-3
245-3-4


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

04 August 1998 (04.08.1998) 


245-4 


Additional Indications 


NONE 


245-5 


Designated States for Which 
Indications are Made 


all designated States 


245-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


246 

246-1 

246-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


103 
25 


246-3 

246-3-1 

246-3-2 

246-3-3 
246-3-4 


Identification of Deposit 
Name of depositary institution

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209 United States of
America

15 September 1998 (15.09.1998) 

ATCC 203239 


246-4 


Additional Indications 


NONE 


246-5 


Designated States for Which 
Indications are Made


all designated States 


246-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


247 

247-1 
247-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in

page 
line 


103 
26 


247-3 
247-3-1 

247-3-2 

247-3-3 
247-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209United States of 
America 

22 September 1998 (22.09.1998) 
ATCC 203267 


247-4


Additional Indications 


NONE 


247-5 


Designated States for Which 
Indications are Made 


all designated States 


247-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 
248

248-1 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line


103 
27 


248-3 

248-3-1

248-3-2 

248-3-3 
248-3-4 


Identification of Deposit

Name of depositary institution 

Address of depositary Institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

22 September 1998 (22.09.1998) 
ATCC 203282 


248-4 


Additional Indications 


NONE 


248-5 


Designated States for Which 
Indications are Made


all designated States 


248-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


249 

249-1 
249-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


103 
28 


249-3 

249-3-1

249-3-2 

249-3-3 
249-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

09 February 1999 (09.02.1999) 
ATCC 203657 


249-4 


Additional Indications 


NONE 


249-5 


Designated States for Which 
Indications are Made 


all designated States 


249-6


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


250 

250-1 
250-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


103 
29 


250-3 

250-3-1 

250-3-2 

250-3-3 
250-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

22 September 1998 (22.09.1998) 
ATCC 203276 


250-4


Additional Indications 


NONE 


250-5 


Designated States for Which
Indications are Made


all designated States 
250-6


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


251 

251-1
251-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


103 
30 


251-3 

251-3-1 

251-3-2 

251-3-3 
251-3-4 


Identification of Deposit 
Name of depositary institution

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

25 August 1998 (25.08.1998) 
ATCC 203160 


251-4 


Additional Indications 


NONE 


251-5 


Designated States for Which 
Indications are Made 


all designated States 


251-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later


NONE 


252 

252-1 
252-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


103 
31 


252-3 

252-3-1 

252-3-2 

252-3-3 
252-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209United States of 
America 

18 August 1998 (18.08.1998) 
ATCC 203135 


252-4


Additional Indications 


NONE 


252-5 


Designated States for Which
Indications are Made 


all designated States 


252-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


253 

253-1 
253-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line 


103 
32 


253-3 
253-3-1 

253-3-2 

253-3-3 
253-3-4


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

03 November 1998 (03.11.1998) 
ATCC 203459


253-4


Additional Indications


NONE 


253-5


Designated States for Which 
Indications are Made 


all designated States 


253-6


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE


254 

254-1 
254-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 

line 


103
33


254-3 
254-3-1 

254-3-2 

254-3-3 
254-3-4 


Identification of Deposit
Name of depositary institution

Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

22 September 1998 (22.09.1998) 
ATCC 203270


254-4


Additional Indications 


NONE 


254-5 


Designated States for Which 
Indications are Made


all designated States 


254-6 


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE


255


255-1 
255-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in 
the description on: 
page 

line 


103 
34 


255-3 
255-3-1 

255-3-2 

255-3-3
255-3-4 


Identification of Deposit
Name of depositary institution

Address of depositary institution

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

12 January 1999 (12.01.1999) 
ATCC 203573 


255-4 


Additional Indications




255-5 


Designated States for Which
Indications are Made 


all designated States 


255-6 


Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later


NONE


256

256-1
256-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


103
35


256-3
256-3-1 

256-3-2 

256-3-3 
256-3-4


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of
America 

17 November 1998 (17.11.1998) 
ATCC 203477 


256-4


Additional Indications 


NONE 


256-5 


Designated States for Which 
Indications are Made 


all designated States 


256-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


257 

257-1 
257-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:

page 

line 


103 
36 


257-3 
257-3-1 

257-3-2 

257-3-3
257-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

06 October 1998 (06.10.1998) 


257-4


Additional Indications


NONE 


257-5 


Designated States for Which 
Indications are Made 


all designated States 


257-6 


Separate Furnishing of Indications

These indications will be submitted to 
the International Bureau later 


NONE


258

258-1
258-2


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


103 
37 


258-3 
258-3-1

258-3-2 

258-3-3 
258-3-4


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

06 October 1998 (06.10.1998) 
ATCC 203313 


258-4


Additional Indications 


NONE 


258-5 


Designated States for Which 
Indications are Made 


all designated States 


258-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE


259-1



The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page 



259-2 line 



259-3 Identification of Deposit
259-3-1 Name of depositary institution
259-3-2 Address of depositary institution



259-3-3 Date of deposit 
259-3-4 Accession Number 



103 
38 



American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

27 October 1998 (27.10.1998) 
ATCC 203407 



259-5 Designated States for Which



NONE 



Indications are Made



all designated States 



259-6 Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later



NONE 



260-1 
260-2 



The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:

page 

line 



103 
39 



260-3 Identification of Deposit

260-3-1 Name of depositary institution 
260-3-2 Address of depositary institution 



260-3-3 Date of deposit
260-3-4 Accession Number



260-4 Additional Indications

260-5 Designated States for Which
Indications are Made



American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

22 December 1998 (22.12.1998) 
ATCC 203553 



NONE 



260-6 Separate Furnishing of Indications

These indications will be submitted to
the International Bureau later



all designated States 



NONE 



261 The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page 



261-1 
261-2 



line



103 
40 



261-3 Identification of Deposit
261-3-1 Name of depositary institution
261-3-2 Address of depositary institution



261-3-3 Date of deposit
261-3-4 Accession Number
261-4 



American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

22 December 1998 (22.12.1998) 
ATCC 203549 



261-5 



Designated States for Which
Indications are Made



NONE 



all designated States


261-6


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


262 

262-1 
262-2 


The indications made below relate to 
the deposited microorganism(s) or 
other biological material referred to in
the description on: 
page 

line 


103 
41 


262-3 
262-3-1 

262-3-2 

262-3-3 
262-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

22 December 1998 (22.12.1998) 
ATCC 203550 


262-4 


Additional Indications


NONE 


262-5 


Designated States for Which 
Indications are Made 


all designated States 


262-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


263-1 

263-2 


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on: 
page 

line 


103 
42 


263-3 
263-3-1 

263-3-2 

263-3-3 
263-3-4 


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

08 June 1999 (08.06.1999) 
ATCC PTA-204 


263-4 


Additional Indications 


NONE 


263-5


Designated States for Which 
Indications are Made 


all designated States 


263-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


264 

264-1 
264-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on: 

page 

line 


103 
43 


264-3 
264-3-1 

264-3-2 

264-3-3 
264-3-4


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 
Date of deposit 

Accession Number


American Type Culture Collection
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of
America

29 October 1998 (29.10.1998)
ATCC 203391


264-4


Additional Indications 


NONE 


264-5


Designated States for Which
Indications are Made 


all designated States 


264-6 
265


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


265-1 

265-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in 
the description on: 
page 

line 


103 
44 


265-3
265-3-1 

265-3-2 

265-3-3 
265-3-4


Identification of Deposit 
Name of depositary institution 
Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

23 March 1999 (23.03.1999) 
ATCC 203863 


265-4 


Additional Indications 


NONE 


265-5 


Designated States for Which 
Indications are Made 


all designated States 


265-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


266 

266-1 
266-2 


The indications made below relate to 
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


103 
45 


266-3 
266-3-1 

266-3-2 

266-3-3 
266-3-4 


Identification of Deposit 
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas, 
Virginia 20110-2209, United States of
America 

09 March 1999 (09.03.1999) 
ATCC 203834 


266-4


Additional Indications

NONE 


266-5 


Designated States for Which 
Indications are Made 


all designated States 


266-6 


Separate Furnishing of Indications 

These indications will be submitted to 
the International Bureau later 


NONE 


267 

267-1
267-2


The indications made below relate to
the deposited microorganism(s) or
other biological material referred to in
the description on:
page

line


103
46


267-3
267-3-1 

267-3-2 

267-3-3 
267-3-4 


Identification of Deposit
Name of depositary institution 

Address of depositary institution 

Date of deposit 
Accession Number 


American Type Culture Collection 
10801 University Blvd., Manassas,
Virginia 20110-2209, United States of
America

20 July 1999 (20.07.1999)
ATCC PTA-382 


267-4 


Additional Indications 


NONE 


267-5 


Designated States for Which 
Indications are Made 


all designated States 


267-6 


Separate Furnishing of Indications 

These indications will be submitted to
the International Bureau later 


NONE 


FOR RECEIVING OFFICE USE ONLY 


0-4 


This form was received with the 
international application: 
(yes or no) 




0-4-1 


Authorized officer






FOR INTERNATIONAL BUREAU USE ONLY 


0-5 


This form was received by the 
International Bureau on:




0-5-1 


Authorized officer

WHAT IS CLAIMED IS:

1. Isolated nucleic acid having at least 80% nucleic acid sequence identity to a nucleotide sequence
that encodes an amino acid sequence selected from the group consisting of the amino acid sequence shown in
Figure 2 (SEQ ID NO:2), Figure 4 (SEQ ID NO:4), Figure 6 (SEQ ID NO:6), Figure 8 (SEQ ID NO:8),
Figure 10 (SEQ ID NO:10), Figure 12 (SEQ ID NO:12), Figure 14 (SEQ ID NO:14), Figure 16 (SEQ ID NO:16),
Figure 18 (SEQ ID NO:18), Figure 20 (SEQ ID NO:20), Figure 22 (SEQ ID NO:22), Figure 24 (SEQ ID NO:24),
Figure 26 (SEQ ID NO:26), Figure 28 (SEQ ID NO:28), Figure 30 (SEQ ID NO:30), Figure 32 (SEQ ID NO:32),
Figure 34 (SEQ ID NO:34), Figure 36 (SEQ ID NO:36), Figure 38 (SEQ ID NO:38), Figure 40 (SEQ ID NO:40),
Figure 42 (SEQ ID NO:42), Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID NO:46), Figure 48 (SEQ ID NO:48),
Figure 50 (SEQ ID NO:50), Figure 52 (SEQ ID NO:52), Figure 54 (SEQ ID NO:54), Figure 56 (SEQ ID NO:56),
Figure 58 (SEQ ID NO:58), Figure 60 (SEQ ID NO:60), Figure 62 (SEQ ID NO:62), Figure 64 (SEQ ID NO:64),
Figure 66 (SEQ ID NO:66), Figure 68 (SEQ ID NO:68), Figure 70 (SEQ ID NO:70), Figure 72 (SEQ ID NO:72),
Figure 74 (SEQ ID NO:74), Figure 76 (SEQ ID NO:76), Figure 78 (SEQ ID NO:78), Figure 80 (SEQ ID NO:80),
Figure 82 (SEQ ID NO:82), Figure 84 (SEQ ID NO:84), Figure 86 (SEQ ID NO:86), Figure 88 (SEQ ID NO:88),
Figure 90 (SEQ ID NO:90), Figure 92 (SEQ ID NO:92), Figure 94 (SEQ ID NO:94), Figure 96 (SEQ ID NO:96),
Figure 98 (SEQ ID NO:98), Figure 100 (SEQ ID NO:100), Figure 102 (SEQ ID NO:102), Figure 104 (SEQ ID NO:104),
Figure 106 (SEQ ID NO:106), Figure 108 (SEQ ID NO:108), Figure 110 (SEQ ID NO:110), Figure 112 (SEQ ID NO:112),
Figure 114 (SEQ ID NO:114), Figure 116 (SEQ ID NO:116), Figure 118 (SEQ ID NO:118), Figure 120 (SEQ ID NO:120),
Figure 122 (SEQ ID NO:122), Figure 124 (SEQ ID NO:124), Figure 126 (SEQ ID NO:126), Figure 128 (SEQ ID NO:128),
Figure 130 (SEQ ID NO:130), Figure 132 (SEQ ID NO:132), Figure 134 (SEQ ID NO:134), Figure 136 (SEQ ID NO:136),
Figure 138 (SEQ ID NO:138), Figure 140 (SEQ ID NO:140), Figure 142 (SEQ ID NO:142), Figure 144 (SEQ ID NO:144),
Figure 146 (SEQ ID NO:146), Figure 148 (SEQ ID NO:148), Figure 150 (SEQ ID NO:150), Figure 152 (SEQ ID NO:152),
Figure 154 (SEQ ID NO:154), Figure 156 (SEQ ID NO:156), Figure 158 (SEQ ID NO:158), Figure 160 (SEQ ID NO:160),
Figure 162 (SEQ ID NO:162), Figure 164 (SEQ ID NO:164), Figure 166 (SEQ ID NO:166), Figure 168 (SEQ ID NO:168),
Figure 170 (SEQ ID NO:170), Figure 172 (SEQ ID NO:172), Figure 174 (SEQ ID NO:174), Figure 176 (SEQ ID NO:176),
Figure 178 (SEQ ID NO:178), Figure 180 (SEQ ID NO:180), Figure 182 (SEQ ID NO:182), Figure 184 (SEQ ID NO:184),
Figure 186 (SEQ ID NO:186), Figure 188 (SEQ ID NO:188), Figure 190 (SEQ ID NO:190), Figure 192 (SEQ ID NO:192),
Figure 194 (SEQ ID NO:194), Figure 196 (SEQ ID NO:196), Figure 198 (SEQ ID NO:198), Figure 200 (SEQ ID NO:200),
Figure 202 (SEQ ID NO:202), Figure 204 (SEQ ID NO:204), Figure 206 (SEQ ID NO:206), Figure 208 (SEQ ID NO:208),
Figure 210 (SEQ ID NO:210), Figure 212 (SEQ ID NO:212), Figure 214 (SEQ ID NO:214), Figure 216 (SEQ ID NO:216),
Figure 218 (SEQ ID NO:218), Figure 220 (SEQ ID NO:220), Figure 222 (SEQ ID NO:222), Figure 224 (SEQ ID NO:224),
Figure 226 (SEQ ID NO:226), Figure 228 (SEQ ID NO:228), Figure 230 (SEQ ID NO:230), Figure 232 (SEQ ID NO:232),
Figure 234 (SEQ ID NO:234), Figure 236 (SEQ ID NO:236), Figure 238 (SEQ ID NO:238), Figure 240 (SEQ ID NO:240),
Figure 242 (SEQ ID NO:242), Figure 244 (SEQ ID NO:244), Figure 246 (SEQ ID NO:246), Figure 248 (SEQ ID NO:248),
Figure 250 (SEQ ID NO:250), Figure 252 (SEQ ID NO:252), Figure 254 (SEQ ID NO:254), Figure 256 (SEQ ID NO:256),
Figure 258 (SEQ ID NO:258), Figure 260 (SEQ ID NO:260), Figure 262 (SEQ ID NO:262), Figure 264 (SEQ ID NO:264),
Figure 266 (SEQ ID NO:266), Figure 268 (SEQ ID NO:268), Figure 270 (SEQ ID NO:270), Figure 272 (SEQ ID NO:272),
Figure 274 (SEQ ID NO:274), Figure 276 (SEQ ID NO:276), Figure 278 (SEQ ID NO:278), Figure 280 (SEQ ID NO:280),
Figure 282 (SEQ ID NO:282), Figure 284 (SEQ ID NO:284), Figure 286 (SEQ ID NO:286), Figure 288 (SEQ ID NO:288),
Figure 290 (SEQ ID NO:290), Figure 292 (SEQ ID NO:292), Figure 294 (SEQ ID NO:294), Figure 296 (SEQ ID NO:296),
Figure 298 (SEQ ID NO:298), Figure 300 (SEQ ID NO:300), Figure 302 (SEQ ID NO:302), Figure 304 (SEQ ID NO:304),
Figure 306 (SEQ ID NO:306), Figure 308 (SEQ ID NO:308), Figure 310 (SEQ ID NO:310), Figure 312 (SEQ ID NO:312),
Figure 314 (SEQ ID NO:314), Figure 316 (SEQ ID NO:316), Figure 318 (SEQ ID NO:318), Figure 320 (SEQ ID NO:320),
Figure 322 (SEQ ID NO:322), Figure 324 (SEQ ID NO:324), Figure 326 (SEQ ID NO:326), Figure 328 (SEQ ID NO:328),
Figure 330 (SEQ ID NO:330), Figure 332 (SEQ ID NO:332), Figure 334 (SEQ ID NO:334), Figure 336 (SEQ ID NO:336),
Figure 338 (SEQ ID NO:338), Figure 340 (SEQ ID NO:340), Figure 342 (SEQ ID NO:342), Figure 344 (SEQ ID NO:344),
Figure 346 (SEQ ID NO:346), Figure 348 (SEQ ID NO:348), Figure 350 (SEQ ID NO:350), Figure 352 (SEQ ID NO:352),
Figure 354 (SEQ ID NO:354), Figure 356 (SEQ ID NO:356), Figure 358 (SEQ ID NO:358), Figure 360 (SEQ ID NO:360),
Figure 362 (SEQ ID NO:362), Figure 364 (SEQ ID NO:364), Figure 366 (SEQ ID NO:366), Figure 368 (SEQ ID NO:368),
Figure 370 (SEQ ID NO:370), Figure 372 (SEQ ID NO:372), Figure 374 (SEQ ID NO:374), Figure 376 (SEQ ID NO:376),
Figure 378 (SEQ ID NO:378), Figure 380 (SEQ ID NO:380), Figure 382 (SEQ ID NO:382), Figure 384 (SEQ ID NO:384),
Figure 386 (SEQ ID NO:386), Figure 388 (SEQ ID NO:388), Figure 390 (SEQ ID NO:390), Figure 392 (SEQ ID NO:392),
Figure 394 (SEQ ID NO:394), Figure 396 (SEQ ID NO:396), Figure 398 (SEQ ID NO:398), Figure 400 (SEQ ID NO:400),
Figure 402 (SEQ ID NO:402), Figure 404 (SEQ ID NO:404), Figure 406 (SEQ ID NO:406), Figure 408 (SEQ ID NO:408),
Figure 410 (SEQ ID NO:410), Figure 412 (SEQ ID NO:412), Figure 414 (SEQ ID NO:414), Figure 416 (SEQ ID NO:416),
Figure 418 (SEQ ID NO:418), Figure 420 (SEQ ID NO:420), Figure 422 (SEQ ID NO:422), Figure 424 (SEQ ID NO:424),
Figure 426 (SEQ ID NO:426), Figure 428 (SEQ ID NO:428), Figure 430 (SEQ ID NO:430), Figure 432 (SEQ ID NO:432),
Figure 434 (SEQ ID NO:434), Figure 436 (SEQ ID NO:436), Figure 438 (SEQ ID NO:438), Figure 440 (SEQ ID NO:440),
Figure 442 (SEQ ID NO:442), Figure 444 (SEQ ID NO:444), Figure 446 (SEQ ID NO:446), Figure 448 (SEQ ID NO:448),
Figure 450 (SEQ ID NO:450), Figure 452 (SEQ ID NO:452), Figure 454 (SEQ ID NO:454), Figure 456 (SEQ ID NO:456),
Figure 458 (SEQ ID NO:458), Figure 460 (SEQ ID NO:460), Figure 462 (SEQ ID NO:462), Figure 464 (SEQ ID NO:464),
Figure 466 (SEQ ID NO:466), Figure 468 (SEQ ID NO:468), Figure 470 (SEQ ID NO:470), Figure 472 (SEQ ID NO:472),
Figure 474 (SEQ ID NO:474), Figure 476 (SEQ ID NO:476), Figure 478 (SEQ ID NO:478), Figure 480 (SEQ ID NO:480),
Figure 482 (SEQ ID NO:482), Figure 484 (SEQ ID NO:484), Figure 486 (SEQ ID NO:486), Figure 488 (SEQ ID NO:488),
Figure 490 (SEQ ID NO:490), Figure 492 (SEQ ID NO:492), Figure 494 (SEQ ID NO:494), Figure 496 (SEQ ID NO:496),
Figure 498 (SEQ ID NO:498), Figure 500 (SEQ ID NO:500), Figure 502 (SEQ ID NO:502), Figure 504 (SEQ ID NO:504),
Figure 506 (SEQ ID NO:506), Figure 508 (SEQ ID NO:508), Figure 510 (SEQ ID NO:510), Figure 512 (SEQ ID NO:512),
Figure 514 (SEQ ID NO:514), Figure 516 (SEQ ID NO:516), Figure 518 (SEQ ID NO:518), Figure 520 (SEQ ID NO:520),
Figure 522 (SEQ ID NO:522), Figure 524 (SEQ ID NO:524), Figure 526 (SEQ ID NO:526), Figure 528 (SEQ ID NO:528),
Figure 530 (SEQ ID NO:530), Figure 532 (SEQ ID NO:532), Figure 534 (SEQ ID NO:534), Figure 536 (SEQ ID NO:536),
Figure 538 (SEQ ID NO:538), Figure 540 (SEQ ID NO:540), Figure 542 (SEQ ID NO:542), Figure 544 (SEQ ID NO:544),
Figure 546 (SEQ ID NO:546), Figure 548 (SEQ ID NO:548) and Figure 550 (SEQ ID NO:550).



2. Isolated nucleic acid having at least 80% nucleic acid sequence identity to a nucleotide sequence
selected from the group consisting of the nucleotide sequence shown in
Figure 1 (SEQ ID NO:1), Figure 3 (SEQ ID NO:3), Figure 5 (SEQ ID NO:5), Figure 7 (SEQ ID NO:7),
Figure 9 (SEQ ID NO:9), Figure 11 (SEQ ID NO:11), Figure 13 (SEQ ID NO:13), Figure 15 (SEQ ID NO:15),
Figure 17 (SEQ ID NO:17), Figure 19 (SEQ ID NO:19), Figure 21 (SEQ ID NO:21), Figure 23 (SEQ ID NO:23),
Figure 25 (SEQ ID NO:25), Figure 27 (SEQ ID NO:27), Figure 29 (SEQ ID NO:29), Figure 31 (SEQ ID NO:31),
Figure 33 (SEQ ID NO:33), Figure 35 (SEQ ID NO:35), Figure 37 (SEQ ID NO:37), Figure 39 (SEQ ID NO:39),
Figure 41 (SEQ ID NO:41), Figure 43 (SEQ ID NO:43), Figure 45 (SEQ ID NO:45), Figure 47 (SEQ ID NO:47),
Figure 49 (SEQ ID NO:49), Figure 51 (SEQ ID NO:51), Figure 53 (SEQ ID NO:53), Figure 55 (SEQ ID NO:55),
Figure 57 (SEQ ID NO:57), Figure 59 (SEQ ID NO:59), Figure 61 (SEQ ID NO:61), Figure 63 (SEQ ID NO:63),
Figure 65 (SEQ ID NO:65), Figure 67 (SEQ ID NO:67), Figure 69 (SEQ ID NO:69), Figure 71 (SEQ ID NO:71),
Figure 73 (SEQ ID NO:73), Figure 75 (SEQ ID NO:75), Figure 77 (SEQ ID NO:77), Figure 79 (SEQ ID NO:79),
Figure 81 (SEQ ID NO:81), Figure 83 (SEQ ID NO:83), Figure 85 (SEQ ID NO:85), Figure 87 (SEQ ID NO:87),
Figure 89 (SEQ ID NO:89), Figure 91 (SEQ ID NO:91), Figure 93 (SEQ ID NO:93), Figure 95 (SEQ ID NO:95),
Figure 97 (SEQ ID NO:97), Figure 99 (SEQ ID NO:99), Figure 101 (SEQ ID NO:101), Figure 103 (SEQ ID NO:103),
Figure 105 (SEQ ID NO:105), Figure 107 (SEQ ID NO:107), Figure 109 (SEQ ID NO:109), Figure 111 (SEQ ID NO:111),
Figure 113 (SEQ ID NO:113), Figure 115 (SEQ ID NO:115), Figure 117 (SEQ ID NO:117), Figure 119 (SEQ ID NO:119),
Figure 121 (SEQ ID NO:121), Figure 123 (SEQ ID NO:123), Figure 125 (SEQ ID NO:125), Figure 127 (SEQ ID NO:127),
Figure 129 (SEQ ID NO:129), Figure 131 (SEQ ID NO:131), Figure 133 (SEQ ID NO:133), Figure 135 (SEQ ID NO:135),
Figure 137 (SEQ ID NO:137), Figure 139 (SEQ ID NO:139), Figure 141 (SEQ ID NO:141), Figure 143 (SEQ ID NO:143),
Figure 145 (SEQ ID NO:145), Figure 147 (SEQ ID NO:147), Figure 149 (SEQ ID NO:149), Figure 151 (SEQ ID NO:151),
Figure 153 (SEQ ID NO:153), Figure 155 (SEQ ID NO:155), Figure 157 (SEQ ID NO:157), Figure 159 (SEQ ID NO:159),
Figure 161 (SEQ ID NO:161), Figure 163 (SEQ ID NO:163), Figure 165 (SEQ ID NO:165), Figure 167 (SEQ ID NO:167),
Figure 169 (SEQ ID NO:169), Figure 171 (SEQ ID NO:171), Figure 173 (SEQ ID NO:173), Figure 175 (SEQ ID NO:175),
Figure 177 (SEQ ID NO:177), Figure 179 (SEQ ID NO:179), Figure 181 (SEQ ID NO:181), Figure 183 (SEQ ID NO:183),
Figure 185 (SEQ ID NO:185), Figure 187 (SEQ ID NO:187), Figure 189 (SEQ ID NO:189), Figure 191 (SEQ ID NO:191),
Figure 193 (SEQ ID NO:193), Figure 195 (SEQ ID NO:195), Figure 197 (SEQ ID NO:197), Figure 199 (SEQ ID NO:199),
Figure 201 (SEQ ID NO:201), Figure 203 (SEQ ID NO:203), Figure 205 (SEQ ID NO:205), Figure 207 (SEQ ID NO:207),
Figure 209 (SEQ ID NO:209), Figure 211 (SEQ ID NO:211), Figure 213 (SEQ ID NO:213), Figure 215 (SEQ ID NO:215),
Figure 217 (SEQ ID NO:217), Figure 219 (SEQ ID NO:219), Figure 221 (SEQ ID NO:221), Figure 223 (SEQ ID NO:223),
Figure 225 (SEQ ID NO:225), Figure 227 (SEQ ID NO:227), Figure 229 (SEQ ID NO:229), Figure 231 (SEQ ID NO:231),
Figure 233 (SEQ ID NO:233), Figure 235 (SEQ ID NO:235), Figure 237 (SEQ ID NO:237), Figure 239 (SEQ ID NO:239),
Figure 241 (SEQ ID NO:241), Figure 243 (SEQ ID NO:243), Figure 245 (SEQ ID NO:245), Figure 247 (SEQ ID NO:247),
Figure 249 (SEQ ID NO:249), Figure 251 (SEQ ID NO:251), Figure 253 (SEQ ID NO:253), Figure 255 (SEQ ID NO:255),
Figure 257 (SEQ ID NO:257), Figure 259 (SEQ ID NO:259), Figure 261 (SEQ ID NO:261), Figure 263 (SEQ ID NO:263),
Figure 265 (SEQ ID NO:265), Figure 267 (SEQ ID NO:267), Figure 269 (SEQ ID NO:269), Figure 271 (SEQ ID NO:271),
Figure 273 (SEQ ID NO:273), Figure 275 (SEQ ID NO:275), Figure 277 (SEQ ID NO:277), Figure 279 (SEQ ID NO:279),
Figure 281 (SEQ ID NO:281), Figure 283 (SEQ ID NO:283), Figure 285 (SEQ ID NO:285), Figure 287 (SEQ ID NO:287),
Figure 289 (SEQ ID NO:289), Figure 291 (SEQ ID NO:291), Figure 293 (SEQ ID NO:293), Figure 295 (SEQ ID NO:295),
Figure 297 (SEQ ID NO:297), Figure 299 (SEQ ID NO:299), Figure 301 (SEQ ID NO:301), Figure 303 (SEQ ID NO:303),
Figure 305 (SEQ ID NO:305), Figure 307 (SEQ ID NO:307), Figure 309 (SEQ ID NO:309), Figure 311 (SEQ ID NO:311),
Figure 313 (SEQ ID NO:313), Figure 315 (SEQ ID NO:315), Figure 317 (SEQ ID NO:317), Figure 319 (SEQ ID NO:319),
Figure 321 (SEQ ID NO:321), Figure 323 (SEQ ID NO:323), Figure 325 (SEQ ID NO:325), Figure 327 (SEQ ID NO:327),
Figure 329 (SEQ ID NO:329), Figure 331 (SEQ ID NO:331), Figure 333 (SEQ ID NO:333), Figure 335 (SEQ ID NO:335),
Figure 337 (SEQ ID NO:337), Figure 339 (SEQ ID NO:339), Figure 341 (SEQ ID NO:341), Figure 343 (SEQ ID NO:343),
Figure 345 (SEQ ID NO:345), Figure 347 (SEQ ID NO:347), Figure 349 (SEQ ID NO:349), Figure 351 (SEQ ID NO:351),
Figure 353 (SEQ ID NO:353), Figure 355 (SEQ ID NO:355), Figure 357 (SEQ ID NO:357), Figure 359 (SEQ ID NO:359),
Figure 361 (SEQ ID NO:361), Figure 363 (SEQ ID NO:363), Figure 365 (SEQ ID NO:365), Figure 367 (SEQ ID NO:367),
Figure 369 (SEQ ID NO:369), Figure 371 (SEQ ID NO:371), Figure 373 (SEQ ID NO:373), Figure 375 (SEQ ID NO:375),
Figure 377 (SEQ ID NO:377), Figure 379 (SEQ ID NO:379), Figure 381 (SEQ ID NO:381), Figure 383 (SEQ ID NO:383),
Figure 385 (SEQ ID NO:385), Figure 387 (SEQ ID NO:387), Figure 389 (SEQ ID NO:389), Figure 391 (SEQ ID NO:391),
Figure 393 (SEQ ID NO:393), Figure 395 (SEQ ID NO:395), Figure 397 (SEQ ID NO:397), Figure 399 (SEQ ID NO:399),
Figure 401 (SEQ ID NO:401), Figure 403 (SEQ ID NO:403), Figure 405 (SEQ ID NO:405), Figure 407 (SEQ ID NO:407),
Figure 409 (SEQ ID NO:409), Figure 411 (SEQ ID NO:411), Figure 413 (SEQ ID NO:413), Figure 415 (SEQ ID NO:415),
Figure 417 (SEQ ID NO:417), Figure 419 (SEQ ID NO:419), Figure 421 (SEQ ID NO:421), Figure 423 (SEQ ID NO:423),
Figure 425 (SEQ ID NO:425), Figure 427 (SEQ ID NO:427), Figure 429 (SEQ ID NO:429), Figure 431 (SEQ ID NO:431),
Figure 433 (SEQ ID NO:433), Figure 435 (SEQ ID NO:435), Figure 437 (SEQ ID NO:437), Figure 439 (SEQ ID NO:439),
Figure 441 (SEQ ID NO:441), Figure 443 (SEQ ID NO:443), Figure 445 (SEQ ID NO:445), Figure 447 (SEQ ID NO:447),
Figure 449 (SEQ ID NO:449), Figure 451 (SEQ ID NO:451), Figure 453 (SEQ ID NO:453), Figure 455 (SEQ ID NO:455),
Figure 457 (SEQ ID NO:457), Figure 459 (SEQ ID NO:459), Figure 461 (SEQ ID NO:461), Figure 463 (SEQ ID NO:463),
Figure 465 (SEQ ID NO:465), Figure 467 (SEQ ID NO:467), Figure 469 (SEQ ID NO:469), Figure 471 (SEQ ID NO:471),
Figure 473 (SEQ ID NO:473), Figure 475 (SEQ ID NO:475), Figure 477 (SEQ ID NO:477), Figure 479 (SEQ ID NO:479),
Figure 481 (SEQ ID NO:481), Figure 483 (SEQ ID NO:483), Figure 485 (SEQ ID NO:485), Figure 487 (SEQ ID NO:487),
Figure 489 (SEQ ID NO:489), Figure 491 (SEQ ID NO:491), Figure 493 (SEQ ID NO:493), Figure 495 (SEQ ID NO:495),
Figure 497 (SEQ ID NO:497), Figure 499 (SEQ ID NO:499), Figure 501 (SEQ ID NO:501), Figure 503 (SEQ ID NO:503),
Figure 505 (SEQ ID NO:505), Figure 507 (SEQ ID NO:507), Figure 509 (SEQ ID NO:509), Figure 511 (SEQ ID NO:511),
Figure 513 (SEQ ID NO:513), Figure 515 (SEQ ID NO:515), Figure 517 (SEQ ID NO:517), Figure 519 (SEQ ID NO:519),
Figure 521 (SEQ ID NO:521), Figure 523 (SEQ ID NO:523), Figure 525 (SEQ ID NO:525), Figure 527 (SEQ ID NO:527),
Figure 529 (SEQ ID NO:529), Figure 531 (SEQ ID NO:531), Figure 533 (SEQ ID NO:533), Figure 535 (SEQ ID NO:535),
Figure 537 (SEQ ID NO:537), Figure 539 (SEQ ID NO:539), Figure 541 (SEQ ID NO:541), Figure 543 (SEQ ID NO:543),
Figure 545 (SEQ ID NO:545), Figure 547 (SEQ ID NO:547) and Figure 549 (SEQ ID NO:549).

3. Isolated nucleic acid having at least 80% nucleic acid sequence idwitily to a nucleotide sequence 
selected from the group consisting of the full-length coding sequence of Ae nucleotide sequence shown in Figure 
1 (SEQ ID NO:l). Figure 3 (SEQ ID N0:3). Figure 5 (SEQ ID N0:5). Figure 7 (SEQ ID N0:7). Figure 9 
(SEQ ID NO:9). Figure 11 (SEQ ID N0:11). Figure 13 (SEQ ID NO:13). Figure 15 (SEQ ID N0:15). Figure 
17 (SEQ ID N0:17), Figure 19 (SEQ ID NO:19). Figure 21 (SEQ ID N0:21). Figure 23 (SEQ ID NO:23), 
Figure 25 (SEQ ID NO:25). Figure 27 (SEQ ID NO:27). Figure 29 (SEQ ID NO:29). Figure 31 (SEQ ID 
N0:31). Figure 33 (SEQ ID NO:33). Figure 35 (SEQ ID NO:35). Figure 37 (SEQ ID NO:37). Figure 39 (SEQ 
ID NO:39). Figure 41 (SEQ ID NO:41), Figure 43 (SEQ ID NO:43). Figure 45 (SEQ ID NO:45). Figure 47 
(SEQ ID NO:47). Figure 49 (SEQ ID NO:49). Figure 51 (SEQ ID NO:51). Figure 53 (SEQ ID NO:53). Figure 
55 (SEQ ID NO:55), Figure 57 (SEQ ID NO:57). Figure 59 (SEQ ID NO:59), Figure 61 (SEQ ID N0:61). 
Figure 63 (SEQ ID NO:63). Figure 65 (SEQ ID NO:65). Figure 67 (SEQ ID NO:67). Figure 69 (SEQ ID 
NO:69). Figure 71 (SEQ ID NO:71). Figure 73 (SEQ ID NO:73). Figure 75 (SEQ ID NO:75), Figure 77 (SEQ 
ID NO:77). Figure 79 (SEQ ID NO:79). Figure 81 (SEQ ID N0:81). Figure 83 (SEQ ID NO:83). Figure 85 
(SEQ ID NO:85). Figure 87 (SEQ ID NO:87). Figure 89 (SEQ ID NO:89). Figure 91 (SEQ ID N0:91), Figure 
93 (SEQ ID NO:93). Figure 95 (SEQ ID NO:95), Figure 97 (SEQ ID NO:97). Figure 99 (SEQ ID NO:99), 
Figure 101 (SEQ ID NO.lOl). Figure 103 (SEQ ID NO:103), Figure 105 (SEQ ID NO:105), Figure 107 (SEQ 
ID NO:107). Figure 109 (SEQ ID NO:109), Figure 111 (SEQ ID NO:lll). Figure 113 (SEQ ID NO:113). 
Figure 115 (SEQ ID NO: 1 15). Figure 117 (SEQ ID NO: 1 17). Figure 1 19 (SEQ ID NOrl 19). Figure 121 (SEQ 
ID NO:121). Figure 123 (SEQ ID NO:123). Figure 125 (SEQ ID NO:125). Figure 127 (SEQ ID NO:127). 
Figure 129 (SEQ ID NO:129). Figure 131 (SEQ ID NO:131). Figure 133 (SEQ ID NO:133). Figure 135 (SEQ 
ID NO:135). Figure 137 (SEQ ID NO:137). Figure 139 (SEQ ID NO:1390). Figure 141 (SEQ ID NO:141). 
Figure 143 (SEQ ID NO:143). Figure 145 (SEQ ID NO: 145). Figure 147 (SEQ ID NO: 147). Figure 149 (SEQ 
ID NO:149). Figure 151 (SEQ ID NO:151). Figure 153 (SEQ ID NO:153). Figure 155 (SEQ ID NO:155), 
Figure 157 (SEQ ID NO:157). Figure 159 (SEQ ID NO: 159). Figure 161 (SEQ ID NO:161). Figure 163 (SEQ 
ID NO:163). Figure 165 (SEQ ID NO:165). Figure 167 (SEQ ID NO:167). Figure 169 (SEQ ID NO:169). 
Figure 171 (SEQ ID NO: 171). Figure 173 (SEQ ID NO:173). Figure 175 (SEQ ID NO: 175). Figure 177 (SEQ 
ID NO:177). Figure 179 (SEQ ID NO:179). Figure 181 (SEQ ID NO:181). Figure 183 (SEQ ID NO:183). 
Figure 185 (SEQ ID NO: 185). Figure 187 (SEQ ID NO:187). Figure 189 (SEQ ID NO:189). Figure 191 (SEQ 
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ID NO: 191), Figure 193 (SEQ ID NO: 193), Figure 195 (SEQ ID NO: 195). Figure 197 (SEQ ID NO: 197), 
Figure 199 (SEQ ID NO: 199), Figure 201 (SEQ ID NO:201), Figure 203 (SEQ ID NO:203). Figure 205 (SEQ 
ID NO:205), Figure 207 (SEQ ID NO:207). Figure 209 (SEQ ID NO:209). Figure 211 (SEQ ID N0:211), 
Figure 213 (SEQ ID N0:213), Figure 215 (SEQ ID NO:215), Figure 217 (SEQ ID NO:217), Figure 219 (SEQ 
ID NO:219), Figure 221 (SEQ ID NO:221). Figure 223 (SEQ ID NO:223). Figure 225 (SEQ ID NO:225), 
Figure 227 (SEQ ID NO:227). Figure 229 (SEQ ID NO:229), Figure 23 1 (SEQ ID NO:23 1). Figure 233 (SEQ 
ID NO:233), Figure 235 (SEQ ID NO:235), Figure 237 (SEQ ID NO:237), Figure 239 (SEQ ID NO:239), 
Figure 241 (SEQ ID NO:241), Figure 243 (SEQ ID NO:243), Figure 245 (SEQ ID NO:245). Figure 247 (SEQ 
ID N6:247), Figure 249 (SEQ ID NO:249). Figure 251 (SEQ ID NO:251). Figure 253 (SEQ ID NO:253), 
Figure 255 (SEQ ID NO:255), Figure 257 (SEQ ID NO:257), Figure 259 (SEQ ID NO:259), Figure 261 (SEQ 
ID NO:261), Figure 263 (SEQ ID NO:263). Figure 265 (SEQ ID NO:265). Figure 267 (SEQ ID NO:267). 
Figure 269 (SEQ ID NO:269), Figure 271 (SEQ ID NO:271), Figure 273 (SEQ ID NO:273), Figure 275 (SEQ 
ID NO:275). Figure 277 (SEQ ID NO:277), Figure 279 (SEQ ID NO:279), Figure 281 (SEQ ID NO:281), 
Figure 283 (SEQ ID NO:283). Figure 285 (SEQ ID NO:285). Figure 287 (SEQ ID NO:287), Figure 289 (SEQ 
ID NO:289), Figure 291 (SEQ ID NO:291), Figure 293 (SEQ ID NO:293). Figure 295 (SEQ ID NO:295), 
Figure 297 (SEQ ID NO:297), Figure 299 (SEQ ID NO:299). Figure 301 (SEQ ID NO:301), Figure 303 (SEQ 
ID NO:303), Figure 305 (SEQ ID NO:305). Figure 307 (SEQ ID NO:307). Figure 309 (SEQ ID NO:309), 
Figure 311 (SEQ ID NO:311), Figure 313 (SEQ ID NO:313), Figure 315 (SEQ ID NO:315), Figure 317 (SEQ
ID NO:317), Figure 319 (SEQ ID NO:319), Figure 321 (SEQ ID NO:321), Figure 323 (SEQ ID NO:323). 
Figure 325 (SEQ ID NO:325), Figure 327 (SEQ ID NO:327). Figure 329 (SEQ ID NO:329), Figure 331 (SEQ 
ID NO:331), Figure 333 (SEQ ID NO:333), Figure 335 (SEQ ID NO:335), Figure 337 (SEQ ID NO:337),
Figure 339 (SEQ ID NO:339), Figure 341 (SEQ ID NO:341), Figure 343 (SEQ ID NO:343), Figure 345 (SEQ 
ID NO:345), Figure 347 (SEQ ID NO:347), Figure 349 (SEQ ID NO:349). Figure 351 (SEQ ID NO:351), 
Figure 353 (SEQ ID NO:353), Figure 355 (SEQ ID NO:355), Figure 357 (SEQ ID NO:357), Figure 359 (SEQ 
ID NO:359). Figure 361 (SEQ ID NO:361). Figure 363 (SEQ ID NO:363), Figure 365 (SEQ ID NO:365), 
Figure 367 (SEQ ID NO:367), Figure 369 (SEQ ID NO:369). Figure 371 (SEQ ID NO:371), Figure 373 (SEQ 
ID NO:373). Figure 375 (SEQ ID NO:375), Figure 377 (SEQ ID NO:377), Figure 379 (SEQ ID NO:379), 
Figure 381 (SEQ ID NO:381), Figure 383 (SEQ ID NO:383). Figure 385 (SEQ ID NO:385). Figure 387 (SEQ 
ID NO:387), Figure 389 (SEQ ID NO:389), Figure 391 (SEQ ID NO:391), Figure 393 (SEQ ID NO:393). 
Figure 395 (SEQ ID NO:395), Figure 397 (SEQ ID NO:397), Figure 399 (SEQ ID NO:399). Figure 401 (SEQ 
ID NO:401). Figure 403 (SEQ ID NO:403), Figure 405 (SEQ ID NO:405), Figure 407 (SEQ ID NO:407), 
Figure 409 (SEQ ID NO:409), Figure 411 (SEQ ID NO:411), Figure 413 (SEQ ID NO:413), Figure 415 (SEQ
ID NO:415), Figure 417 (SEQ ID NO:417), Figure 419 (SEQ ID NO:419), Figure 421 (SEQ ID NO:421),
Figure 423 (SEQ ID NO:423), Figure 425 (SEQ ID NO:425), Figure 427 (SEQ ID NO:427), Figure 429 (SEQ 
ID NO:429). Figure 431 (SEQ ID NO:431), Figure 433 (SEQ ID NO:433), Figure 435 (SEQ ID NO:435), 
Figure 437 (SEQ ID NO:437), Figure 439 (SEQ ID NO:439), Figure 441 (SEQ ID NO:441), Figure 443 (SEQ
ID NO:443), Figure 445 (SEQ ID NO:445), Figure 447 (SEQ ID NO:447), Figure 449 (SEQ ID NO:449), 
Figure 451 (SEQ ID NO:451), Figure 453 (SEQ ID NO:453), Figure 455 (SEQ ID NO:455). Figure 457 (SEQ 
ID NO:457), Figure 459 (SEQ ID NO:459), Figure 461 (SEQ ID NO:461), Figure 463 (SEQ ID NO:463),
Figure 465 (SEQ ID NO:465), Figure 467 (SEQ ID NO:467), Figure 469 (SEQ ID NO:469), Figure 471 (SEQ
ID NO:471). Figure 473 (SEQ ID NO:473), Figure 475 (SEQ ID NO:475). Figure 477 (SEQ ID NO:477). 
Figure 479 (SEQ ID NO:479). Figure 481 (SEQ ID NO:481). Figure 483 (SEQ ID NO:483). Figure 485 (SEQ 
ID NO:485). Figure 487 (SEQ ID NO:487). Figure 489 (SEQ ID NO:489). Figure 491 (SEQ ID NO:491), 
Figure 493 (SEQ ID NO:493). Figure 495 (SEQ ID NO:495). Figure 497 (SEQ ID NO:497). Figure 499 (SEQ 
ID NO:499). Figure 501 (SEQ ID NO:501). Figure 503 (SEQ ID NO:503). Figure 505 (SEQ ID NO:505). 
Figure 507 (SEQ ID NO:507), Figure 509 (SEQ ID NO:509), Figure 511 (SEQ ID NO:511), Figure 513 (SEQ
ID NO:513). Figure 515 (SEQ ID NO:515). Figure 517 (SEQ ID NO:517). Figure 519 (SEQ ID NO:519). 
Figure 521 (SEQ ID NO:521). Figure 523 (SEQ ID NO:523). Figure 525 (SEQ ID NO:525). Figure 527 (SEQ 
ID NO:527). Figure 529 (SEQ ID NO:529). Figure 531 (SEQ ID NO:531). Figure 533 (SEQ ID NO:533). 
Figure 535 (SEQ ID NO:535). Figure 537 (SEQ ID NO:537). Figure 539 (SEQ ID NO:539). Figure 541 (SEQ 
ID NO:541), Figure 543 (SEQ ID NO:543), Figure 545 (SEQ ID NO:545), Figure 547 (SEQ ID NO:547) and
Figure 549 (SEQ ID NO:549). 

4. Isolated nucleic acid having at least 80% nucleic acid sequence identity to the full-length coding
sequence of the DNA deposited under any ATCC accession number shown in Table 7. 

5. A vector comprising the nucleic acid of Claim 1.

6. The vector of Claim 5 operably linked to control sequences recognized by a host cell
transformed with the vector. 

7. A host cell comprising the vector of Claim 5.

8. The host cell of Claim 7, wherein said cell is a CHO cell. 

9. The host cell of Claim 7, wherein said cell is an E. coli.

10. The host cell of Claim 7, wherein said cell is a yeast cell. 

11. A process for producing a PRO polypeptide comprising culturing the host cell of Claim 7
under conditions suitable for expression of said PRO polypeptide and recovering said PRO polypeptide from the 
cell culture. 

12. An isolated polypeptide having at least 80% amino acid sequence identity to an amino acid
sequence selected from the group consisting of the amino acid sequence shown in Figure 2 (SEQ ID NO:2),
Figure 4 (SEQ ID NO:4), Figure 6 (SEQ ID NO:6), Figure 8 (SEQ ID NO:8), Figure 10 (SEQ ID NO:10),
Figure 12 (SEQ ID NO:12), Figure 14 (SEQ ID NO:14), Figure 16 (SEQ ID NO:16), Figure 18 (SEQ ID
NO: 18), Figure 20 (SEQ ID NO:20), Figure 22 (SEQ ID NO:22). Figure 24 (SEQ ID NO:24), Figure 26 (SEQ 
ID NO:26), Figure 28 (SEQ ID NO:28). Figure 30 (SEQ ID NO:30). Figure 32 (SEQ ID NO:32), Figure 34 
(SEQ ID NO:34), Figure 36 (SEQ ID NO:36), Figure 38 (SEQ ID NO:38). Figure 40 (SEQ ID NO:40), Figure 
42 (SEQ ID NO:42), Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID NO:46), Figure 48 (SEQ ID NO:48),
Figure 50 (SEQ ID NO:50), Figure 52 (SEQ ID NO:52), Figure 54 (SEQ ID NO:54), Figure 56 (SEQ ID
NO:56). Figure 58 (SEQ ID NO:58), Figure 60 (SEQ ID NO:60). Figure 62 (SEQ ID NO:62), Figure 64 (SEQ 
ID NO:64), Figure 66 (SEQ ID NO:66). Figure 68 (SEQ ID NO:68). Figure 70 (SEQ ID NO:70), Figure 72 
(SEQ ID NO:72). Figure 74 (SEQ ID NO:74), Figure 76 (SEQ ID NO:76). Figure 78 (SEQ ID NO:78), Figure 
80 (SEQ ID NO:80). Figure 82 (SEQ ID NO:82). Figure 84 (SEQ ID NO:84). Figure 86 (SEQ ID NO:86), 
Figure 88 (SEQ ID NO:88), Figure 90 (SEQ ID NO:90), Figure 92 (SEQ ID NO:92), Figure 94 (SEQ ID
NO:94), Figure 96 (SEQ ID NO:96), Figure 98 (SEQ ID NO:98). Figure 100 (SEQ ID NO: 100), Figure 102 
(SEQ ID NO: 102), Figure 104 (SEQ ID NO: 104). Figure 106 (SEQ ID NO: 106). Figure 108 (SEQ ID 
NO:108), Figure 110 (SEQ ID NO:110), Figure 112 (SEQ ID NO:112), Figure 114 (SEQ ID NO:114), Figure
116 (SEQ ID NO:116), Figure 118 (SEQ ID NO:118), Figure 120 (SEQ ID NO:120), Figure 122 (SEQ ID
NO:122), Figure 124 (SEQ ID NO:124), Figure 126 (SEQ ID NO:126), Figure 128 (SEQ ID NO:128), Figure
130 (SEQ ID NO: 130). Figure 132 (SEQ ID NO: 132), Figure 134 (SEQ ID NO: 134), Figure 136 (SEQ ID 
NO: 136), Figure 138 (SEQ ID NO: 138), Figure 140 (SEQ ID NO: 140), Figure 142 (SEQ ID NO: 142). Figure 
144 (SEQ ID NO:144), Figure 146 (SEQ ID NO:146), Figure 148 (SEQ ID NO:148). Figure 150 (SEQ ID 
NO:150), Figure 152 (SEQ ID NO:152), Figure 154 (SEQ ID NO:154), Figure 156 (SEQ ID NO:156), Figure
158 (SEQ ID NO:158), Figure 160 (SEQ ID NO:160), Figure 162 (SEQ ID NO:162), Figure 164 (SEQ ID
NO: 164), Figure 166 (SEQ ID NO: 166), Figure 168 (SEQ ID NO: 168), Figure 170 (SEQ ID NO: 170), Figure 
172 (SEQ ID NO:172). Figure 174 (SEQ ID NO: 174), Figure 176 (SEQ ID NO: 176), Figure 178 (SEQ ID 
NO: 178), Figure 180 (SEQ ID NO: 180), Figure 182 (SEQ ID NO: 182), Figure 184 (SEQ ID NO: 184). Figure 
186 (SEQ ID NO:186), Figure 188 (SEQ ID NO:188), Figure 190 (SEQ ID NO:190), Figure 192 (SEQ ID
NO:192), Figure 194 (SEQ ID NO:194), Figure 196 (SEQ ID NO:196), Figure 198 (SEQ ID NO:198), Figure
200 (SEQ ID NO:200), Figure 202 (SEQ ID NO:202), Figure 204 (SEQ ID NO:204), Figure 206 (SEQ ID 
NO:206), Figure 208 (SEQ ID NO:208). Figure 210 (SEQ ID NO:210), Figure 212 (SEQ ID NO:212). Figure 
214 (SEQ ID NO:214), Figure 216 (SEQ ID NO:216), Figure 218 (SEQ ID NO:218), Figure 220 (SEQ ID
NO:220), Figure 222 (SEQ ID NO:222), Figure 224 (SEQ ID NO:224), Figure 226 (SEQ ID NO:226), Figure
228 (SEQ ID NO:228), Figure 230 (SEQ ID NO:230), Figure 232 (SEQ ID NO:232), Figure 234 (SEQ ID
NO:234), Figure 236 (SEQ ID NO:236), Figure 238 (SEQ ID NO:238), Figure 240 (SEQ ID NO:240), Figure 
242 (SEQ ID NO:242), Figure 244 (SEQ ID NO:244). Figure 246 (SEQ ID NO:246). Figure 248 (SEQ ID 
NO:248). Figure 250 (SEQ ID NO:250). Figure 252 (SEQ ID NO:252), Figure 254 (SEQ ID NO:254), Figure 
256 (SEQ ID NO:256), Figure 258 (SEQ ID NO:258), Figure 260 (SEQ ID NO:260), Figure 262 (SEQ ID
NO:262), Figure 264 (SEQ ID NO:264), Figure 266 (SEQ ID NO:266), Figure 268 (SEQ ID NO:268), Figure
270 (SEQ ID NO:270), Figure 272 (SEQ ID NO:272). Figure 274 (SEQ ID NO:274). Figure 276 (SEQ ID 
NO:276), Figure 278 (SEQ ID NO:278), Figure 280 (SEQ ID NO:280). Figure 282 (SEQ ID NO:282), Figure 
284 (SEQ ID NO:284), Figure 286 (SEQ ID NO:286), Figure 288 (SEQ ID NO:288), Figure 290 (SEQ ID
NO:290). Figure 292 (SEQ ID NO:292), Figure 294 (SEQ ID NO:294). Figure 296 (SEQ ID NO:296), Figure 
298 (SEQ ID NO:298). Figure 300 (SEQ ID NO:300). Figure 302 (SEQ ID NO:302), Figure 304 (SEQ ID 
NO:304). Figure 306 (SEQ ID NO:306). Figure 308 (SEQ ID NO:308). Figure 310 (SEQ ID NO:310). Figure 
312 (SEQ ID NO:312), Figure 314 (SEQ ID NO:314), Figure 316 (SEQ ID NO:316), Figure 318 (SEQ ID
NO:318). Figure 320 (SEQ ID NO:320). Figure 322 (SEQ ID NO:322). Figure 324 (SEQ ID NO:324). Figure 
326 (SEQ ID NO:326). Figure 328 (SEQ ID NO:328). Figure 330 (SEQ ID NO:330). Figure 332 (SEQ ID 
NO:332). Figure 334 (SEQ ID NO:334), Figure 336 (SEQ ID NO:336). Figure 338 (SEQ ID NO:338). Figure 
340 (SEQ ID NO:340). Figure 342 (SEQ ID NO:342). Figure 344 (SEQ ID NO:344). Figure 346 (SEQ ID 
NO:346), Figure 348 (SEQ ID NO:348). Figure 350 (SEQ ID NO:350), Figure 352 (SEQ ID NO:352). Figure 
354 (SEQ ID NO:354). Figure 356 (SEQ ID NO:356). Figure 358 (SEQ ID NO:358). Figure 360 (SEQ ID 
NO:360). Figure 362 (SEQ ID NO:362). Figure 364 (SEQ ID NO:364). Figure 366 (SEQ ID NO:366). Figure 
368 (SEQ ID NO:368), Figure 370 (SEQ ID NO:370). Figure 372 (SEQ ID NO:372). Figure 374 (SEQ ID 
NO:374). Figure 376 (SEQ ID NO:376). Figure 378 (SEQ ID NO:378). Figure 380 (SEQ ID NO:380). Figure 
382 (SEQ ID NO:382). Figure 384 (SEQ ID NO:384). Figure 386 (SEQ ID NO:386). Figure 388 (SEQ ID 
NO:388). Figure 390 (SEQ ID NO:390). Figure 392 (SEQ ID NO:392). Figure 394 (SEQ ID NO:394). Figure 
396 (SEQ ID NO:396), Figure 398 (SEQ ID NO:398). Figure 400 (SEQ ID NO:400). Figure 402 (SEQ ID 
NO:402). Figure 404 (SEQ ID NO:404). Figure 406 (SEQ ID NO:406). Figure 408 (SEQ ID NO:408). Figure 
410 (SEQ ID NO:410). Figure 412 (SEQ ID NO:412), Figure 414 (SEQ ID NO:414). Figure 416 (SEQ ID 
NO:416). Figure 418 (SEQ ID NO:418). Figure 420 (SEQ ID NO:420). Figure 422 (SEQ ID NO:422). Figure 
424 (SEQ ID NO:424). Figure 426 (SEQ ID NO:426), Figure 428 (SEQ ID NO:428), Figure 430 (SEQ ID 
NO:430). Figure 432 (SEQ ID NO:432). Figure 434 (SEQ ID NO:434). Figure 436 (SEQ ID NO:436), Figure 
438 (SEQ ID NO:438), Figure 440 (SEQ ID NO:440), Figure 442 (SEQ ID NO:442). Figure 444 (SEQ ID 
NO:444). Figure 446 (SEQ ID NO:446). Figure 448 (SEQ ID NO:448). Figure 450 (SEQ ID NO:450). Figure 
452 (SEQ ID NO:452). Figure 454 (SEQ ID NO:454). Figure 456 (SEQ ID NO:456). Figure 458 (SEQ ID 
NO:458). Figure 460 (SEQ ID NO:460), Figure 462 (SEQ ID NO:462). Figure 464 (SEQ ID NO:464). Figure 
466 (SEQ ID NO:466). Figure 468 (SEQ ID NO:468). Figure 470 (SEQ ID NO:470). Figure 472 (SEQ ID 
NO:472). Figure 474 (SEQ ID NO:474). Figure 476 (SEQ ID NO:476). Figure 478 (SEQ ID NO:478). Figure 
480 (SEQ ID NO:480). Figure 482 (SEQ ID NO:482). Figure 484 (SEQ ID NO:484). Figure 486 (SEQ ID 
NO:486). Figure 488 (SEQ ID NO:488). Figure 490 (SEQ ID NO:490), Figure 492 (SEQ ID NO:492). Figure 
494 (SEQ ID NO:494). Figure 496 (SEQ ID NO:496), Figure 498 (SEQ ID NO:498). Figure 500 (SEQ ID 
NO:500), Figure 502 (SEQ ID NO:502), Figure 504 (SEQ ID NO:504), Figure 506 (SEQ ID NO:506), Figure
508 (SEQ ID NO:508), Figure 510 (SEQ ID NO:510), Figure 512 (SEQ ID NO:512), Figure 514 (SEQ ID
NO:514). Figure 516 (SEQ ID NO:516). Figure 518 (SEQ ID NO:518). Figure 520 (SEQ ID NO:520). Figure 
522 (SEQ ID NO:522). Figure 524 (SEQ ID NO:524). Figure 526 (SEQ ID NO:526). Figure 528 (SEQ ID 
NO:528). Figure 530 (SEQ ID NO:530). Figure 532 (SEQ ID NO:532). Figure 534 (SEQ ID NO:534). Figure 
536 (SEQ ID NO:536). Figure 538 (SEQ ID NO:538). Figure 540 (SEQ ID NO:540), Figure 542 (SEQ ID 
NO:542). Figure 544 (SEQ ID NO:544). Figure 546 (SEQ ID NO:546). Figure 548 (SEQ ID NO:548) and 
Figure 550 (SEQ ID NO:550).

13. An isolated polypeptide having at least 80% amino acid sequence identity to an amino acid 
sequence encoded by the full-length coding sequence of the DNA deposited under any ATCC accession number 
shown in Table 7. 



14. A chimeric molecule comprising a polypeptide according to Claim 12 fused to a heterologous
amino acid sequence. 

15. The chimeric molecule of Claim 14, wherein said heterologous amino acid sequence is an 
epitope tag sequence. 

16. The chimeric molecule of Claim 14, wherein said heterologous amino acid sequence is an Fc
region of an immunoglobulin. 

17. An antibody which specifically binds to a polypeptide according to Claim 12. 

18. The antibody of Claim 17, wherein said antibody is a monoclonal antibody, a humanized 
antibody or a single-chain antibody. 

19. Isolated nucleic acid having at least 80% nucleic acid sequence identity to:

(a) a nucleotide sequence encoding the polypeptide shown in Figure 2 (SEQ ID NO:2), Figure 4 
(SEQ ID NO:4), Figure 6 (SEQ ID NO:6), Figure 8 (SEQ ID NO:8), Figure 10 (SEQ ID NO:10), Figure 12
(SEQ ID NO: 12), Figure 14 (SEQ ID NO: 14), Figure 16 (SEQ ID NO: 16), Figure 18 (SEQ ID NO: 18), Figure 
20 (SEQ ID NO:20), Figure 22 (SEQ ID NO:22), Figure 24 (SEQ ID NO:24), Figure 26 (SEQ ID NO:26), 
Figure 28 (SEQ ID NO:28), Figure 30 (SEQ ID NO:30), Figure 32 (SEQ ID NO:32), Figure 34 (SEQ ID 
NO:34), Figure 36 (SEQ ID NO:36), Figure 38 (SEQ ID NO:38), Figure 40 (SEQ ID NO:40), Figure 42 (SEQ 
ID NO:42), Figure 44 (SEQ ID NO:44). Figure 46 (SEQ ID NO:46). Figure 48 (SEQ ID NO:48). Figure 50 
(SEQ ID NO:50), Figure 52 (SEQ ID NO:52), Figure 54 (SEQ ID NO:54), Figure 56 (SEQ ID NO:56), Figure 
58 (SEQ ID NO:58), Figure 60 (SEQ ID NO:60). Figure 62 (SEQ ID NO:62), Figure 64 (SEQ ID NO:64), 
Figure 66 (SEQ ID NO:66), Figure 68 (SEQ ID NO:68), Figure 70 (SEQ ID NO:70), Figure 72 (SEQ ID
NO:72), Figure 74 (SEQ ID NO:74), Figure 76 (SEQ ID NO:76), Figure 78 (SEQ ID NO:78). Figure 80 (SEQ 
ID NO:80), Figure 82 (SEQ ID NO:82), Figure 84 (SEQ ID NO:84), Figure 86 (SEQ ID NO:86), Figure 88
(SEQ ID NO:88). Figure 90 (SEQ ID NO:90). Figure 92 (SEQ ID NO:92), Figure 94 (SEQ ID NO:94). Figure 
96 (SEQ ID NO:96). Figure 98 (SEQ ID NO:98), Figure 100 (SEQ ID NO: 100). Figure 102 (SEQ ID NO: 102), 
Figure 104 (SEQ ID NO:104), Figure 106 (SEQ ID NO:106), Figure 108 (SEQ ID NO:108), Figure 110 (SEQ
ID NO:110), Figure 112 (SEQ ID NO:112), Figure 114 (SEQ ID NO:114), Figure 116 (SEQ ID NO:116),
Figure 118 (SEQ ID NO:118), Figure 120 (SEQ ID NO:120), Figure 122 (SEQ ID NO:122), Figure 124 (SEQ
ID NO:124), Figure 126 (SEQ ID NO:126), Figure 128 (SEQ ID NO:128), Figure 130 (SEQ ID NO:130),
Figure 132 (SEQ ID NO: 132), Figure 134 (SEQ ID NO: 134), Figure 136 (SEQ ID NO:136), Figure 138 (SEQ 
ID NO: 138), Figure 140 (SEQ ID NO: 140). Figure 142 (SEQ ID NO: 142). Figure 144 (SEQ ID NO: 144), 
Figure 146 (SEQ ID NO: 146), Figure 148 (SEQ ID NO:148). Figure 150 (SEQ ID NO:150). Figure 152 (SEQ 
ID NO:152). Figure 154 (SEQ ID NO:154), Figure 156 (SEQ ID NO:156), Figure 158 (SEQ ID NO:158). 
Figure 160 (SEQ ID NO:160), Figure 162 (SEQ ID NO:162), Figure 164 (SEQ ID NO:164), Figure 166 (SEQ
ID NO: 166), Figure 168 (SEQ ID NO: 168). Figure 170 (SEQ ID NO: 170), Figure 172 (SEQ ID NO: 172), 
Figure 174 (SEQ ID NO: 174), Figure 176 (SEQ ID NO: 176), Figure 178 (SEQ ID NO: 178), Figure 180 (SEQ 
ID NO: 180), Figure 182 (SEQ ID NO: 182). Figure 184 (SEQ ID NO: 184). Figure 186 (SEQ ID NO: 186). 
Figure 188 (SEQ ID NO:188), Figure 190 (SEQ ID NO:190), Figure 192 (SEQ ID NO:192), Figure 194 (SEQ
ID NO:194), Figure 196 (SEQ ID NO:196), Figure 198 (SEQ ID NO:198), Figure 200 (SEQ ID NO:200),
Figure 202 (SEQ ID NO:202). Figure 204 (SEQ ID NO:204). Figure 206 (SEQ ID NO:206). Figure 208 (SEQ 
ID NO:208), Figure 210 (SEQ ID NO:210). Figure 212 (SEQ ID NO:212), Figure 214 (SEQ ID NO:214), 
Figure 216 (SEQ ID NO:216), Figure 218 (SEQ ID NO:218), Figure 220 (SEQ ID NO:220). Figure 222 (SEQ 
ID NO:222), Figure 224 (SEQ ID NO:224), Figure 226 (SEQ ID NO:226), Figure 228 (SEQ ID NO:228),
Figure 230 (SEQ ID NO:230), Figure 232 (SEQ ID NO:232), Figure 234 (SEQ ID NO:234), Figure 236 (SEQ
ID NO:236). Figure 238 (SEQ ID NO:238). Figure 240 (SEQ ID NO:240), Figure 242 (SEQ ID NO:242), 
Figure 244 (SEQ ID NO:244), Figure 246 (SEQ ID NO:246). Figure 248 (SEQ ID NO:248), Figure 250 (SEQ 
ID NO:250). Figure 252 (SEQ ID NO:252). Figure 254 (SEQ ID NO:254). Figure 256 (SEQ ID NO:256), 
Figure 258 (SEQ ID NO:258), Figure 260 (SEQ ID NO:260), Figure 262 (SEQ ID NO:262), Figure 264 (SEQ
ID NO:264), Figure 266 (SEQ ID NO:266), Figure 268 (SEQ ID NO:268), Figure 270 (SEQ ID NO:270),
Figure 272 (SEQ ID NO:272). Figure 274 (SEQ ID NO:274), Figure 276 (SEQ ID NO:276). Figure 278 (SEQ 
ID NO:278), Figure 280 (SEQ ID NO:280). Figure 282 (SEQ ID NO:282). Figure 284 (SEQ ID NO:284). 
Figure 286 (SEQ ID NO:286), Figure 288 (SEQ ID NO:288). Figure 290 (SEQ ID NO:290). Figure 292 (SEQ 
ID NO:292), Figure 294 (SEQ ID NO:294), Figure 296 (SEQ ID NO:296), Figure 298 (SEQ ID NO:298),
Figure 300 (SEQ ID NO:300), Figure 302 (SEQ ID NO:302), Figure 304 (SEQ ID NO:304), Figure 306 (SEQ
ID NO:306). Figure 308 (SEQ ID NO:308). Figure 310 (SEQ ID NO:310), Figure 312 (SEQ ID NO:312). 
Figure 314 (SEQ ID NO:314). Figure 316 (SEQ ID NO:316), Figure 318 (SEQ ID NO:318). Figure 320 (SEQ 
ID NO:320), Figure 322 (SEQ ID NO:322), Figure 324 (SEQ ID NO:324). Figure 326 (SEQ ID NO:326). 
Figure 328 (SEQ ID NO:328), Figure 330 (SEQ ID NO:330), Figure 332 (SEQ ID NO:332), Figure 334 (SEQ
ID NO:334), Figure 336 (SEQ ID NO:336), Figure 338 (SEQ ID NO:338), Figure 340 (SEQ ID NO:340),
Figure 342 (SEQ ID NO:342), Figure 344 (SEQ ID NO:344). Figure 346 (SEQ ID NO:346), Figure 348 (SEQ 
ID NO:348), Figure 350 (SEQ ID NO:350), Figure 352 (SEQ ID NO:352), Figure 354 (SEQ ID NO:354). 
Figure 356 (SEQ ID NO:356). Figure 358 (SEQ ID NO:358), Figure 360 (SEQ ID NO:360). Figure 362 (SEQ 
ID NO:362), Figure 364 (SEQ ID NO:364), Figure 366 (SEQ ID NO:366), Figure 368 (SEQ ID NO:368),
Figure 370 (SEQ ID NO:370), Figure 372 (SEQ ID NO:372), Figure 374 (SEQ ID NO:374), Figure 376 (SEQ
ID NO:376). Figure 378 (SEQ ID NO:378). Figure 380 (SEQ ID NO:380), Figure 382 (SEQ ID NO:382), 
Figure 384 (SEQ ID NO:384), Figure 386 (SEQ ID NO:386). Figure 388 (SEQ ID NO:388). Figure 390 (SEQ 
ID NO:390), Figure 392 (SEQ ID NO:392), Figure 394 (SEQ ID NO:394), Figure 396 (SEQ ID NO:396),
Figure 398 (SEQ ID NO:398), Figure 400 (SEQ ID NO:400). Figure 402 (SEQ ID NO:402). Figure 404 (SEQ 
ID NO:404). Figure 406 (SEQ ID NO:406). Figure 408 (SEQ ID NO:408), Figure 410 (SEQ ID NO:410), 
Figure 412 (SEQ ID NO:412). Figure 414 (SEQ ID NO:414), Figure 416 (SEQ ID NO:416), Figure 418 (SEQ 
ID NO:418), Figure 420 (SEQ ID NO:420). Figure 422 (SEQ ID NO:422). Figure 424 (SEQ ID NO:424), 
Figure 426 (SEQ ID NO:426), Figure 428 (SEQ ID NO:428), Figure 430 (SEQ ID NO:430), Figure 432 (SEQ
ID NO:432), Figure 434 (SEQ ID NO:434), Figure 436 (SEQ ID NO:436), Figure 438 (SEQ ID NO:438), 
Figure 440 (SEQ ID NO:440). Figure 442 (SEQ ID NO:442). Figure 444 (SEQ ID NO:444), Figure 446 (SEQ 
ID NO:446). Figure 448 (SEQ ID NO:448), Figure 450 (SEQ ID NO:450). Figure 452 (SEQ ID NO:452), 
Figure 454 (SEQ ID NO:454), Figure 456 (SEQ ID NO:456), Figure 458 (SEQ ID NO:458), Figure 460 (SEQ
ID NO:460), Figure 462 (SEQ ID NO:462), Figure 464 (SEQ ID NO:464), Figure 466 (SEQ ID NO:466),
Figure 468 (SEQ ID NO:468), Figure 470 (SEQ ID NO:470), Figure 472 (SEQ ID NO:472), Figure 474 (SEQ
ID NO:474), Figure 476 (SEQ ID NO:476), Figure 478 (SEQ ID NO:478). Figure 480 (SEQ ID NO:480), 
Figure 482 (SEQ ID NO:482), Figure 484 (SEQ ID NO:484). Figure 486 (SEQ ID NO:486). Figure 488 (SEQ 
ID NO:488). Figure 490 (SEQ ID NO:490), Figure 492 (SEQ ID NO:492), Figure 494 (SEQ ID NO:494). 
Figure 496 (SEQ ID NO:496), Figure 498 (SEQ ID NO:498), Figure 500 (SEQ ID NO:500), Figure 502 (SEQ
ID NO:502). Figure 504 (SEQ ID NO:504). Figure 506 (SEQ ID NO:506), Figure 508 (SEQ ID NO:508), 
Figure 510 (SEQ ID NO:510). Figure 512 (SEQ ID NO:512). Figure 514 (SEQ ID NO:514), Figure 516 (SEQ 
ID NO:516), Figure 518 (SEQ ID NO:518), Figure 520 (SEQ ID NO:520). Figure 522 (SEQ ID NO:522). 
Figure 524 (SEQ ID NO:524), Figure 526 (SEQ ID NO:526), Figure 528 (SEQ ID NO:528), Figure 530 (SEQ
ID NO:530), Figure 532 (SEQ ID NO:532), Figure 534 (SEQ ID NO:534), Figure 536 (SEQ ID NO:536),
Figure 538 (SEQ ID NO:538), Figure 540 (SEQ ID NO:540), Figure 542 (SEQ ID NO:542), Figure 544 (SEQ
ID NO:544), Figure 546 (SEQ ID NO:546), Figure 548 (SEQ ID NO:548) or Figure 550 (SEQ ID NO:550),
lacking its associated signal peptide;

(b) a nucleotide sequence encoding an extracellular domain of the polypeptide shown in Figure 2
(SEQ ID NO:2), Figure 4 (SEQ ID NO:4), Figure 6 (SEQ ID NO:6), Figure 8 (SEQ ID NO:8), Figure 10 (SEQ
ID NO: 10), Figure 12 (SEQ ID NO: 12). Figure 14 (SEQ ID NO: 14), Figure 16 (SEQ ID NO: 16), Figure 18 
(SEQ ID NO: 18), Figure 20 (SEQ ID NO:20), Figure 22 (SEQ ID NO:22). Figure 24 (SEQ ID NO:24). Figure 
26 (SEQ ID NO:26), Figure 28 (SEQ ID NO:28), Figure 30 (SEQ ID NO:30), Figure 32 (SEQ ID NO:32),
Figure 34 (SEQ ID NO:34), Figure 36 (SEQ ID NO:36), Figure 38 (SEQ ID NO:38), Figure 40 (SEQ ID
NO:40), Figure 42 (SEQ ID NO:42), Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID NO:46), Figure 48 (SEQ
ID NO:48), Figure 50 (SEQ ID NO:50). Figure 52 (SEQ ID NO:52), Figure 54 (SEQ ID NO:54). Figure 56 
(SEQ ID NO:56). Figure 58 (SEQ ID NO:58). Figure 60 (SEQ ID NO:60). Figure 62 (SEQ ID NO:62), Figure 
64 (SEQ ID NO:64). Figure 66 (SEQ ID NO:66). Figure 68 (SEQ ID NO:68), Figure 70 (SEQ ID NO:70), 
Figure 72 (SEQ ID NO:72), Figure 74 (SEQ ID NO:74), Figure 76 (SEQ ID NO:76), Figure 78 (SEQ ID
NO:78), Figure 80 (SEQ ID NO:80), Figure 82 (SEQ ID NO:82), Figure 84 (SEQ ID NO:84), Figure 86 (SEQ
ID NO:86), Figure 88 (SEQ ID NO:88), Figure 90 (SEQ ID NO:90), Figure 92 (SEQ ID NO:92), Figure 94 
(SEQ ID NO:94), Figure 96 (SEQ ID NO:96), Figure 98 (SEQ ID NO:98), Figure 100 (SEQ ID NO:100),
Figure 102 (SEQ ID NO:102), Figure 104 (SEQ ID NO:104), Figure 106 (SEQ ID NO:106), Figure 108 (SEQ
ID NO:108), Figure 110 (SEQ ID NO:110), Figure 112 (SEQ ID NO:112), Figure 114 (SEQ ID NO:114),
Figure 116 (SEQ ID NO:116), Figure 118 (SEQ ID NO:118), Figure 120 (SEQ ID NO:120), Figure 122 (SEQ
ID NO:122). Figure 124 (SEQ ID NO:124). Figure 126 (SEQ ID NO:126). Figure 128 (SEQ ID NO:128). 
Figure 130 (SEQ ID NO:130), Figure 132 (SEQ ID NO:132), Figure 134 (SEQ ID NO:134), Figure 136 (SEQ
ID NO: 136). Figure 138 (SEQ ID NO: 138). Figure 140 (SEQ ID NO: 140), Figure 142 (SEQ ID NO:142). 
Figure 144 (SEQ ID NO:144). Figure 146 (SEQ ID NO:146). Figure 148 (SEQ ID NO:148). Figure 150 (SEQ 
ID NO: 150). Figure 152 (SEQ ID NO: 152). Figure 154 (SEQ ID NO: 154). Figure 156 (SEQ ID NO: 156). 
Figure 158 (SEQ ID NO:158), Figure 160 (SEQ ID NO:160), Figure 162 (SEQ ID NO:162), Figure 164 (SEQ
ID NO:164), Figure 166 (SEQ ID NO:166), Figure 168 (SEQ ID NO:168), Figure 170 (SEQ ID NO:170),
Figure 172 (SEQ ID NO: 172). Figure 174 (SEQ ID NO:174). Figure 176 (SEQ ID NO:176). Figure 178 (SEQ 
ID NO:178). Figure 180 (SEQ ID NO:180). Figure 182 (SEQ ID NO:182). Figure 184 (SEQ ID NO:184). 
Figure 186 (SEQ ID NO:186), Figure 188 (SEQ ID NO:188), Figure 190 (SEQ ID NO:190), Figure 192 (SEQ
ID NO:192), Figure 194 (SEQ ID NO:194). Figure 196 (SEQ ID NO:196). Figure 198 (SEQ ID NO:198). 
Figure 200 (SEQ ID NO:200). Figure 202 (SEQ ID NO:202). Figure 204 (SEQ ID NO:204). Figure 206 (SEQ 
ID NO:206), Figure 208 (SEQ ID NO:208). Figure 210 (SEQ ID NO:210). Figure 212 (SEQ ID NO:212). 
Figure 214 (SEQ ID NO:214), Figure 216 (SEQ ID NO:216), Figure 218 (SEQ ID NO:218), Figure 220 (SEQ
ID NO:220). Figure 222 (SEQ ID NO:222). Figure 224 (SEQ ID NO:224). Figure 226 (SEQ ID NO:226). 
Figure 228 (SEQ ID NO:228). Figure 230 (SEQ ID NO:230). Figure 232 (SEQ ID NO:232). Figure 234 (SEQ 
ID NO:234). Figure 236 (SEQ ID NO:236). Figure 238 (SEQ ID NO:238). Figure 240 (SEQ ID NO:240). 
Figure 242 (SEQ ID NO:242). Figure 244 (SEQ ID NO:244). Figure 246 (SEQ ID NO:246). Figure 248 (SEQ 
ID NO:248). Figure 250 (SEQ ID NO:250). Figure 252 (SEQ ID NO:252). Figure 254 (SEQ ID NO:254). 
Figure 256 (SEQ ID NO:256). Figure 258 (SEQ ID NO:258). Figure 260 (SEQ ID NO:260). Figure 262 (SEQ 
ID NO:262). Figure 264 (SEQ ID NO:264). Figure 266 (SEQ ID NO:266). Figure 268 (SEQ ID NO:268). 
Figure 270 (SEQ ID NO:270). Figure 272 (SEQ ID NO:272). Figure 274 (SEQ ID NO:274). Figure 276 (SEQ 
ID NO:276), Figure 278 (SEQ ID NO:278). Figure 280 (SEQ ID NO:280), Figure 282 (SEQ ID NO:282). 
Figure 284 (SEQ ID NO:284), Figure 286 (SEQ ID NO:286). Figure 288 (SEQ ID NO:288). Figure 290 (SEQ 
ID NO:290), Figure 292 (SEQ ID NO:292), Figure 294 (SEQ ID NO:294), Figure 296 (SEQ ID NO:296). 
Figure 298 (SEQ ID NO:298). Figure 300 (SEQ ID NO:300). Figure 302 (SEQ ID NO:302). Figure 304 (SEQ 
ID NO:304). Figure 306 (SEQ ID NO:306). Figure 308 (SEQ ID NO:308). Figure 310 (SEQ ID NO:310). 
Figure 312 (SEQ ID NO:312). Figure 314 (SEQ ID NO:314). Figure 316 (SEQ ID NO:316). Figure 318 (SEQ 
ID NO:318). Figure 320 (SEQ ID NO:320). Figure 322 (SEQ ID NO:322). Figure 324 (SEQ ID NO:324). 
Figure 326 (SEQ ID NO:326). Figure 328 (SEQ ID NO:328). Figure 330 (SEQ ID NO:330). Figure 332 (SEQ 
ID NO:332), Figure 334 (SEQ ID NO:334), Figure 336 (SEQ ID NO:336), Figure 338 (SEQ ID NO:338),
Figure 340 (SEQ ID NO:340). Figure 342 (SEQ ID NO:342). Figure 344 (SEQ ID NO:344). Figure 346 (SEQ 
ID NO:346), Figure 348 (SEQ ID NO:348), Figure 350 (SEQ ID NO:350), Figure 352 (SEQ ID NO:352),
Figure 354 (SEQ ID NO:354), Figure 356 (SEQ ID NO:356), Figure 358 (SEQ ID NO:358), Figure 360 (SEQ
ID NO:360), Figure 362 (SEQ ID NO:362), Figure 364 (SEQ ID NO:364), Figure 366 (SEQ ID NO:366),
Figure 368 (SEQ ID NO:368), Figure 370 (SEQ ID NO:370), Figure 372 (SEQ ID NO:372), Figure 374 (SEQ
ID NO:374), Figure 376 (SEQ ID NO:376), Figure 378 (SEQ ID NO:378), Figure 380 (SEQ ID NO:380),
Figure 382 (SEQ ID NO:382), Figure 384 (SEQ ID NO:384). Figure 386 (SEQ ID NO:386). Figure 388 (SEQ 
ID NO:388). Figure 390 (SEQ ID NO:390), Figure 392 (SEQ ID NO:392). Figure 394 (SEQ ID NO:394), 
Figure 396 (SEQ ID NO:396). Figure 398 (SEQ ID NO:398), Figure 400 (SEQ ID NO:400), Figure 402 (SEQ 
ID NO:402), Figure 404 (SEQ ID NO:404), Figure 406 (SEQ ID NO:406), Figure 408 (SEQ ID NO:408), 
Figure 410 (SEQ ID NO:410), Figure 412 (SEQ ID NO:412). Figure 414 (SEQ ID NO:414). Figure 416 (SEQ 
ID NO:416), Figure 418 (SEQ ID NO:418). Figure 420 (SEQ ID NO:420). Figure 422 (SEQ ID NO:422). 
Figure 424 (SEQ ID NO:424). Figure 426 (SEQ ID NO:426), Figure 428 (SEQ ID NO:428), Figure 430 (SEQ 
ID NO:430), Figure 432 (SEQ ID NO:432). Figure 434 (SEQ ID NO:434), Figure 436 (SEQ ID NO:436), 
Figure 438 (SEQ ID NO:438), Figure 440 (SEQ ID NO:440), Figure 442 (SEQ ID NO:442). Figure 444 (SEQ 
ID NO:444), Figure 446 (SEQ ID NO:446). Figure 448 (SEQ ID NO:448). Figure 450 (SEQ ID NO:450), 
Figure 452 (SEQ ID NO:452), Figure 454 (SEQ ID NO:454), Figure 456 (SEQ ID NO:456), Figure 458 (SEQ 
ID NO:458). Figure 460 (SEQ ID NO:460). Figure 462 (SEQ ID NO:462). Figure 464 (SEQ ID NO:464), 
Figure 466 (SEQ ID NO:466). Figure 468 (SEQ ID NO:468), Figure 470 (SEQ ID NO:470). Figure 472 (SEQ 
ID NO:472), Figure 474 (SEQ ID NO:474). Figure 476 (SEQ ID NO:476). Figure 478 (SEQ ID NO:478), 
Figure 480 (SEQ ID NO:480), Figure 482 (SEQ ID NO:482), Figure 484 (SEQ ID NO:484), Figure 486 (SEQ
ID NO:486), Figure 488 (SEQ ID NO:488), Figure 490 (SEQ ID NO:490), Figure 492 (SEQ ID NO:492),
Figure 494 (SEQ ID NO:494), Figure 496 (SEQ ID NO:496). Figure 498 (SEQ ID NO:498), Figure 500 (SEQ 
ID NO:500). Figure 502 (SEQ ID NO:502). Figure 504 (SEQ ID NO:504). Figure 506 (SEQ ID NO:506), 
Figure 508 (SEQ ID NO:508). Figure 510 (SEQ ID NO:510), Figure 512 (SEQ ID NO:512). Figure 514 (SEQ 
ID NO:514). Figure 516 (SEQ ID NO:516), Figure 518 (SEQ ID NO:518), Figure 520 (SEQ ID NO:520), 
Figure 522 (SEQ ID NO:522), Figure 524 (SEQ ID NO:524). Figure 526 (SEQ ID NO:526), Figure 528 (SEQ 
ID NO:528). Figure 530 (SEQ ID NO:530), Figure 532 (SEQ ID NO:532), Figure 534 (SEQ ID NO:534), 
Figure 536 (SEQ ID NO:536). Figure 538 (SEQ ID NO:538). Figure 540 (SEQ ID NO:540), Figure 542 (SEQ 
ID NO:542). Figure 544 (SEQ ID NO:544), Figure 546 (SEQ ID NO:546), Figure 548 (SEQ ID NO:548) or 
Figure 550 (SEQ ID NO:550), with its associated signal peptide; or

(c) a nucleotide sequence encoding an extracellular domain of the polypeptide shown in Figure 2
(SEQ ID NO:2), Figure 4 (SEQ ID NO:4), Figure 6 (SEQ ID NO:6), Figure 8 (SEQ ID NO:8), Figure 10 (SEQ
ID NO: 10), Figure 12 (SEQ ID NO: 12). Figure 14 (SEQ ID NO: 14), Figure 16 (SEQ ID NO: 16). Figure 18 
(SEQ ID NO: 18), Figure 20 (SEQ ID NO:20). Figure 22 (SEQ ID NO:22), Figure 24 (SEQ ID NO:24), Figure 
26 (SEQ ID NO:26). Figure 28 (SEQ ID NO:28), Figure 30 (SEQ ID NO:30). Figure 32 (SEQ ID NO:32). 
Figure 34 (SEQ ID NO:34), Figure 36 (SEQ ID NO:36), Figure 38 (SEQ ID NO:38), Figure 40 (SEQ ID 
NO:40), Figure 42 (SEQ ID NO:42), Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID NO:46), Figure 48 (SEQ
ID NO:48). Figure 50 (SEQ ID NO:50), Figure 52 (SEQ ID NO:52), Figure 54 (SEQ ID NO:54). Figure 56 
(SEQ ID NO:56), Figure 58 (SEQ ID NO:58), Figure 60 (SEQ ID NO:60). Figure 62 (SEQ ID NO:62). Figure 
64 (SEQ ID NO:64), Figure 66 (SEQ ID NO:66), Figure 68 (SEQ ID NO:68), Figure 70 (SEQ ID NO:70). 
Figure 72 (SEQ ID NO:72), Figure 74 (SEQ ID NO:74), Figure 76 (SEQ ID NO:76), Figure 78 (SEQ ID
NO:78), Figure 80 (SEQ ID NO:80), Figure 82 (SEQ ID NO:82), Figure 84 (SEQ ID NO:84), Figure 86 (SEQ
ID NO:86). Figure 88 (SEQ ID NO:88). Figure 90 (SEQ ID NO:90). Figure 92 (SEQ ID NO:92). Figure 94 
(SEQ ID NO:94). Figure 96 (SEQ ID NO:96). Figure 98 (SEQ ID NO:98). Figure 100 (SEQ ID NO:100), 
Figure 102 (SEQ ID NO:102). Figure 104 (SEQ ID NO:104). Figure 106 (SEQ ID NO:106), Figure 108 (SEQ 
ID NO:108), Figure 110 (SEQ ID NO:110), Figure 112 (SEQ ID NO:112), Figure 114 (SEQ ID NO:114),
Figure 116 (SEQ ID NO:116), Figure 118 (SEQ ID NO:118), Figure 120 (SEQ ID NO:120), Figure 122 (SEQ
ID NO:122). Figure 124 (SEQ ID NO:124), Figure 126 (SEQ ID NO:126). Figure 128 (SEQ ID NO:128). 
Figure 130 (SEQ ID NO: 130). Figure 132 (SEQ ID NO: 132), Figure 134 (SEQ ID NO: 134), Figure 136 (SEQ 
ID NO: 136). Figure 138 (SEQ ID NO: 138), Figure 140 (SEQ ID NO: 140). Figure 142 (SEQ ID NO: 142). 
Figure 144 (SEQ ID NO: 144). Figure 146 (SEQ ID NO: 146). Figure 148 (SEQ ID NO: 148), Figure 150 (SEQ 
ID NO: 150). Figure 152 (SEQ ID NO: 152). Figure 154 (SEQ ID NO: 154). Figure 156 (SEQ ID NO: 156). 
Figure 158 (SEQ ID NO: 158). Figure 160 (SEQ ID NO:160). Figure 162 (SEQ ID NO: 162). Figure 164 (SEQ 
ID NO: 164). Figure 166 (SEQ ID NO:166). Figure 168 (SEQ ID NO:168). Figure 170 (SEQ ID NO:170). 
Figure 172 (SEQ ID NO:172). Figure 174 (SEQ ID NO:174), Figure 176 (SEQ ID NO:176). Figure 178 (SEQ 
ID NO:178), Figure 180 (SEQ ID NO:180), Figure 182 (SEQ ID NO:182), Figure 184 (SEQ ID NO:184),
Figure 186 (SEQ ID NO: 186). Figure 188 (SEQ ID NO:188). Figure 190 (SEQ ID NO: 190). Figure 192 (SEQ 
ID NO: 192). Figure 194 (SEQ ID NO: 194). Figure 196 (SEQ ID NO: 196). Figure 198 (SEQ ID NO: 198). 
Figure 200 (SEQ ID NO:200). Figure 202 (SEQ ID NO:202). Figure 204 (SEQ ID NO:204). Figure 206 (SEQ 
ID NO:206). Figure 208 (SEQ ID NO:208). Figure 210 (SEQ ID NO:210), Figure 212 (SEQ ID NO:212). 
Figure 214 (SEQ ID NO:214), Figure 216 (SEQ ID NO:216), Figure 218 (SEQ ID NO:218), Figure 220 (SEQ
ID NO:220), Figure 222 (SEQ ID NO:222), Figure 224 (SEQ ID NO:224), Figure 226 (SEQ ID NO:226),
Figure 228 (SEQ ID NO:228), Figure 230 (SEQ ID NO:230). Figure 232 (SEQ ID NO:232). Figure 234 (SEQ 
ID NO:234). Figure 236 (SEQ ID NO:236). Figure 238 (SEQ ID NO:238). Figure 240 (SEQ ID NO:240). 
Figure 242 (SEQ ID NO:242). Figure 244 (SEQ ID NO:244). Figure 246 (SEQ ID NO:246). Figure 248 (SEQ 
ID NO:248). Figure 250 (SEQ ID NO:250). Figure 252 (SEQ ID NO:252), Figure 254 (SEQ ID NO:254). 
Figure 256 (SEQ ID NO:256). Figure 258 (SEQ ID NO:258), Figure 260 (SEQ ID NO:260), Figure 262 (SEQ 
ID NO:262), Figure 264 (SEQ ID NO:264). Figure 266 (SEQ ID NO:266). Figure 268 (SEQ ID NO:268), 
Figure 270 (SEQ ID NO:270), Figure 272 (SEQ ID NO:272). Figure 274 (SEQ ID NO:274). Figure 276 (SEQ 
ID NO:276). Figure 278 (SEQ ID NO:278). Figure 280 (SEQ ID NO:280). Figure 282 (SEQ ID NO:282). 
Figure 284 (SEQ ID NO:284). Figure 286 (SEQ ID NO:286). Figure 288 (SEQ ID NO:288). Figure 290 (SEQ 
ID NO:290). Figure 292 (SEQ ID NO:292). Figure 294 (SEQ ID NO:294). Figure 296 (SEQ ID NO:296), 
Figure 298 (SEQ ID NO:298). Figure 300 (SEQ ID NO:300). Figure 302 (SEQ ID NO:302). Figure 304 (SEQ 
ID NO:304), Figure 306 (SEQ ID NO:306). Figure 308 (SEQ ID NO:308). Figure 310 (SEQ ID NO:310). 
Figure 312 (SEQ ID NO:312), Figure 314 (SEQ ID NO:314), Figure 316 (SEQ ID NO:316), Figure 318 (SEQ
ID NO:318). Figure 320 (SEQ ID NO:320). Figure 322 (SEQ ID NO:322). Figure 324 (SEQ ID NO:324). 
Figure 326 (SEQ ID NO:326). Figure 328 (SEQ ID NO:328). Figure 330 (SEQ ID NO:330). Figure 332 (SEQ 
ID NO:332). Figure 334 (SEQ ID NO:334). Figure 336 (SEQ ID NO:336). Figure 338 (SEQ ID NO:338). 
Figure 340 (SEQ ID NO:340), Figure 342 (SEQ ID NO:342), Figure 344 (SEQ ID NO:344), Figure 346 (SEQ
ID NO:346), Figure 348 (SEQ ID NO:348), Figure 350 (SEQ ID NO:350), Figure 352 (SEQ ID NO:352),
Figure 354 (SEQ ID NO:354), Figure 356 (SEQ ID NO:356). Figure 358 (SEQ ID NO:358), Figure 360 (SEQ 
ID NO:360), Figure 362 (SEQ ID NO:362), Figure 364 (SEQ ID NO:364). Figure 366 (SEQ ID NO:366). 
Figure 368 (SEQ ID NO:368), Figure 370 (SEQ ID NO:370). Figure 372 (SEQ ID NO:372), Figure 374 (SEQ 
ID NO:374). Figure 376 (SEQ ID NO:376), Figure 378 (SEQ ID NO:378), Figure 380 (SEQ ID NO:380). 
Figure 382 (SEQ ID NO:382), Figure 384 (SEQ ID NO:384), Figure 386 (SEQ ID NO:386), Figure 388 (SEQ
ID NO:388). Figure 390 (SEQ ID NO:390). Figure 392 (SEQ ID NO:392), Figure 394 (SEQ ID NO:394), 
Figure 396 (SEQ ID NO:396), Figure 398 (SEQ ID NO:398), Figure 400 (SEQ ID NO:400). Figure 402 (SEQ 
ID NO:402). Figure 404 (SEQ ID NO:404). Figure 406 (SEQ ID NO:406), Figure 408 (SEQ ID NO:408), 
Figure 410 (SEQ ID NO:410), Figure 412 (SEQ ID NO:412). Figure 414 (SEQ ID NO:414), Figure 416 (SEQ 

ID NO:416), Figure 418 (SEQ ID NO:418), Figure 420 (SEQ ID NO:420), Figure 422 (SEQ ID NO:422),
Figure 424 (SEQ ID NO:424), Figure 426 (SEQ ID NO:426), Figure 428 (SEQ ID NO:428), Figure 430 (SEQ 
ID NO:430), Figure 432 (SEQ ID NO:432), Figure 434 (SEQ ID NO:434). Figure 436 (SEQ ID NO:436), 
Figure 438 (SEQ ID NO:438). Figure 440 (SEQ ID NO:440), Figure 442 (SEQ ID NO:442). Figure 444 (SEQ 
ID NO:444), Figure 446 (SEQ ID NO:446), Figure 448 (SEQ ID NO:448). Figure 450 (SEQ ID NO:450), 

Figure 452 (SEQ ID NO:452), Figure 454 (SEQ ID NO:454), Figure 456 (SEQ ID NO:456), Figure 458 (SEQ
ID NO:458). Figure 460 (SEQ ID NO:460), Figure 462 (SEQ ID NO:462), Figure 464 (SEQ ID NO:464), 
Figure 466 (SEQ ID NO:466), Figure 468 (SEQ ID NO:468). Figure 470 (SEQ ID NO:470), Figure 472 (SEQ 
ID NO:472), Figure 474 (SEQ ID NO:474), Figure 476 (SEQ ID NO:476), Figure 478 (SEQ ID NO:478),
Figure 480 (SEQ ID NO:480), Figure 482 (SEQ ID NO:482), Figure 484 (SEQ ID NO:484), Figure 486 (SEQ
ID NO:486), Figure 488 (SEQ ID NO:488), Figure 490 (SEQ ID NO:490), Figure 492 (SEQ ID NO:492),
Figure 494 (SEQ ID NO:494), Figure 496 (SEQ ID NO:496). Figure 498 (SEQ ID NO:498). Figure 500 (SEQ 
ID NO:500), Figure 502 (SEQ ID NO:502). Figure 504 (SEQ ID NO:504), Figure 506 (SEQ ID NO:506), 
Figure 508 (SEQ ID NO:508). Figure 510 (SEQ ID NO:510), Figure 512 (SEQ ID NO:512), Figure 514 (SEQ 
ID NO:514), Figure 516 (SEQ ID NO:516), Figure 518 (SEQ ID NO:518), Figure 520 (SEQ ID NO:520),
Figure 522 (SEQ ID NO:522), Figure 524 (SEQ ID NO:524), Figure 526 (SEQ ID NO:526), Figure 528 (SEQ
ID NO:528), Figure 530 (SEQ ID NO:530), Figure 532 (SEQ ID NO:532). Figure 534 (SEQ ID NO:534), 
Figure 536 (SEQ ID NO:536), Figure 538 (SEQ ID NO:538), Figure 540 (SEQ ID NO:540), Figure 542 (SEQ 
ID NO:542), Figure 544 (SEQ ID NO:544), Figure 546 (SEQ ID NO:546), Figure 548 (SEQ ID NO:548) or
Figure 550 (SEQ ID NO:550), lacking its associated signal peptide.

20. An isolated polypeptide having at least 80% amino acid sequence identity to: 
(a) an amino acid sequence of the polypeptide shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ
ID NO:4), Figure 6 (SEQ ID NO:6), Figure 8 (SEQ ID NO:8), Figure 10 (SEQ ID NO:10), Figure 12 (SEQ
ID NO:12), Figure 14 (SEQ ID NO:14), Figure 16 (SEQ ID NO:16), Figure 18 (SEQ ID NO:18), Figure 20
(SEQ ID NO:20), Figure 22 (SEQ ID NO:22), Figure 24 (SEQ ID NO:24), Figure 26 (SEQ ID NO:26), Figure
28 (SEQ ID NO:28). Figure 30 (SEQ ID NO:30), Figure 32 (SEQ ID NO:32), Figure 34 (SEQ ID NO:34), 
Figure 36 (SEQ ID NO:36), Figure 38 (SEQ ID NO:38), Figure 40 (SEQ ID NO:40), Figure 42 (SEQ ID
NO:42), Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID NO:46), Figure 48 (SEQ ID NO:48), Figure 50 (SEQ
ID NO:50). Figure 52 (SEQ ID NO:52). Figure 54 (SEQ ID NO:54). Figure 56 (SEQ ID NO:56). Figure 58 
(SEQ ID NO:58). Figure 60 (SEQ ID NO:60). Figure 62 (SEQ ID NO:62). Figure 64 (SEQ ID NO:64). Figure 
66 (SEQ ID NO:66), Figure 68 (SEQ ID NO:68), Figure 70 (SEQ ID NO:70), Figure 72 (SEQ ID NO:72),
Figure 74 (SEQ ID NO:74). Figure 76 (SEQ ID NO:76). Figure 78 (SEQ ID NO:78). Figure 80 (SEQ ID 

NO:80), Figure 82 (SEQ ID NO:82), Figure 84 (SEQ ID NO:84), Figure 86 (SEQ ID NO:86), Figure 88 (SEQ
ID NO:88). Figure 90 (SEQ ID NO:90). Figure 92 (SEQ ID NO:92). Figure 94 (SEQ ID NO:94). Figure 96 
(SEQ ID NO:96), Figure 98 (SEQ ID NO:98), Figure 100 (SEQ ID NO:100), Figure 102 (SEQ ID NO:102),
Figure 104 (SEQ ID NO:104), Figure 106 (SEQ ID NO:106), Figure 108 (SEQ ID NO:108), Figure 110 (SEQ
ID NO:110), Figure 112 (SEQ ID NO:112), Figure 114 (SEQ ID NO:114), Figure 116 (SEQ ID NO:116),
Figure 118 (SEQ ID NO:118), Figure 120 (SEQ ID NO:120), Figure 122 (SEQ ID NO:122), Figure 124 (SEQ
ID NO:124). Figure 126 (SEQ ID NO:126), Figure 128 (SEQ ID NO:128). Figure 130 (SEQ ID NO: 130). 
Figure 132 (SEQ ID NO:132). Figure 134 (SEQ ID NO:134). Figure 136 (SEQ ID NO:136). Figure 138 (SEQ 
ID NO:138). Figure 140 (SEQ ID NO:140). Figure 142 (SEQ ID NO:142). Figure 144 (SEQ ID NO:144), 
Figure 146 (SEQ ID NO: 146). Figure 148 (SEQ ID NO: 148). Figure 150 (SEQ ID NO: 150). Figure 152 (SEQ 
ID NO:152). Figure 154 (SEQ ID NO:154). Figure 156 (SEQ ID NO:156). Figure 158 (SEQ ID NO:158). 
Figure 160 (SEQ ID NO: 160). Figure 162 (SEQ ID NO: 162). Figure 164 (SEQ ID NO: 164). Figure 166 (SEQ 
ID NO:166). Figure 168 (SEQ ID NO:168). Figure 170 (SEQ ID NO:170). Figure 172 (SEQ ID NO:172). 
Figure 174 (SEQ ID NO:174). Figure 176 (SEQ ID NO:176). Figure 178 (SEQ ID NO:178). Figure 180 (SEQ 
ID NO:180). Figure 182 (SEQ ID NO:182). Figure 184 (SEQ ID NO:184). Figure 186 (SEQ ID NO:186). 
Figure 188 (SEQ ID NO:188), Figure 190 (SEQ ID NO:190), Figure 192 (SEQ ID NO:192), Figure 194 (SEQ
ID NO:194). Figure 196 (SEQ ID NO:196). Figure 198 (SEQ ID NO:198). Figure 200 (SEQ ID NO:200). 
Figure 202 (SEQ ID NO:202). Figure 204 (SEQ ID NO:204). Figure 206 (SEQ ID NO:206). Figure 208 (SEQ 
ID NO:208). Figure 210 (SEQ ID NO:210). Figure 212 (SEQ ID NO:212), Figure 214 (SEQ ID NO:214). 
Figure 216 (SEQ ID NO:216). Figure 218 (SEQ ID NO:218). Figure 220 (SEQ ID NO:220). Figure 222 (SEQ 
ID NO:222). Figure 224 (SEQ ID NO:224). Figure 226 (SEQ ID NO:226), Figure 228 (SEQ ID NO:228). 
Figure 230 (SEQ ID NO:230). Figure 232 (SEQ ID NO:232). Figure 234 (SEQ ID NO:234). Figure 236 (SEQ 
ID NO:236). Figure 238 (SEQ ID NO:238). Figure 240 (SEQ ID NO:240). Figure 242 (SEQ ID NO:242). 
Figure 244 (SEQ ID NO:244). Figure 246 (SEQ ID NO:246). Figure 248 (SEQ ID NO:248). Figure 250 (SEQ 
ID NO:250). Figure 252 (SEQ ID NO:252). Figure 254 (SEQ ID NO:254). Figure 256 (SEQ ID NO:256). 
Figure 258 (SEQ ID NO:258). Figure 260 (SEQ ID NO:260). Figure 262 (SEQ ID NO:262), Figure 264 (SEQ 
ID NO:264). Figure 266 (SEQ ID NO:266). Figure 268 (SEQ ID NO:268). Figure 270 (SEQ ID NO:270). 
Figure 272 (SEQ ID NO:272). Figure 274 (SEQ ID NO:274). Figure 276 (SEQ ID NO:276). Figure 278 (SEQ 
ID NO:278). Figure 280 (SEQ ID NO:280). Figure 282 (SEQ ID NO:282). Figure 284 (SEQ ID NO:284). 
Figure 286 (SEQ ID NO:286). Figure 288 (SEQ ID NO:288). Figure 290 (SEQ ID NO:290). Figure 292 (SEQ 
ID NO:292). Figure 294 (SEQ ID NO:294). Figure 296 (SEQ ID NO:296). Figure 298 (SEQ ID NO:298). 
Figure 300 (SEQ ID NO:300). Figure 302 (SEQ ID NO:302). Figure 304 (SEQ ID NO:304). Figure 306 (SEQ 
ID NO:306), Figure 308 (SEQ ID NO:308), Figure 310 (SEQ ID NO:310), Figure 312 (SEQ ID NO:312),
Figure 314 (SEQ ID NO:314), Figure 316 (SEQ ID NO:316), Figure 318 (SEQ ID NO:318), Figure 320 (SEQ
ID NO:320), Figure 322 (SEQ ID NO:322), Figure 324 (SEQ ID NO:324). Figure 326 (SEQ ID NO:326). 
Figure 328 (SEQ ID NO:328), Figure 330 (SEQ ID NO:330), Figure 332 (SEQ ID NO:332), Figure 334 (SEQ 
ID NO:334), Figure 336 (SEQ ID NO:336). Figure 338 (SEQ ID NO:338), Figure 340 (SEQ ID NO:340), 
Figure 342 (SEQ ID NO:342). Figure 344 (SEQ ID NO:344). Figure 346 (SEQ ID NO:346), Figure 348 (SEQ 
ID NO:348), Figure 350 (SEQ ID NO:350), Figure 352 (SEQ ID NO:352), Figure 354 (SEQ ID NO:354),
Figure 356 (SEQ ID NO:356). Figure 358 (SEQ ID NO:358). Figure 360 (SEQ ID NO:360). Figure 362 (SEQ 
ID NO:362), Figure 364 (SEQ ID NO:364), Figure 366 (SEQ ID NO:366), Figure 368 (SEQ ID NO:368). 
Figure 370 (SEQ ID NO:370). Figure 372 (SEQ ID NO:372), Figure 374 (SEQ ID NO:374). Figure 376 (SEQ 
ID NO:376). Figure 378 (SEQ ID NO:378), Figure 380 (SEQ ID NO:380), Figure 382 (SEQ ID NO:382), 

Figure 384 (SEQ ID NO:384), Figure 386 (SEQ ID NO:386), Figure 388 (SEQ ID NO:388), Figure 390 (SEQ
ID NO:390), Figure 392 (SEQ ID NO:392). Figure 394 (SEQ ID NO:394). Figure 396 (SEQ ID NO:396), 
Figure 398 (SEQ ID NO:398), Figure 400 (SEQ ID NO:400). Figure 402 (SEQ ID NO:402), Figure 404 (SEQ 
ID NO:404). Figure 406 (SEQ ID NO:406), Figure 408 (SEQ ID NO:408), Figure 410 (SEQ ID NO:410). 
Figure 412 (SEQ ID NO:412). Figure 414 (SEQ ID NO:414), Figure 416 (SEQ ID NO:416). Figure 418 (SEQ 

ID NO:418), Figure 420 (SEQ ID NO:420), Figure 422 (SEQ ID NO:422), Figure 424 (SEQ ID NO:424),
Figure 426 (SEQ ID NO:426). Figure 428 (SEQ ID NO:428). Figure 430 (SEQ ID NO:430), Figure 432 (SEQ 
ID NO:432), Figure 434 (SEQ ID NO:434), Figure 436 (SEQ ID NO:436), Figure 438 (SEQ ID NO:438). 
Figure 440 (SEQ ID NO:440). Figure 442 (SEQ ID NO:442), Figure 444 (SEQ ID NO:444), Figure 446 (SEQ 
ID NO:446). Figure 448 (SEQ ID NO:448), Figure 450 (SEQ ID NO:450), Figure 452 (SEQ ID NO:452), 

Figure 454 (SEQ ID NO:454), Figure 456 (SEQ ID NO:456), Figure 458 (SEQ ID NO:458), Figure 460 (SEQ
ID NO:460). Figure 462 (SEQ ID NO:462). Figure 464 (SEQ ID NO:464), Figure 466 (SEQ ID NO:466), 
Figure 468 (SEQ ID NO:468), Figure 470 (SEQ ID NO:470). Figure 472 (SEQ ID NO:472), Figure 474 (SEQ 
ID NO:474), Figure 476 (SEQ ID NO:476), Figure 478 (SEQ ID NO:478). Figure 480 (SEQ ID NO:480). 
Figure 482 (SEQ ID NO:482), Figure 484 (SEQ ID NO:484), Figure 486 (SEQ ID NO:486). Figure 488 (SEQ 

ID NO:488), Figure 490 (SEQ ID NO:490), Figure 492 (SEQ ID NO:492), Figure 494 (SEQ ID NO:494),
Figure 496 (SEQ ID NO:496), Figure 498 (SEQ ID NO:498). Figure 500 (SEQ ID NO:500), Figure 502 (SEQ 
ID NO:502), Figure 504 (SEQ ID NO:504). Figure 506 (SEQ ID NO:506). Figure 508 (SEQ ID NO:508). 
Figure 510 (SEQ ID NO:510), Figure 512 (SEQ ID NO:512), Figure 514 (SEQ ID NO:514), Figure 516 (SEQ
ID NO:516). Figure 518 (SEQ ID NO:518). Figure 520 (SEQ ID NO:520), Figure 522 (SEQ ID NO:522). 

Figure 524 (SEQ ID NO:524), Figure 526 (SEQ ID NO:526), Figure 528 (SEQ ID NO:528), Figure 530 (SEQ
ID NO:530), Figure 532 (SEQ ID NO:532), Figure 534 (SEQ ID NO:534), Figure 536 (SEQ ID NO:536),
Figure 538 (SEQ ID NO:538), Figure 540 (SEQ ID NO:540), Figure 542 (SEQ ID NO:542). Figure 544 (SEQ 
ID NO:544), Figure 546 (SEQ ID NO:546), Figure 548 (SEQ ID NO:548) or Figure 550 (SEQ ID NO:550), 
lacking its associated signal peptide; 

(b) an amino acid sequence of an extracellular domain of the polypeptide shown in Figure 2 (SEQ
ID NO:2), Figure 4 (SEQ ID NO:4), Figure 6 (SEQ ID NO:6), Figure 8 (SEQ ID NO:8), Figure 10 (SEQ ID
NO:10), Figure 12 (SEQ ID NO:12), Figure 14 (SEQ ID NO:14), Figure 16 (SEQ ID NO:16), Figure 18 (SEQ
ID NO:18), Figure 20 (SEQ ID NO:20), Figure 22 (SEQ ID NO:22), Figure 24 (SEQ ID NO:24), Figure 26
(SEQ ID NO:26), Figure 28 (SEQ ID NO:28), Figure 30 (SEQ ID NO:30), Figure 32 (SEQ ID NO:32), Figure
34 (SEQ ID NO:34), Figure 36 (SEQ ID NO:36), Figure 38 (SEQ ID NO:38), Figure 40 (SEQ ID NO:40). 
Figure 42 (SEQ ID NO:42). Figure 44 (SEQ ID NO:44). Figure 46 (SEQ ID NO:46). Figure 48 (SEQ ID 
NO:48). Figure 50 (SEQ ID NO:50). Figure 52 (SEQ ID NO:52). Figure 54 (SEQ ID NO:54). Figure 56 (SEQ 
ID NO:56). Figure 58 (SEQ ID NO:58). Figure 60 (SEQ ID NO:60). Figure 62 (SEQ ID NO:62). Figure 64 
(SEQ ID NO:64), Figure 66 (SEQ ID NO:66), Figure 68 (SEQ ID NO:68), Figure 70 (SEQ ID NO:70), Figure
72 (SEQ ID NO:72). Figure 74 (SEQ ID NO:74). Figure 76 (SEQ ID NO:76). Figure 78 (SEQ ID NO:78). 
Figure 80 (SEQ ID NO:80). Figure 82 (SEQ ID NO:82). Figure 84 (SEQ ID NO:84). Figure 86 (SEQ ID 
NO:86). Figure 88 (SEQ ID NO:88). Figure 90 (SEQ ID NO:90). Figure 92 (SEQ ID NO:92). Figure 94 (SEQ 
ID NO:94), Figure 96 (SEQ ID NO:96), Figure 98 (SEQ ID NO:98), Figure 100 (SEQ ID NO:100), Figure
102 (SEQ ID NO:102), Figure 104 (SEQ ID NO:104). Figure 106 (SEQ ID NO:106). Figure 108 (SEQ ID 
NO:108), Figure 110 (SEQ ID NO: 110). Figure 112 (SEQ ID NO: 112). Figure 114 (SEQ ID NO: 114). Figure 
116 (SEQ ID NO: 116). Figure 118 (SEQ ID NO: 118). Figure 120 (SEQ ID NO: 120). Figure 122 (SEQ ID 
NO: 122). Figure 124 (SEQ ID NO: 124). Figure 126 (SEQ ID NO: 126). Figure 128 (SEQ ID NO: 128). Figure 
130 (SEQ ID NO:130). Figure 132 (SEQ ID NO:132). Figure 134 (SEQ ID NO:134). Figure 136 (SEQ ID 
NO: 136). Figure 138 (SEQ ID NO: 138). Figure 140 (SEQ ID NO: 140). Figure 142 (SEQ ID NO: 142). Figure 
144 (SEQ ID NO:144). Figure 146 (SEQ ID NO:146). Figure 148 (SEQ ID NO:148). Figure 150 (SEQ ID 
NO:150), Figure 152 (SEQ ID NO:152), Figure 154 (SEQ ID NO:154), Figure 156 (SEQ ID NO:156), Figure
158 (SEQ ID NO:158). Figure 160 (SEQ ID NO: 160). Figure 162 (SEQ ID NO:162). Figure 164 (SEQ ID 
NO:164). Figure 166 (SEQ ID NO:166). Figure 168 (SEQ ID NO:168). Figure 170 (SEQ ID NO: 170). Figure 
172 (SEQ ID NO:172). Figure 174 (SEQ ID NO:174). Figure 176 (SEQ ID NO:176). Figure 178 (SEQ ID 
NO: 178), Figure 180 (SEQ ID NO: 180). Figure 182 (SEQ ID NO: 182). Figure 184 (SEQ ID NO: 184). Figure 
186 (SEQ ID NO:186). Figure 188 (SEQ ID NO:188), Figure 190 (SEQ ID NO:190). Figure 192 (SEQ ID 
NO: 192). Figure 194 (SEQ ID NO: 194). Figure 196 (SEQ ID NO: 196). Figure 198 (SEQ ID NO: 198). Figure 
200 (SEQ ID NO:200), Figure 202 (SEQ ID NO:202). Figure 204 (SEQ ID NO:204). Figure 206 (SEQ ID 
NO:206). Figure 208 (SEQ ID NO:208), Figure 210 (SEQ ID NO:210). Figure 212 (SEQ ID NO:212). Figure 
214 (SEQ ID NO:214). Figure 216 (SEQ ID NO:216). Figure 218 (SEQ ID NO:218), Figure 220 (SEQ ID 
NO:220). Figure 222 (SEQ ID NO:222). Figure 224 (SEQ ID NO:224). Figure 226 (SEQ ID NO:226). Figure 
228 (SEQ ID NO:228). Figure 230 (SEQ ID NO:230). Figure 232 (SEQ ID NO:232). Figure 234 (SEQ ID 
NO:234). Figure 236 (SEQ ID NO:236). Figure 238 (SEQ ID NO:238). Figure 240 (SEQ ID NO:240). Figure 
242 (SEQ ID NO:242). Figure 244 (SEQ ID NO:244). Figure 246 (SEQ ID NO:246). Figure 248 (SEQ ID 
NO:248). Figure 250 (SEQ ID NO:250). Figure 252 (SEQ ID NO:252). Figure 254 (SEQ ID NO:254). Figure 
256 (SEQ ID NO:256). Figure 258 (SEQ ID NO:258), Figure 260 (SEQ ID NO:260). Figure 262 (SEQ ID 
NO:262). Figure 264 (SEQ ID NO:264). Figure 266 (SEQ ID NO:266). Figure 268 (SEQ ID NO:268). Figure 
270 (SEQ ID NO:270). Figure 272 (SEQ ID NO:272). Figure 274 (SEQ ID NO:274). Figure 276 (SEQ ID 
NO:276), Figure 278 (SEQ ID NO:278). Figure 280 (SEQ ID NO:280), Figure 282 (SEQ ID NO:282). Figure 
284 (SEQ ID NO:284). Figure 286 (SEQ ID NO:286). Figure 288 (SEQ ID NO:288). Figure 290 (SEQ ID 






NO:290), Figure 292 (SEQ ID NO:292). Figure 294 (SEQ ID NO:294). Figure 296 (SEQ ID NO:296), Figure 
298 (SEQ ID NO:298), Figure 300 (SEQ ID NO:300). Figure 302 (SEQ ID NO:302), Figure 304 (SEQ ID 
NO:304). Figure 306 (SEQ ID NO:306), Figure 308 (SEQ ID NO:308). Figure 310 (SEQ ID NO:310), Figure 
312 (SEQ ID NO:312), Figure 314 (SEQ ID NO:314), Figure 316 (SEQ ID NO:316), Figure 318 (SEQ ID
NO:318), Figure 320 (SEQ ID NO:320). Figure 322 (SEQ ID NO:322), Figure 324 (SEQ ID NO:324), Figure 
326 (SEQ ID NO:326), Figure 328 (SEQ ID NO:328), Figure 330 (SEQ ID NO:330), Figure 332 (SEQ ID 
NO:332). Figure 334 (SEQ ID NO:334), Figure 336 (SEQ ID NO:336), Figure 338 (SEQ ID NO:338), Figure 
340 (SEQ ID NO:340). Figure 342 (SEQ ID NO:342), Figure 344 (SEQ ID NO:344), Figure 346 (SEQ ID 
NO:346), Figure 348 (SEQ ID NO:348). Figure 350 (SEQ ID NO:350), Figure 352 (SEQ ID NO:352), Figure 
354 (SEQ ID NO:354). Figure 356 (SEQ ID NO:356), Figure 358 (SEQ ID NO:358), Figure 360 (SEQ ID 
NO:360), Figure 362 (SEQ ID NO:362), Figure 364 (SEQ ID NO:364). Figure 366 (SEQ ID NO:366), Figure 
368 (SEQ ID NO:368). Figure 370 (SEQ ID NO:370). Figure 372 (SEQ ID NO:372), Figure 374 (SEQ ID 
NO:374). Figure 376 (SEQ ID NO:376), Figure 378 (SEQ ID NO:378). Figure 380 (SEQ ID NO:380), Figure 
382 (SEQ ID NO:382), Figure 384 (SEQ ID NO:384). Figure 386 (SEQ ID NO:386). Figure 388 (SEQ ID 
NO:388). Figure 390 (SEQ ID NO:390), Figure 392 (SEQ ID NO:392), Figure 394 (SEQ ID NO:394), Figure 
396 (SEQ ID NO:396). Figure 398 (SEQ ID NO:398), Figure 400 (SEQ ID NO:400), Figure 402 (SEQ ID 
NO:402), Figure 404 (SEQ ID NO:404), Figure 406 (SEQ ID NO:406), Figure 408 (SEQ ID NO:408), Figure 
410 (SEQ ID NO:410), Figure 412 (SEQ ID NO:412), Figure 414 (SEQ ID NO:414), Figure 416 (SEQ ID
NO:416), Figure 418 (SEQ ID NO:418), Figure 420 (SEQ ID NO:420). Figure 422 (SEQ ID NO:422), Figure 
424 (SEQ ID NO:424). Figure 426 (SEQ ID NO:426). Figure 428 (SEQ ID NO:428), Figure 430 (SEQ ID 
NO:430), Figure 432 (SEQ ID NO:432), Figure 434 (SEQ ID NO:434), Figure 436 (SEQ ID NO:436), Figure 
438 (SEQ ID NO:438). Figure 440 (SEQ ID NO:440), Figure 442 (SEQ ID NO:442), Figure 444 (SEQ ID 
NO:444). Figure 446 (SEQ ID NO:446). Figure 448 (SEQ ID NO:448), Figure 450 (SEQ ID NO:450), Figure 
452 (SEQ ID NO:452), Figure 454 (SEQ ID NO:454), Figure 456 (SEQ ID NO:456). Figure 458 (SEQ ID 
NO:458), Figure 460 (SEQ ID NO:460), Figure 462 (SEQ ID NO:462), Figure 464 (SEQ ID NO:464), Figure 
466 (SEQ ID NO:466). Figure 468 (SEQ ID NO:468), Figure 470 (SEQ ID NO:470), Figure 472 (SEQ ID 
NO:472), Figure 474 (SEQ ID NO:474). Figure 476 (SEQ ID NO:476), Figure 478 (SEQ ID NO:478). Figure 
480 (SEQ ID NO:480), Figure 482 (SEQ ID NO:482). Figure 484 (SEQ ID NO:484), Figure 486 (SEQ ID 
NO:486), Figure 488 (SEQ ID NO:488). Figure 490 (SEQ ID NO:490), Figure 492 (SEQ ID NO:492), Figure 
494 (SEQ ID NO:494), Figure 496 (SEQ ID NO:496). Figure 498 (SEQ ID NO:498), Figure 500 (SEQ ID 
NO:500). Figure 502 (SEQ ID NO:502), Figure 504 (SEQ ID NO:504), Figure 506 (SEQ ID NO:506), Figure 
508 (SEQ ID NO:508), Figure 510 (SEQ ID NO:510), Figure 512 (SEQ ID NO:512), Figure 514 (SEQ ID 
NO:514), Figure 516 (SEQ ID NO:516). Figure 518 (SEQ ID NO:518), Figure 520 (SEQ ID NO:520), Figure 
522 (SEQ ID NO:522), Figure 524 (SEQ ID NO:524), Figure 526 (SEQ ID NO:526). Figure 528 (SEQ ID 
NO:528). Figure 530 (SEQ ID NO:530), Figure 532 (SEQ ID NO:532), Figure 534 (SEQ ID NO:534). Figure 
536 (SEQ ID NO:536). Figure 538 (SEQ ID NO:538), Figure 540 (SEQ ID NO:540), Figure 542 (SEQ ID 
NO:542), Figure 544 (SEQ ID NO:544), Figure 546 (SEQ ID NO:546), Figure 548 (SEQ ID NO:548) or 
Figure 550 (SEQ ID NO:550), with its associated signal peptide; or
(c) an amino acid sequence of an extracellular domain of the polypeptide shown in Figure 2 (SEQ
ID NO:2), Figure 4 (SEQ ID NO:4), Figure 6 (SEQ ID NO:6), Figure 8 (SEQ ID NO:8), Figure 10 (SEQ ID
NO:10), Figure 12 (SEQ ID NO:12), Figure 14 (SEQ ID NO:14), Figure 16 (SEQ ID NO:16), Figure 18 (SEQ
ID NO: 18), Figure 20 (SEQ ID NO:20). Figure 22 (SEQ ID NO:22), Figure 24 (SEQ ID NO:24), Figure 26 
(SEQ ID NO:26). Figure 28 (SEQ ID NO:28), Figure 30 (SEQ ID NO:30). Figure 32 (SEQ ID NO:32). Figure 
34 (SEQ ID NO:34), Figure 36 (SEQ ID NO:36), Figure 38 (SEQ ID NO:38), Figure 40 (SEQ ID NO:40),
Figure 42 (SEQ ID NO:42). Figure 44 (SEQ ID NO:44). Figure 46 (SEQ ID NO:46). Figure 48 (SEQ ID 
NO:48). Figure 50 (SEQ ID NO:50). Figure 52 (SEQ ID NO:52). Figure 54 (SEQ ID NO:54). Figure 56 (SEQ 
ID NO:56). Figure 58 (SEQ ID NO:58). Figure 60 (SEQ ID NO:60), Figure 62 (SEQ ID NO:62), Figure 64 
(SEQ ID NO:64). Figure 66 (SEQ ID NO:66). Figure 68 (SEQ ID NO:68). Figure 70 (SEQ ID NO:70). Figure 
72 (SEQ ID NO:72), Figure 74 (SEQ ID NO:74). Figure 76 (SEQ ID NO:76). Figure 78 (SEQ ID NO:78). 
Figure 80 (SEQ ID NO:80). Figure 82 (SEQ ID NO:82). Figure 84 (SEQ ID NO:84). Figure 86 (SEQ ID 
NO:86). Figure 88 (SEQ ID NO:88). Figure 90 (SEQ ID NO:90), Figure 92 (SEQ ID NO:92). Figure 94 (SEQ 
ID NO:94). Figure 96 (SEQ ID NO:96), Figure 98 (SEQ ID NO:98). Figure 100 (SEQ ID NO: 100). Figure 
102 (SEQ ID NO: 102). Figure 104 (SEQ ID NO: 104). Figure 106 (SEQ ID NO: 106). Figure 108 (SEQ ID 
NO:108), Figure 110 (SEQ ID NO:110), Figure 112 (SEQ ID NO:112), Figure 114 (SEQ ID NO:114), Figure
116 (SEQ ID NO: 116), Figure 118 (SEQ ID NO: 118), Figure 120 (SEQ ID NO:120). Figure 122 (SEQ ID 
NO:122), Figure 124 (SEQ ID NO: 124). Figure 126 (SEQ ID NO: 126). Figure 128 (SEQ ID NO: 128). Figure 
130 (SEQ ID NO:130). Figure 132 (SEQ ID NO:132). Figure 134 (SEQ ID NO:134). Figure 136 (SEQ ID 
NO: 136). Figure 138 (SEQ ID NO: 138), Figure 140 (SEQ ID NO: 140). Figure 142 (SEQ ID NO: 142). Figure 
144 (SEQ ID NO: 144). Figure 146 (SEQ ID NO: 146). Figure 148 (SEQ ID NO: 148). Figure 150 (SEQ ID 
NO:150), Figure 152 (SEQ ID NO:152), Figure 154 (SEQ ID NO:154), Figure 156 (SEQ ID NO:156), Figure
158 (SEQ ID NO:158), Figure 160 (SEQ ID NO:160), Figure 162 (SEQ ID NO:162). Figure 164 (SEQ ID 
NO:164). Figure 166 (SEQ ID NO: 166), Figure 168 (SEQ ID NO:168). Figure 170 (SEQ ID NO:170). Figure 
172 (SEQ ID NO:172). Figure 174 (SEQ ID NO:174), Figure 176 (SEQ ID NO:176). Figure 178 (SEQ ID 
NO:178), Figure 180 (SEQ ID NO:180), Figure 182 (SEQ ID NO:182), Figure 184 (SEQ ID NO:184), Figure
186 (SEQ ID NO:186). Figure 188 (SEQ ID NO:188), Figure 190 (SEQ ID NO:190), Figure 192 (SEQ ID 
NO:192), Figure 194 (SEQ ID NO:194), Figure 196 (SEQ ID NO:196), Figure 198 (SEQ ID NO:198), Figure
200 (SEQ ID NO:200). Figure 202 (SEQ ID NO:202). Figure 204 (SEQ ID NO:204), Figure 206 (SEQ ID 
NO:206). Figure 208 (SEQ ID NO:208). Figure 210 (SEQ ID NO:210), Figure 212 (SEQ ID NO:212). Figure 
214 (SEQ ID NO:214). Figure 216 (SEQ ID NO:216). Figure 218 (SEQ ID NO:218). Figure 220 (SEQ ID 
NO:220). Figure 222 (SEQ ID NO:222). Figure 224 (SEQ ID NO:224). Figure 226 (SEQ ID NO:226). Figure 
228 (SEQ ID NO:228), Figure 230 (SEQ ID NO:230), Figure 232 (SEQ ID NO:232). Figure 234 (SEQ ID 
NO:234), Figure 236 (SEQ ID NO:236), Figure 238 (SEQ ID NO:238), Figure 240 (SEQ ID NO:240). Figure 
242 (SEQ ID NO:242). Figure 244 (SEQ ID NO:244). Figure 246 (SEQ ID NO:246). Figure 248 (SEQ ID 
NO:248). Figure 250 (SEQ ID NO:250), Figure 252 (SEQ ID NO:252), Figure 254 (SEQ ID NO:254), Figure 
256 (SEQ ID NO:256). Figure 258 (SEQ ID NO:258), Figure 260 (SEQ ID NO:260), Figure 262 (SEQ ID 
NO:262), Figure 264 (SEQ ID NO:264), Figure 266 (SEQ ID NO:266), Figure 268 (SEQ ID NO:268), Figure
270 (SEQ ID NO:270), Figure 272 (SEQ ID NO:272), Figure 274 (SEQ ID NO:274), Figure 276 (SEQ ID
NO:276), Figure 278 (SEQ ID NO:278). Figure 280 (SEQ ID NO:280), Figure 282 (SEQ ID NO:282), Figure 
284 (SEQ ID NO:284), Figure 286 (SEQ ID NO:286), Figure 288 (SEQ ID NO:288), Figure 290 (SEQ ID 
NO:290). Figure 292 (SEQ ID NO:292), Figure 294 (SEQ ID NO:294), Figure 296 (SEQ ID NO:296), Figure 
298 (SEQ ID NO:298), Figure 300 (SEQ ID NO:300). Figure 302 (SEQ ID NO:302), Figure 304 (SEQ ID 
NO:304), Figure 306 (SEQ ID NO:306), Figure 308 (SEQ ID NO:308), Figure 310 (SEQ ID NO:310), Figure
312 (SEQ ID NO:312), Figure 314 (SEQ ID NO:314), Figure 316 (SEQ ID NO:316), Figure 318 (SEQ ID
NO:318). Figure 320 (SEQ ID NO:320), Figure 322 (SEQ ID NO:322), Figure 324 (SEQ ID NO:324). Figure 
326 (SEQ ID NO:326), Figure 328 (SEQ ID NO:328). Figure 330 (SEQ ID NO:330), Figure 332 (SEQ ID 
NO:332), Figure 334 (SEQ ID NO:334). Figure 336 (SEQ ID NO:336), Figure 338 (SEQ ID NO:338). Figure 

340 (SEQ ID NO:340), Figure 342 (SEQ ID NO:342), Figure 344 (SEQ ID NO:344), Figure 346 (SEQ ID
NO:346). Figure 348 (SEQ ID NO:348), Figure 350 (SEQ ID NO:350), Figure 352 (SEQ ID NO:352). Figure 
354 (SEQ ID NO:354), Figure 356 (SEQ ID NO:356), Figure 358 (SEQ ID NO:358). Figure 360 (SEQ ID 
NO:360), Figure 362 (SEQ ID NO:362), Figure 364 (SEQ ID NO:364), Figure 366 (SEQ ID NO:366), Figure
368 (SEQ ID NO:368), Figure 370 (SEQ ID NO:370), Figure 372 (SEQ ID NO:372), Figure 374 (SEQ ID
NO:374), Figure 376 (SEQ ID NO:376), Figure 378 (SEQ ID NO:378), Figure 380 (SEQ ID NO:380), Figure
382 (SEQ ID NO:382). Figure 384 (SEQ ID NO:384), Figure 386 (SEQ ID NO:386), Figure 388 (SEQ ID 
NO:388). Figure 390 (SEQ ID NO:390), Figure 392 (SEQ ID NO:392). Figure 394 (SEQ ID NO:394), Figure 
396 (SEQ ID NO:396), Figure 398 (SEQ ID NO:398), Figure 400 (SEQ ID NO:400), Figure 402 (SEQ ID 
NO:402), Figure 404 (SEQ ID NO:404), Figure 406 (SEQ ID NO:406), Figure 408 (SEQ ID NO:408), Figure
410 (SEQ ID NO:410), Figure 412 (SEQ ID NO:412), Figure 414 (SEQ ID NO:414), Figure 416 (SEQ ID
NO:416). Figure 418 (SEQ ID NO:418). Figure 420 (SEQ ID NO:420), Figure 422 (SEQ ID NO:422). Figure 
424 (SEQ ID NO:424), Figure 426 (SEQ ID NO:426). Figure 428 (SEQ ID NO:428). Figure 430 (SEQ ID 
NO:430). Figure 432 (SEQ ID NO:432). Figure 434 (SEQ ID NO:434), Figure 436 (SEQ ID NO:436). Figure 
438 (SEQ ID NO:438), Figure 440 (SEQ ID NO:440). Figure 442 (SEQ ID NO:442), Figure 444 (SEQ ID 

NO:444), Figure 446 (SEQ ID NO:446), Figure 448 (SEQ ID NO:448), Figure 450 (SEQ ID NO:450), Figure
452 (SEQ ID NO:452), Figure 454 (SEQ ID NO:454), Figure 456 (SEQ ID NO:456). Figure 458 (SEQ ID 
NO:458), Figure 460 (SEQ ID NO:460). Figure 462 (SEQ ID NO:462), Figure 464 (SEQ ID NO:464). Figure 
466 (SEQ ID NO:466). Figure 468 (SEQ ID NO:468), Figure 470 (SEQ ID NO:470), Figure 472 (SEQ ID 
NO:472), Figure 474 (SEQ ID NO:474). Figure 476 (SEQ ID NO:476). Figure 478 (SEQ ID NO:478), Figure 

480 (SEQ ID NO:480), Figure 482 (SEQ ID NO:482), Figure 484 (SEQ ID NO:484), Figure 486 (SEQ ID
NO:486), Figure 488 (SEQ ID NO:488), Figure 490 (SEQ ID NO:490), Figure 492 (SEQ ID NO:492), Figure
494 (SEQ ID NO:494), Figure 496 (SEQ ID NO:496), Figure 498 (SEQ ID NO:498), Figure 500 (SEQ ID 
NO:500), Figure 502 (SEQ ID NO:502). Figure 504 (SEQ ID NO:504). Figure 506 (SEQ ID NO:506), Figure 
508 (SEQ ID NO:508), Figure 510 (SEQ ID NO:510), Figure 512 (SEQ ID NO:512), Figure 514 (SEQ ID
NO:514), Figure 516 (SEQ ID NO:516), Figure 518 (SEQ ID NO:518), Figure 520 (SEQ ID NO:520), Figure
522 (SEQ ID NO:522). Figure 524 (SEQ ID NO:524), Figure 526 (SEQ ID NO:526), Figure 528 (SEQ ID 
NO:528), Figure 530 (SEQ ID NO:530), Figure 532 (SEQ ID NO:532), Figure 534 (SEQ ID NO:534), Figure
536 (SEQ ID NO:536), Figure 538 (SEQ ID NO:538), Figure 540 (SEQ ID NO:540), Figure 542 (SEQ ID
NO:542), Figure 544 (SEQ ID NO:544), Figure 546 (SEQ ID NO:546), Figure 548 (SEQ ID NO:548) or 
Figure 550 (SEQ ID NO:550), lacking its associated signal peptide. 

21. A method of detecting a PRO1801 polypeptide in a sample suspected of containing a PRO1801
polypeptide, said method comprising contacting said sample with a PRO1114 or PRO4978 polypeptide and
determining the formation of a PRO1801/PRO1114 or PRO1801/PRO4978 polypeptide conjugate in said sample,
wherein the formation of said conjugate is indicative of the presence of a PRO1801 polypeptide in said sample.

22. The method according to Claim 21, wherein said sample comprises cells suspected of
expressing said PRO1801 polypeptide. 

23. The method according to Claim 21, wherein said PRO1114 or PRO4978 polypeptide is labeled
with a detectable label. 

24. The method according to Claim 21, wherein said PRO1114 or PRO4978 polypeptide is attached
to a solid support.

25. A method of detecting a PRO1114 or PRO4978 polypeptide in a sample suspected of containing
a PRO1114 or PRO4978 polypeptide, said method comprising contacting said sample with a PRO1801
polypeptide and determining the formation of a PRO1801/PRO1114 or PRO1801/PRO4978 polypeptide
conjugate in said sample, wherein the formation of said conjugate is indicative of the presence of a PRO1114
or PRO4978 polypeptide in said sample.

26. The method according to Claim 25, wherein said sample comprises cells suspected of
expressing said PRO1114 or PRO4978 polypeptide.

27. The method according to Claim 25, wherein said PRO1801 polypeptide is labeled with a 
detectable label. 

28. The method according to Claim 25, wherein said PRO1801 polypeptide is attached to a solid
support.

29. A method of linking a bioactive molecule to a cell expressing a PRO1801 polypeptide, said
method comprising contacting said cell with a PRO1114 or PRO4978 polypeptide that is bound to said bioactive
molecule and allowing said PRO1801 and said PRO1114 or PRO4978 polypeptides to bind to one another,
thereby linking said bioactive molecules to said cell.

30. The method according to Claim 29, wherein said bioactive molecule is a toxin, a radiolabel
or an antibody.



31. The method according to Claim 29, wherein said bioactive molecule causes the death of said
cell.

32. A method of linking a bioactive molecule to a cell expressing a PRO1114 or PRO4978
polypeptide, said method comprising contacting said cell with a PRO1801 polypeptide that is bound to said
bioactive molecule and allowing said PRO1801 and said PRO1114 or PRO4978 polypeptides to bind to one
another, thereby linking said bioactive molecules to said cell.

33. The method according to Claim 32, wherein said bioactive molecule is a toxin, a radiolabel 
or an antibody. 

34. The method according to Claim 32, wherein said bioactive molecule causes the death of said
cell.

35. A method of modulating at least one biological activity of a cell expressing a PRO1801
polypeptide, said method comprising contacting said cell with a PRO1114 or PRO4978 polypeptide or an
anti-PRO1801 polypeptide antibody, whereby said PRO1114 or PRO4978 polypeptide or anti-PRO1801 polypeptide
antibody binds to said PRO1801 polypeptide, thereby modulating at least one biological activity of said cell.

36. The method according to Claim 35, wherein said cell is killed. 

37. A method of modulating at least one biological activity of a cell expressing a PRO1114 or
PRO4978 polypeptide, said method comprising contacting said cell with a PRO1801 polypeptide or an
anti-PRO1114 or anti-PRO4978 polypeptide antibody, whereby said PRO1801 polypeptide or anti-PRO1114 or
anti-PRO4978 polypeptide antibody binds to said PRO1114 or PRO4978 polypeptide, thereby modulating at least
one biological activity of said cell.

38. The method according to Claim 37, wherein said cell is killed.

39. A method of detecting a PRO1114 polypeptide in a sample suspected of containing a PRO1114
polypeptide, said method comprising contacting said sample with a PRO100 polypeptide and determining the
formation of a PRO100/PRO1114 polypeptide conjugate in said sample, wherein the formation of said conjugate
is indicative of the presence of a PRO1114 polypeptide in said sample.

40. The method according to Claim 39, wherein said sample comprises cells suspected of
expressing said PRO1114 polypeptide.

41. The method according to Claim 39, wherein said PRO100 polypeptide is labeled with a
detectable label.



42. The method according to Claim 39, wherein said PRO100 polypeptide is attached to a solid
support.

43. A method of detecting a PRO100 polypeptide in a sample suspected of containing a PRO100
polypeptide, said method comprising contacting said sample with a PRO1114 polypeptide and determining the
formation of a PRO100/PRO1114 polypeptide conjugate in said sample, wherein the formation of said conjugate
is indicative of the presence of a PRO100 polypeptide in said sample.

44. The method according to Claim 43, wherein said sample comprises cells suspected of
expressing said PRO100 polypeptide.

45. The method according to Claim 43, wherein said PRO1114 polypeptide is labeled with a
detectable label. 

46. The method according to Claim 43, wherein said PRO1114 polypeptide is attached to a solid
support.

47. A method of linking a bioactive molecule to a cell expressing a PRO100 polypeptide, said
method comprising contacting said cell with a PRO1114 polypeptide that is bound to said bioactive molecule and
allowing said PRO100 and said PRO1114 polypeptides to bind to one another, thereby linking said bioactive
molecules to said cell.



48. The method according to Claim 47, wherein said bioactive molecule is a toxin, a radiolabel
or an antibody. 

49. The method according to Claim 47, wherein said bioactive molecule causes the death of said
cell.



50. A method of linking a bioactive molecule to a cell expressing a PRO1114 polypeptide, said
method comprising contacting said cell with a PRO100 polypeptide that is bound to said bioactive molecule and
allowing said PRO100 and said PRO1114 polypeptides to bind to one another, thereby linking said bioactive
molecules to said cell.

51. The method according to Claim 50, wherein said bioactive molecule is a toxin, a radiolabel
or an antibody.



52. The method according to Claim 50, wherein said bioactive molecule causes the death of said
cell.

53. A method of modulating at least one biological activity of a cell expressing a PRO100
polypeptide, said method comprising contacting said cell with a PRO1114 polypeptide or an anti-PRO100
polypeptide antibody, whereby said PRO1114 polypeptide or anti-PRO100 polypeptide antibody binds to said
PRO100 polypeptide, thereby modulating at least one biological activity of said cell.

54. The method according to Claim 53, wherein said cell is killed.

55. A method of modulating at least one biological activity of a cell expressing a PRO1114
polypeptide, said method comprising contacting said cell with a PRO100 polypeptide or an anti-PRO1114
polypeptide antibody, whereby said PRO100 polypeptide or anti-PRO1114 polypeptide antibody binds to said
PRO1114 polypeptide, thereby modulating at least one biological activity of said cell.

56. The method according to Claim 55, wherein said cell is killed.

57. A method for stimulating the release of TNF-α from human blood, said method comprising
contacting said blood with a PRO195, PRO202, PRO215, PRO221, PRO217, PRO222, PRO198, PRO245,
PRO172, PRO265, PRO266, PRO344, PRO337, PRO322, PRO1286, PRO1279, PRO1338 or PRO1343
polypeptide, wherein the release of TNF-α from said blood is stimulated.

58. A method for modulating the uptake of glucose or FFA by skeletal muscle cells, said method
comprising contacting said cells with a PRO182, PRO366, PRO198, PRO172 or PRO719 polypeptide, wherein
the uptake of glucose or FFA by said cells is modulated.

59. A method for stimulating the proliferation or differentiation of chondrocyte cells, said method
comprising contacting said cells with a PRO182, PRO366, PRO198, PRO1868, PRO202, PRO224, PRO172,
PRO301 or PRO1312 polypeptide, wherein the proliferation or differentiation of said cells is stimulated.

60. A method for modulating the uptake of glucose or FFA by adipocyte cells, said method
comprising contacting said cells with a PRO202, PRO211, PRO344 or PRO1338 polypeptide, wherein the
uptake of glucose or FFA by said cells is modulated.

61. A method for stimulating the proliferation of or gene expression in pericyte cells, said method
comprising contacting said cells with a PRO366 polypeptide, wherein the proliferation of or gene expression in
said cells is stimulated.



62. A method for stimulating the release of proteoglycans from cartilage, said method comprising
contacting said cartilage with a PRO216 polypeptide, wherein the release of proteoglycans from said cartilage
is stimulated.



63. A method for stimulating the proliferation of inner ear utricular supporting cells, said method
comprising contacting said cells with a PRO172 polypeptide, wherein the proliferation of said cells is stimulated.

64. A method for stimulating the proliferation of T-lymphocyte cells, said method comprising
contacting said cells with a PRO344 polypeptide, wherein the proliferation of said cells is stimulated.

65. A method for stimulating the release of a cytokine from PBMC cells, said method comprising
contacting said cells with a PRO526 or PRO1343 polypeptide, wherein the release of a cytokine from said cells
is stimulated.



66. A method for inhibiting the binding of A-peptide to factor VIIA, said method comprising
contacting a composition comprising said A-peptide and said factor VIIA with a PRO182 polypeptide, wherein
the binding of said A-peptide to said factor VIIA is inhibited.

67. A method for inhibiting the differentiation of adipocyte cells, said method comprising
contacting said cells with a PRO185 or PRO198 polypeptide, wherein the differentiation of said cells is inhibited.

68. A method for stimulating the proliferation of endothelial cells, said method comprising
contacting said cells with a PRO222 polypeptide, wherein the proliferation of said cells is inhibited.

69. A method for detecting the presence of tumor in a mammal, said method comprising
comparing the level of expression of any PRO polypeptide shown in Table 8 in (a) a test sample of cells taken
from said mammal and (b) a control sample of normal cells of the same cell type, wherein a higher level of
expression of said PRO polypeptide in the test sample as compared to the control sample is indicative of the
presence of tumor in said mammal.

70. The method of Claim 69, wherein said tumor is lung tumor, colon tumor, breast tumor,
prostate tumor, rectal tumor, cervical tumor or liver tumor.

71. An oligonucleotide probe derived from any of the nucleotide sequences shown in the
accompanying figures.



FIGURE 1 



GTTACTCGGTGGTGGCGGAGTCTACGGAAGCCGTTTTCGCTTCACTTTTCCTGGCTGTAGAGC 

GCTTTCCCCCTGGCGGGTGAGAGTGCAGAGACGAAGGTGCGAGATGAGCACTATGTTCGCGGA 

CACTCTCCTCATCGTTTTTATCTCTGTGTGCACGGCTCTGCTCGCAGAGGGCATAACCTGGGT 

CCTGGTTTACAGGACAGACAAGTACAAGAGACTGAAGGCAGAAGTGGAAAAACAGAGTAAAAA 

ATTGGAAAAGAAGAAGGAAACAATAACAGAGTCAGCTGGTCGACAACAGAAAAAGAAAATAGA 

GAGACAAGAAGAGAAACTGAAGAATAACAACAGAGATCTATCAATGGTTCGAATGAAATCCAT 

GTTTGCTATTGGCTTTTGTTTTACTGCCCTAATGGGAATGTTCAATTCCATATTTGATGGTAG 

AGTGGTGGCAAAGCTTCCTTTTACCCCTCTTTCTTACATCCAAGGACTGTCTCATCGAAATCT 

GCTGGGAGATGACACCACAGACTGTTCCTTCATTTTCCTGTATATTCTCTGTACTATGTCGAT 

TCGACAGAACATTCAGAAGATTCTCGGCCTTGCCCCTTCACGAGCCGCCACCAAGCAGGCAGG 

TGGATTTCTTGGCCCACCACCTCCTTCTGGGAAGTTCTCTT^CTCAAGAACTCTTTATTTT. 

CTATCATTCTTTCTAGACACACACACATCAGACTGGCAACTGTTTTGTAGCAAGAGCCATAGG 

TAGCCTTACTACTTGGGCCTCTTTCTAGTTTTGAATTATTTCTAAGCCTTTTGGGTATGATTA 

GAGTGAAAATGGCAGCCAGCAAACTTGATAGTGCTTTTGGTCCTAGATGATTTTTATCAAATA 

AGTGGATTGATTAGTTAAGTTCAGGTAATGTTTATGTAATGAAAAACAAATAGCATCCTTCTT 

GTTTCATTTACATAAGTATTTTCTGTGGGACCGACTCTCAAGGCACTGTGTATGCCCTGCAAG 

TTGGCTGTCTATGAGCATTTAGAGATTTAGAAGAAAAATTTAGTTTGTTTAACCCTTGTAACT 

GTTTGTTTTGTTGTTGTTTTTTTTTCAAGCCAAATACATGACATAAGATCAATAAAGAGGCCA 

AATTTTTAGCTGTTTTATGTACAAGGAGAGATCTGTTTCATTTTGTTTTGCCGTATTTCTAGA 

TATAAGTTTTAGCATGGGCCAGGAAGGACTAAAATAAAAGTTTTTAAGGTACAAAAAAAAAAA 
AAAA

FIGURE 2

MSTMFADTLLIVFISVCTALLAEGITWVLVYRTDKYKRLKAEVEKQSKKLEKKKETITESAGR 
QQKKKIERQEEKLKNNNRDLSMVRMKSMFAIGFCFTALMGMFNSIFDGRVVAKLPFTPLSYIQ
GLSHRNLLGDDTTDCSFIFLYILCTMSIRQNIQKILGLAPSRAATKQAGGFLGPPPPSGKFS 

Important features:
Signal peptide:

amino acids 1-22

N-myristoylation sites.

amino acids 103-109, 163-169

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 53-57

FIGURE 3



AGCCGGGGGCGGGTTTGAAGACGCGTCGTTGGGTTTTGGAGGCCGTGAAACAGCCGTTTGAGT 

TTGGCTGCGGGTGGAGAACGTTTGTCAGGGGCCCGGCCAAGAAGGAGGCCCGCCTGTTACGAT 

GGTGTCCATGAGTTTCAAGCGGAACCGCAGTGACCGGTTCTACAGCACCCGGTGCTGCGGCTG 

TTGCCATGTCCGCACCGGGACGATCATCCTGGGGACCTGGTACATGGTAGTAAACCTATTGAT 

GGCAATTTTGCTGACTGTGGAAGTGACTCATCCAAACTCCATGCCAGCTGTCAACATTCAGTA 

TGAAGTCATCGGTAATTACTATTCGTCTGAGAGAATGGCTGATAATGCCTGTGTTCTTTTTGC 

CGTCTCTGTTCTTATGTTTATAATCAGTTCAATGCTGGTTTATGGAGCAATTTCTTATCAAGT 

GGGTTGGCTGATTCCATTCTTCTGTTACCGACTTTTTGACTTCGTCCTCAGTTGCCTGGTTGC 

TATTAGTTCTCTCACCTATTTGCCAAGAATCAAAGAATATCTGGATCAACTACCTGATTTTCC 

CTACAAAGATGACCTCCTGGCCTTGGACTCCAGCTGCCTCCTGTTCATTGTTCTTGTGTTCTT 

TGCCTTATTCATCATTTTTAAGGCTTATCTAATTAACTGTGTTTGGAACTGCTATAAATACAT 

CAACAACCGAAACGTGCCGGAGATTGCTGTGTACCCTGCCTTTGAAAGCACCTCCTCAGTACG 

TTTTGCCAACCTATGAAATGGCCGTGAAAATGCCTGAAAAAGAACCACCACCTCCTTACTTAC 

CTGCCTGAAGAAATTCTGCCTTTGACAATAAATCCTATACCAGCTTTTTGTTTGTTTATGTTA 

CAGAATGCTGCAATTCAGGGCTCTTCAAACTTGTTTGATATAAAATATGTTGTCTTTTGTTTA 

AGCATTTATTTTCAAACACTAAGGAGCTTTTTGACATCTGTTAAACGTCTTTTTGTTTTTTTG

TTAAGTCTTTTACATTTTAATAGTTTTTGAAGACAATCTAGGTTAAGCAAGAGCAAAGTGCCA 

TTGTTTGCCTTTAATTGGGGGGTGGGAAGGGAAAGAGGGTACTTGCCACATAGTTTCCTTTTT 

AACTGCACTTTCTTTATATAATCGTTTGCATTTTGTTACTTGCTACCCTGAGTACTTTCAGGA 

AGACTGACTTAAATATTCGGGGTGAGTAAGTAGTTGGGTATAAGATCTGAACTTTTCATCTGC 

AGAGGCAAGAAAAATATTTGACATTGTGACTTGACTGTGGAAGATGATGGTTGCATGTTTCTA 

GTTTGTATATGTTTCCATCTTTGTGATAAGATGATTTAATAAATCTCTTTAAATACTAAAAAA 
AAAAAAAAA

FIGURE 4



MVSMSFKRNRSDRFYSTRCCGCCHVRTGTIILGTWYMVVNLLMAILLTVEVTHPNSMPAVNIQ 
YEVIGNYYSSERMADNACVLFAVSVLMFIISSMLVYGAISYQVGWLIPFFCYRLFDFVLSCLV 
AISSLTYLPRIKEYLDQLPDFPYKDDLLALDSSCLLFIVLVFFALFIIFKAYLINCVWNCYKY 
INNRNVPEIAVYPAFESTSSVRFANL 

Important features of the protein: 

Transmembrane domain (Possible type II transmembrane protein):

amino acids 30-49, 81-100, 111-131, 158-175 

N-glycosylation site. 

amino acids 9-13 

Tyrosine kinase phosphorylation sites. 

amino acids 8-16, 193-202

N-myristoylation site.

amino acids 68-74

FIGURE 5

CCCGCTGGCCCGTCAGTGCTCTCCCCGTCGTTTGCCCTCTCCAGTTCCCCCAGTGCCTGCCCT 

ACGCACCCCGATGGCGGAGCTGCGGCCTAGCGGCGCCCCCGGCCCCACCGCGCCCCCGGCCCC

TGGCCCGACTGCCCCCCCGGCCTTCGCTTCGCTCTTTCCCCCGGGACTGCACGCCATCTACGG 

AGAGTGCCGCCGCCTTTACCCTGACCAGCCGAACCCGCTCCAGGTTACCGCTATCGTCAAGTA 

CTGGTTGGGTGGCCCAGACCCCTTGGACTATGTTAGCATGTACAGGAATGTGGGGAGCCCTTC 

TGCTAACATCCCCGAGCACTGGCACTACATCAGCTTCGGCCTGAGTGATCTCTATGGTGACAA 

CAGAGTCCATGAGTTTACAGGAACAGATGGACCTAGTGGTTTTGGCTTTGAGTTGACCTTTCG 

TCTGAAGAGAGAAACTGGGGAGTCTGCCCCACCAACATGGCCCGCAGAGTTAATGCAGGGCTT 

GGCACGATACGTGTTCCAGTCAGAGAACACCTTCTGCAGTGGGGACCATGTGTCCTGGCACAG 

CCCTTTGGATAACAGTGAGTCAAGAATTCAGCACATGCTGCTGACAGAGGACCCACAGATGCA 

GCCCGTGCAGACACCCTTTGGGGTAGTTACCTTCCTCCAGATCGTTGGTGTCTGCACTGAAGA 

GCTACACTCAGCCCAGCAGTGGAACGGGCAGGGCATCCTGGAGCTGCTGCGGACAGTGCCTAT 

TGCTGGCGGCCCCTGGCTGATAACTGACATGCGGAGGGGAGAGACCATATTTGAGATCGATCC 

ACACCTGCAAGAGAGAGTTGACAAAGGCATCGAGACAGATGGCTCCAACCTGAGTGGTGTCAG 

TGCCAAGTGTGCCTGGGATGACCTGAGCCGGCCCCCCGAGGATGACGAGGACAGCCGGAGCAT 

CTGCATCGGCACACAGCCCCGGCGACTCTCTGGCAAAGACACAGAGCAGATCCGGGAGACCCT 

GAGGAGAGGACTCGAGATCAACAGCAAACCTGTCCTTCCACCAATCAACCCTCAGCGGCAGAA 

TGGCCTCGCCCACGACCGGGCCCCGAGCCGCAAAGACAGCCTGGAAAGTGACAGCTCCACGGC 

CATCATTCCCCATGAGCTGATTCGCACGCGGCAGCTTGAGAGCGTACATCTGAAATTCAACCA 

GGAGTCCGGAGCCCTCATTCCTCTCTGCCTAAGGGGCAGGCTCCTGCATGGACGGCACTTTAC 

ATATAAAAGTATCACAGGTGACATGGCCATCACGTTTGTCTCCACGGGAGTGGAAGGCGCCTT 

TGCCACTGAGGAGCATCCTTACGCGGCTCATGGACCCTGGTTACAACTCTGAACCTATCCTCG

GAGCTCTGCCCTCCCGTCCTGGAACGTCTTTCTGCCCTGAGGAGAGGGTAGTCAGCATCTCCA 

ATTTTCAGCAGCTCAAGAACCTTGGCCCCCACAGGACTTCGCAGATGTCACATTGCCCCTCAG 

TCCCCTGAATGCCCTTCGGACCCAACCCCAATTCCCCAAGCCCCTGACCCCCTAGCTGCCGGG 

GTTCCCACTCCCAGTGCCACAACCCCCTCACCTCCCCTGGCAGCCCCTCAGCGAGCCTGAGGC 

CCAGCACCCGCTGGCTCCCCAGCACATGGTCCCCTCCCATGGGCTGTTGCCCAGGGAACCGGG 

GCGCGGTGGGAACGAGCTGCTGGCCTCGGCATGTTTCAATAAAGTTGCTGTGCTGGGAG

FIGURE 6

MAELRPSGAPGPTAPPAPGPTAPPAFASLFPPGLHAIYGECRRLYPDQPNPLQVTAIVKYWLG 
GPDPLDYVSMYRNVGSPSANIPEHWHYISFGLSDLYGDNRVHEFTGTDGPSGFGFELTFRLKR
ETGESAPPTWPAELMQGLARYVFQSENTFCSGDHVSWHSPLDNSESRIQHMLLTEDPQMQPVQ
TPFGVVTFLQIVGVCTEELHSAQQWNGQGILELLRTVPIAGGPWLITDMRRGETIFEIDPHLQ
ERVDKGIETDGSNLSGVSAKCAWDDLSRPPEDDEDSRSICIGTQPRRLSGKDTEQIRETLRRG 
LEINSKPVLPPINPQRQNGLAHDRAPSRKDSLESDSSTAIIPHELIRTRQLESVHLKFNQESG 
ALIPLCLRGRLLHGRHFTYKSITGDMAITFVSTGVEGAFATEEHPYAAHGPWLQL 

Important features:
N-glycosylation site:

amino acids 265-268

FIGURE 7

CGCGAATGAAGTTTGCATTTTCCTCTGTTCTTGAGCCCAGCTTCTTCTCGTCTCCCACCCCAG 

CTTCCCGGCATTGGAAGAAGGGACCGTCCTCTTCCTTGTCTTGGCCACCCAAATCCTGGTATC 

GAAAGGGTTGAACGGACCGGAAGTGTGCAGCAGCGACGGGTCCCCAGCTAATCGACGCCGGAA 

GTAGCAATTACTAGACAAGCATTCCGCCGCCGGCTTCGCTATGGCGGCAATTCCCCCAGATTC

CTGGCAGCCACCCAACGTTTACTTGGAGACCAGCATGGGAATCATTGTGCTGGAGCTGTACTG 

GAAGCATGCTCCAAAGACCTGTAAGAACTTTGCTGAGTTGGCTCGTCGAGGTTACTACAATGG 

CACAAAATTCCACAGAATTATCAAAGACTTCATGATCCAAGGAGGTGACCCAACAGGGACAGG 

TCGAGGTGGTGCATCTATCTATGGCAAACAATTTGAAGATGAACTTCATCCAGACTTGAAATT 

CACGGGGGCTGGAATTCTCGCAATGGCCAATGCGGGGCCAGATACCAATGGCAGCCAGTTCTT 

TGTGACCCTCGCCCCCACCCAGTGGCTTGACGGCAAACACACCATTTTTGGCCGAGTGTGTCA 

GGGCATAGGAATGGTGAATCGCGTGGGAATGGTAGAAACAAACTCCCAGGACCGCCCTGTGGA 

CGACGTGAAGATCATTAAGGCATACCCTTCTGGGTAGACTTGCTACCCTCTTGAGCAGCTCTT 

CTGAGATGGCCCCAGTGAACCAGCTTCTAGATGACATAGAATGACATGTAATGCTAAATTTCA 

TTTTGGCTTTGCAAGTCATGAAGCTTAGGAGGCCTGGCATCTTGGGTGAGTTAGAGATGGAAG 

TACATTTTAATAGGATGCTTCTTTTCTCTTCCCCCAGTGCCTAGGTTGCCAGAGCATTTGCAC 

AAATGCCCCTGTTTATCAATAGGTGACTACTTACTACACATGAACCATAATGCTGCTTCTTGT 

GCATGTCTGCTCTGATATACGTCGAACAATGTAGCAGCCACTGTCATTTCTCAGTGGTTTTGC 

CTAACCAAACTTCTTCCTAAGGAGATTTATATTCTGGCCTACACAGCAGTCCTTGATGGCTGA 

CAGCCACAGAATTCCAAACCAAGTAGTGTCTGTCAGCCCTCTTAACTCTGTGCACGCCCTATT 

TCAGTCTTTTACATTTGTTCTTCTAGGGAATGTATGCATCTCTATATATATTTTCCCTCTCAA 

AACCAGAACATCAACAGTGCTGTTTCTGACACTTCAGACATCCCACGCAAAGCCACATTGAAT 

TTTTGCCAAATGAAAAACACATCCAACAATCAAGTTTCTAAGAAGGTGTCAAGTGGGGAATAA 

TAATAATGTATAATAATCAAGAAATTAGTTTATTAAAAGGAAGCAGAAGCATTGACCATTTTT 

TCCCAGAGAAGAGGAGAAATCTGTAGTGAGCAAAGGACAGACCATGAATCCTCCTTGAGAAGT 

AGTACTCTCAGAAAGGAGAAGCGCCACTCAAGTTCTTTTAACCCAAGACTTTAGAGAAATTAG 

GTCCAAGATTTTTATATGTTCAGTTGTTTATGTATAAAAATAACTTTCTGGATTTTGTGGGGA 

GGAGCAGGAGAGGAAGGAAGTTAATACCTATGTAATACATAGAAACTTCCACAATAAAATGCC 
ATTGATGGTTAAAAAAAAAAAAAAAAAAA 



FIGURE 8



MAAIPPDSWQPPNVYLETSMGIIVLELYWKHAPKTCKNFAELARRGYYNGTKFHRIIKDFMIQ 
GGDPTGTGRGGASIYGKQFEDELHPDLKFTGAGILAMANAGPDTNGSQFFVTLAPTQWLDGKH 
TIFGRVCQGIGMVNRVGMVETNSQDRPVDDVKIIKAYPSG 

Important features: 
N-glycosylation sites: 

amino acids 49-52, 108-111 

N-myristoylation sites: 

amino acids 64-69, 69-74, 143-148 

Cyclophilin-type peptidyl-prolyl cis-trans isomerase signature:

amino acids 48-65 
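
The feature annotations in these figures give residue positions as 1-based, inclusive ranges over the amino acid sequence. The following is a minimal Python sketch, not part of the original disclosure, of how such ranges can be read off the Figure 8 sequence. The function names are hypothetical, and the N-glycosylation scan assumes the commonly used consensus N-{P}-[ST]-{P}; the figures do not state which pattern or tool produced the annotations.

```python
import re

# Figure 8 amino acid sequence, joined from the three lines of the figure.
FIGURE_8_SEQ = (
    "MAAIPPDSWQPPNVYLETSMGIIVLELYWKHAPKTCKNFAELARRGYYNGTKFHRIIKDFMIQ"
    "GGDPTGTGRGGASIYGKQFEDELHPDLKFTGAGILAMANAGPDTNGSQFFVTLAPTQWLDGKH"
    "TIFGRVCQGIGMVNRVGMVETNSQDRPVDDVKIIKAYPSG"
)

# Assumption: consensus N-{P}-[ST]-{P}. The lookahead lets overlapping
# candidate sites be reported as well.
N_GLYC = re.compile(r"(?=(N[^P][ST][^P]))")


def feature_residues(seq, start, end):
    """Return residues start..end, both 1-based and inclusive."""
    return seq[start - 1:end]


def find_n_glycosylation(seq):
    """Return 1-based inclusive (start, end) ranges matching the consensus."""
    return [(m.start() + 1, m.start() + 4) for m in N_GLYC.finditer(seq)]


if __name__ == "__main__":
    print(feature_residues(FIGURE_8_SEQ, 49, 52))
    print(find_n_glycosylation(FIGURE_8_SEQ))
```

Under these assumptions, `feature_residues(FIGURE_8_SEQ, 49, 52)` returns `NGTK`, and the scan reports the ranges 49-52 and 108-111, which agree with the N-glycosylation sites listed for Figure 8.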

FIGURE 9

CGGACGCGTGGGCGCGCGCGAGCGCAGCGGTGGGAGGCGGCGACCAGCCGGTTGAGGCCCCAG 
GCTTGGCCTCACCACAATGTGGCACGAGGCTCGGAAGCATGAGCGGAAGCTTCGAGGCATGAT
GGTCGACTACAAGAAGAGGGCGGAGCGGAGACGGGAGTATTATGAAAAGATCAAGAAGGACCC 
AGCCCAGTTCCTGCAGGTACATGGCCGAGCTTGCAAGGTGCACCTGGATTCTGCAGTCGCCCT 
^S^S^^SS^'^^^^'^^'^'^^^^'^^^'^^^^CTGGCAGGGGGACACCAACAACATGATTGACCG 
ATTCGATGTCCGTGCCCAQCTGGACCACATCCCCGACTACACCCCCCCTCTGCTCACCACCAT 
CTCCCCAGAACAGGAGTCGGACGAACGGAAGTGTAACTACGAGCGCTACAGAGGCCTGGTGCA 
GAACGACTTTGCCGGCATCTCAGAGGAGCAGTGCCTGTACCAGATCTACATTGATGAGTTGTA 
SS^^^^^^^^^^^^^'^^^^^^^^ATGAGAAGAAGAAGCTGGCAGAGAAGAAGGCTTCCAT 
?S?J™S^'^^^°^^^^^^^'^^^^^'^^^^CGAGGTAGAGAAGGCGGCAGAAAAGCCAGAGGA 
GGAGGAGTCAGCGGCCGAGGAGGAGAGCAACTCGGACGAAGATGAGGTCATCCCCGACATCGA 
SJJSS^^^'^'^'^^^^'^^^^'^'^^'^'^^^^^AGGAGCAGGTGGCAGATCTCAACAAACAGGCCAC 
SoS^^S^^^^^'^^^^^^^^^'^^^^'^'fCGTCAGGATGCTCCGGAAAGACAAGGAGGAGGCAGA 
GGCCATCAAGCATGCCAAGGCTCTTGAGGAGGAGAAGGCCATGTACTCGGGACGCCGCTCTCG 
ACGCCAGCGGAGAGAGTTTCGGGAGAAGCGGCTGAGGGGTCGCAAGATCAGCCCACCCAGCTA 
;S^S^*^^^^^^*^^^^^^^^^^'''^^^A^'^CCTATAAGCGGTCACCCTCGGAGTCCAGCTCAGA 
SSSSS^'^^^'^^^'^'^^^^^'^^^^^^^'^^^<=^GG<^CCGCGAGGAGAAGATCACGTTCATCACCAG 
TTTTGGGGGCAGCGATGAGGAGGCAGCCGCAGCCGCTGCTGCCGCAGCAGCATCAGGAGTCAC 
SJEJJ2SJ?^^^^^^^^^^^'^^^^^^^^^^^^GC<5GCCCCGCCCCGGGACGTAATGCCAGCGC 
S22SSS™S^^^^'^^^^'^^^^^^^^^^*^^"^^^T*='rTCTGCCTCGAGGACCTCCAGCTCCCGCTC 
CAGCTCTCGCTCCAGCTCCCGCTCTCGCCGTGGTGGGGGCTACTACCGTTCCGGCCGCCACGC 
CCGCTCCCGGTCCCGCTCCTGGTCCCGCTCCCGCTCCCGCTCCCGGCGCTATTCCCGGTCCCG 
JJSSoS^^^^^^^^^^^^^'^^^^^'^^^^^^C'fCCCGAGACGGACACCGGTACTCCCGCTCGCC 
S^J^SSSoJ^^'^^^'^'^^^^^^^^^^^^^^CA^AAGCAGGAGCCGCTCCCACTCAGGGGACCG 
Si^S^ ^^^^^^^^^^^^^^^^"^^^G^^^ACCACAGCAGTAGCCGCAGCCGCAGCAGCTGGTC 
JJ^S^*^^^^'^^^'^^^^^'^^^"=^^^^'^^^<^TCGCAGCCGCAGCCATAGCCCCAGCCCCAGCCA 
S^SS^^^^^^^^^^^^^^^^CCGCAGCCAGAGCCCCTCGCCATCACCCGCAAGAGAGAAGCT 
S???^^^^^^^'^*^^^^^^^^^'^^^'r^'r^GGCGAGAAGCTGAAAAAGACCGAACCTGCCGCTGG 
TAAAGAGACAGGAGCTGCCAAAGTCACCCAAGCTGACGCCTCAGGAGAAGCTGAAACTGAGGA 
TGCAGAAGGCGCTGAACAGGCAGTTCAAGGCGGATAAGAAGGCGGCACAAGAAAAGATGATCC
AGCAGGAGCATGAGCGGCAGGAGCGGGAAGACGAGCTTCGAGCCATGGCCCGCAAGATCCGCA 
TGAAGGAGCGGGAACGCCGAGAGAAGGAGAGAGAAGAGTGGGAACGCCAGTACAGCCGGCAGA 
GCCGCTCACCCTCCCCCCGATACAGTCGAGAATACAGCTCTTCTCGAAGGCGCTCAAGGTCCC 
S^JS^^^^'^^^^^^^'^'^^^^^^^^T'^AGGCAGAAGAGTGGGGGGTGGGGAGGACAAGGGGGT 
2« ^^^^^^^^^^^■'^^^^A'^^CTGCTGGTTTTATCTCTAGTGAAATAAAGTCAAAAGTTA 
TTTAATTCCCGTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

FIGURE 10

MWHEARKHERKLRGMMVDYKKRAERRREYYEKIKKDPAQFLQVHGRACKVHLDSAVALAAESP 
VNMMPWQGDTNNMIDRFDVRAHLDHIPDYTPPLLTTISPEQESDERKCNYERYRGLVQNDFAG 
ISEEQCLYQIYIDELYGGLQRPSEDEKKKLAEKKASIGYTYEDSTVAEVEKAAEKPEEEESAA 
EEESNSDEDEVIPDIDVEVDVDELNQEQVADLNKQATTYGMADGDFVRMLRKDKEEAEAIKHA 
KALEEEKAMYSGRRSRRQRREFREKRLRGRKISPPSYARRDSPTYDPYKRSPSESSSESRSRS 
RSPTPGREEKITFITSFGGSDEEAAAAAAAAAASGVTTGKPPAPPQPGGPAPGRNASARRRSS 
SSSSSSSASRTSSSRSSSRSSSRSRRGGGYYRSGRHARSRSRSWSRSRSRSRRYSRSRSRGRR 
HSGGGSRDGHRYSRSPARRGGYGPRRRSRSRSHSGDRYRRGGRGLRHHSSSRSRSSWSLSPSR 
SRSLTRSRSHSPSPSQSRSRSRSRSQSPSPSPAREKLTRPAASPAVGEKLKKTEPAAGKETGA 
AKVTQADASGEAETEDAEGAEQAVQGG

Important features: 
N-glycosylation site: 

amino acids 370-373 

Glycosaminoglycan attachment site: 

amino acids 443-446

cAMP- and cGMP-dependent protein kinase phosphorylation site:

amino acids 159-162, 282-285, 291-294, 374-377, 375-378, 430-433,
440-443, 466-469

Casein kinase II phosphorylation site:

amino acids 149-152, 166-169, 171-174, 187-190, 193-196, 195-198,
303-306, 307-310, 335-338, 571-574

N-myristoylation sites:

amino acids 118-123, 229-234, 350-355, 446-451, 586-591

Amidation sites:

amino acids 263-266, 280-283, 438-441

FIGURE 11



GGTAGGCGCGCCCAGACCTGAGACGGGTTGGGACTGGGCTGCGTCACGCGCGGGCTCTAAGCG 

CCCGGGGCCCCGCCCAGTGGCCGGCACAGCCAATCGCAGCGCGGGAAGGCGGTGGGGGCGGGG 

AAGGCCGCCTGGAAACTTAAATCCCGAGGCGGGCGAACCTGCACCAGACCGCGGACGTCTGTA 

ATCTCAGAGGCTTGTTTGCTGAGGGTGCCTGCGCAGCTGCGACGGCTGCTGGTTTTGAAACAT

GAATCTTTCGCTCGTCCTGGCTGCCTTTTGCTTGGGAATAGCCTCCGCTGTTCCAAAATTTGA 

CCAAAATTTGGATACAAAGTGGTACCAGTGGAAGGCAACACACAGAAGATTATATGGCGCGAA 

TGAAGAAGGATGGAGGAGAGCAGTGTGGGAAAAGAATATGAAAATGATTGAACTGCACAATGG 

GGAATACAGCCAAGGGAAACATGGCTTCACAATGGCCATGAATGCTTTTGGTGACATGACCAA 

TGAAGAATTCAGGCAGATGATGGGTTGCTTTCGAAACCAGAAATTCAGGAAGGGGAAAGTGTT 

CCGTGAGCCTCTGTTTCTTGATCTTCCCAAATCTGTGGATTGGAGAAAGAAAGGCTACGTGAC 

GCCAGTGAAGAATCAGAAACAGTGTGGTTCTTGTTGGGCTTTTAGTGCGACTGGTGCTCTTGA 

AGGACAGATGTTCCGGAAAACTGGGAAACTTGTCTCACTGAGCGAGCAGAATCTGGTGGACTG 

TTCGCGTCCTCAAGGCAATCAGGGCTGCAATGGTGGCTTCATGGCTAGGGCCTTCCAGTATGT 

CAAGGAGAACGGAGGCCTGGACTCTGAGGAATCCTATCCATATGTAGCAGTGGATGAAATCTG 

TAAGTACAGACCTGAGAATTCTGTTGCTAATGACACTGGCTTCACAGTGGTCGCACCTGGAAA 

GGAGAAGGCCCTGATGAAAGCAGTCGCAACTGTGGGGCCCATCTCCGTTGCTATGGATGCAGG 

CCATTCGTCCTTCCAGTTCTACAAATCAGGCATTTATTTTGAACCAGACTGCAGCAGCAAAAA 

CCTGGATCATGGTGTTCTGGTGGTTGGCTACGGCTTTGAAGGAGCAAATTCGAATAACAGCAA 

GTATTGGCTCGTCAAAAACAGCTGGGGTCCAGAATGGGGCTCGAATGGCTATGTAAAAATAGC 

CAAAGACAAGAACAACCACTGTGGAATCGCCACAGCAGCCAGCTACCCCAATGTGTGAGCTGA

TGGATGGTGAGGAGGAAGGACTTAAGGACAGCATGTCTGGGGAAATTTTATCTTGAAACTGAC 

CAAACGCTTATTGTGTAAGATAAACCAGTTGAATCATGGAGGATCCAAGTTGAGATTTTAATT 

CTGTGACATTTTTACAAGGGTAAAATGTTACCACTACTTTAATTATTGTTATACACAGCTTTA 

TGATATCAAAGACTCATTGCTTAATTCTAAGACTTTTGAATTTTCATTTTTTAAAAAGATGTA 

CAAAACAGTTTGAAATAAATTTTAATTCGTATATA

FIGURE 12

MNLSLVLAAFCLGIASAVPKFDQNLDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIELHN
GEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRNQKFRKGKVFREPLFLDLPKSVDWRKKGYV 
TPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQY 
VKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMKAVATVGPISVAMDA
GHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKI
AKDKNNHCGIATAASYPNV 

Important features: 
Signal sequence 

amino acids 1-17 

N-glycosylation sites. 

amino acids 2-6, 221-225, 292-296 

N-myristoylation sites. 

amino acids 13-19, 93-99, 136-142, 145-151, 174-180, 177-183,
180-186, 194-200, 288-294, 324-330

Eukaryotic thiol (cysteine) proteases cysteine active site.

amino acids 132-144

Eukaryotic thiol (cysteine) proteases histidine active site.

amino acids 275-286

FIGURE 13



GGCGGCGTCATGTGATCCGCTTCCCTGCTCCTTTAAGCGTCCACAGGCGGCGGAGCGGCCACA 

ATCACAGCTCCGGGCATTGGGGGAACCCGAGCCGGCTGCGCCGGGGGAATCCGTGCGGGCGCC 

TTCCGTCCCGGTCCCATCCTCGCCGCGCTCCAGCACCTCTGAAGTTTTGCAGCGCCCAGAAAG 

GAGGCGAGGAAGGAGGGAGTGTGTGAGAGGAGGGAGCAAAAAGCTCACCCTAAAACATTTATT 

TCAAGGAGAAAAGAAAAAGGGGGGGCGCAAAAATGGCTGGGGCAATTATAGAAAACATGAGCA 

CCAAGAAGCTGTGCATTGTTGGTGGGATTCTGCTCGTGTTCCAAATCATCGCCTTTCTGGTGG 

GAGGCTTGATTGCTCCAGGGCCCACAACGGCAGTGTCCTACATGTCGGTGAAATGTGTGGATG 

CCCGTAAGAACCATCACAAGACAAAATGGTTCGTGCCTTGGGGACCCAATCATTGTGACAAGA 

TCCGAGACATTGAAGAGGCAATTCC/y^GGGAAATTGAAGCCAATGACATCGTGTTTTCTGTTC 

ACATTCCCCTCCCCCACATGGAGATGAGTCCTTGGTTCCAATTCATGCTGTTTATCCTGCAGC 

TGGACATTGCCTTCAAGCTAAACAACCAAATCAGAGAAAATGCAGAAGTCTCCATGGACGTTT 

CCCTGGCTTACCGTGATGACGCATTTGCTGAGTGGACTGAAATGGCCCATGAAAGAGTACCAC 

GGAAACTCAAATGCACCTTCACATCTCCCAAGACTCCAGAGCATGAGGGCCGTTACTATGAAT 

GTGATGTCCTTCCTTTCATGGAAATTGGGTCTGTGGCCCATAAGTTTTACCTTTTAAACATCC 

GGCTGCCTGTGAATGAGAAGAAGAAAATCAATGTGGGAATTGGGGAGATAAAGGATATCCGGT 

TGGTGGGGATCCACCAAAATGGAGGCTTCACCAAGGTGTGGTTTGCCATGAAGACCTTCCTTA 

CGCCCAGCATCTTCATCATTATGGTGTGGTATTGGAGGAGGATCACCATGATGTCCCGACCCC 

CAGTGCTTCTGGAAAAAGTCATCTTTGCCCTTGGGATTTCCATGACCTTTATCAATATCCCAG 

TGGAATGGTTTTCCATCGGGTTTGACTGGACCTGGATGCTGCTGTTTGGTGACATCCGACAGG 

GCATCTTCTATGCGATGCTTCTGTCCTTCTGGATCATCTTCTGTGGCGAGCACATGATGGATC 

AGCACGAGCGGAACCACATCGCAGGGTATTGGAAGCAAGTCGGACCCATTGCCGTTGGCTCCT 

TCTGCCTCTTCATATTTGACATGTGTGAGAGAGGGGTACAACTCACGAATCCCTTCTACAGTA 

TCTGGACTACAGACATTGGAACAGAGCTGGCCATGGCCTTCATCATCGTGGCTGGAATCTGCC 

TCTGCCTCTACTTCCTGTTTCTATGCTTCATGGTATTTCAGGTGTTTCGGAACATCAGTGGGA 

AGCAGTCCAGCCTGCCAGCTATGAGCAAAGTCCGGCGGCTACACTATGAGGGGCTAATTTTTA 

GGTTCAAGTTCCTCATGCTTATCACCTTGGCCTGCGCTGCCATGACTGTCATCTTCTTCATCG 

TTAGTCAGGTAACGGAAGGCCATTGGAAATGGGGCGGCGTCACAGTCCAAGTGAACAGTGCCT 

TTTTCACAGGCATCTATGGGATGTGGAATCTGTATGTCTTTGCTCTGATGTTCTTGTATGCAC 

CATCCCATAAAAACTATGGAGAAGACCAGTCCAATGGCGATCTGGGTGTCCATAGTGGGGAAG 

AACTCCAGCTCACCACCACTATCACCCATGTGGACGGACCCACTGAGATCTACAAGTTGACCC 

GCAAGGAGGCCCAGGAGTAGGAGGCTGCAGCGCCCGGCTGGGACGGTCTCTCCATACCCCAGC 

CCCTCTAACTAGAGTGGGGAGCATGCCAGAGAGAGCTCAATGTACAAATGAATGCCTCATGGC 

TCTTAGCTGTGGTTTCTTGGACCAGCGGCATGGACATTTGTCAGTTTGCCTTCTGACGGTAGC 

TTTTGGAGGAAGATTCCTGCAGCCACTAATGCATTGTGTATGATAACAAAAACTCTGGTATGA 

CACATTTTCTGTGATCATTGTTAATTAGTGACATAGTAACATCTGTAGCAGCTGGTTAGTAAA 

CCTCATGTGGGGGTGGGGTGGGGGTGTATTCCTTGGGGGATGGTTTGGGCCGAATGGGGAGTG 

GAATATTTGACATTTTTCCTGTTTTAAATTCTAGGATAGATTTTAACATCCTTTGCGGTCCCA 

GTCCAAGGTAGGCTGGTGTCATAGTCTTCTCACTCCTAATCCATGACCACTGTTTTTTTCCTA 

TTTATATCACCAGGTAGCCTACTGAGTTAATATTTAAGTTGTCAATAGATAAGTGTCCCTGTT 

TTGTGGCATAATATAACTGAATTTCATGAGAAGATTTATTCCACCAGGGGTATTTCAGCTTTG 

AAACCAAATCTGTGTATCTAATACTAACCAATCTGTTGGATGTGGATTTTAAAAAATGTTTGC 

TAAACTACCCAAGTAAGATTTACTGTATTAAATGGCCTTCGGGTCTGAAAAGCTTTTTTAACC 

TCTTGCTTAAAATGCGTTTTATTTTGATAAGATACTTCAAATAGCCTCCAAAAGTGTAGATCC 

AATCACTTAAATAAACCTGTATGTATATGCAAAAAAAAAAAAAAAAAAAAAAA

FIGURE 14

MAGAIIENMSTKKLCIVGGILLVFQIIAFLVGGLIAPGPTTAVSYMSVKCVDARKNHHKTKWF 
VPWGPNHCDKIRDIEEAIPREIEANDIVFSVHIPLPHMEMSPWFQFMLFILQLDIAFKLNNQI 
RENAEVSMDVSLAYRDDAFAEWTEMAHERVPRKLKCTFTSPKTPEHEGRYYECDVLPFMEIGS 
VAHKFYLLNIRLPVNEKKKINVGIGEIKDIRLVGIHQNGGFTKVWFAMKTFLTPSIFIIMVWY 
WRRITMMSRPPVLLEKVIFALGISMTFINIPVEWFSIGFDWTWMLLFGDIRQGIFYAMLLSFW 
IIFCGEHMMDQHERNHIAGYWKQVGPIAVGSFCLFIFDMCERGVQLTNPFYSIWTTDIGTELA 
MAFIIVAGICLCLYFLFLCFMVFQVFRNISGKQSSLPAMSKVRRLHYEGLIFRFKFLMLITLA 
CAAMTVIFFIVSQVTEGHWKWGGVTVQVNSAFFTGIYGMWNLYVFALMFLYAPSHKNYGEDQS 
NGDLGVHSGEELQLTTTITHVDGPTEIYKLTRKEAQE 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

Transmembrane domains:

amino acids 239-253, 269-284, 302-318, 338-352, 377-399, 434-452, 
471-488 

N-glycosylation sites.

amino acids 8-12, 406-410

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 254-258

N-myristoylation sites.

amino acids 223-229, 274-280, 305-311, 358-364, 374-380, 386-392,
509-515

FIGURE 15

GTGAGGGGAACAGCTGATCCGTCTGTTGGGAGGACAGATATCTCAAGGCCAGGATGGAAGAAT
CACCACTAAGCCGGGCACCATCCCGTGGTGGAGTCAACTTTCTCAATGTAGCCCGGACCTACA 
TCCCCAACACCAAGGTGGAATGTCACTACACCCTTCCCCCAGGCACCATGCCCAGTGCCAGTG 
ACTGGATTGGCATCTTCAAGGTGGAGGCTGCCTGTGTTCGGGATTACCACACATTTGTGTGGT 
CTTCCGTGCCTGAAAGTACAACTGATGGTTCCCCCATTCACACCAGTGTCCAGTTCCAAGCCA 
GCTACCTGCCCAAACCAGGAGCrCAGCTCTACCAGTTCCGATATGTGAACCGCCAGGGCCAGG 
TGTGTGGGCAGAGCCCCCCTTTCCAGTTCCGAGAGCCAAGGCCCATGGATGAACTGGTGACCC 
TGGAGGAGGCTGATGGGGGCTCTGACATCCTGCTGGTTGTCCCCAAGGCAACTGTGTTACAGA 
ACCAGCTCGATGAGAGCCAGCAAGAACGGAATGACCTGATGCAGCTGAAGCTACAGCTGGAGG 
GACAGGTGACAGAGCTGAGGAGCCGAGTGCAGGAGCTCGAGAGGGCTCTGGCAACTGCCAGGC 
AGGAGCACACGGAGCTGATGGAACAGTACAAGGGGATTTCCCGGTCCCATGGGGAGATCACAG 
AAGAGAGGGACATCCTGAGCCGGCAACAGGGAGACCATGTGGCACGCATCCTGGAGCTAGAGG 
?^?5^^^^^^^^^^^'^^^^'^^^^^^'^<=C'rGACGAAGGAAGTGGAGCTGGACAGGCTTAGAG 
ACACAGTGAAGGCCCTGACTCGGGAACAAGAGAAGCTCCTTGGGCAACTGAAAGAAGTACAAG 
CAGACAAGGAGCAAAGTGAGGCTGAGCTCCAAGTGGCACAACAGGAGAACCATCACTTAAATT 
TGGACCTGAAGGAGGCGAAGAGCTGGCAAGAGGAGCAGAGTGCTCAGGCTCAGCGACTGAAAG 
^S^5^'^^*^^^^^^^'^^^^^^^^^<^CTAGGCCAGGCCCAGCAGCGGGTGGCCGAGCTGGAGC 
CCTTGAAGGAGCAGCTTCGAGGGGCCCAGGAGCTTGCAGCCTCAAGCCAGCAGAAAGCCACCC 
TTCTTGGGGAGGAGTTGGCCAGTGCAGCAGCAGCCAGGGAGCGCACCATAGCCGAACTACACC 
GCAGCCGCCTGGAAGTGGCTGAAGTTAACGGCAGGCTGGCTGAGCTCGGTTTGCACTTGAAGG 
AAGAAAAATGCCAATGGAGCAAGGAGCGGGCAGGGCTGCTGCAGAGTGTGGAGGCAGAGAAGG 
ACAAGATCCTGAAGCTGAGTGCAGAGATACTTCGATTGGAGAAGGCAGTTCAGGAGGAGAGGA 
CCCAAAACCAAGTGTTCAAGACTGAGCTGGCCCGGGAGAAGGATTCTAGCCTGGTACAGTTGT 
CAGAAAGTAAGCGGGAGCTGACAGAGCTGCGGTCAGCCCTGCGTGTGCTCCAGAAGGAAAAGG 
^o^^^^^^^^^'^^^^^^'^^^^^^'^'^GCTAGAGTACATGAGAAAGCTAGAGGCCCGCCTGG 
^^^^^^'^^^^^^'^^^^'^^AGGATGCCACCACAGAGGATGAGGAGGCCGCTGTGG 
o^JJSS'^^'^^'^^^^^^^^^^^^^^^^^^^'rC^SAGGACGAGTCCCCAGAAGACATGAGGCTCC 
CACCCTATGGCCTTTGTGAGCGTGGAGACCCAGGCTCCTCTCCTGCTGGGCCTCGAGAGGCTT 
CTCCCCTTGTTGTCATCAGCCAGCCGGCTCCCATTTCTCCTCACCTCTCTGGGCCAGCTGAGG 
ACAGTAGCTCTGACTCGGAGGCTGAAGATGAGAAGTCAGTCCTGATGGCAGCTGTGCAGAGTG 
SSSSZ^^^^^^^^^^^^'^'^^^'^^^'^TC^TGAACTGGGCAGTGCCTTCTATGACATGGCCAGTG 
JSIoIJS^^'^^^^^^^^^'^^^^^^^^^A^CACTGGGGGCCCTGCCACCCCCACATGGAAGG 
AGTGTCCTATCTGTAAGGAGCGCTTTCCTGCTGAGAGTGACAAGGATGCCCTGGAGGACCACA 
TGGATGGACACTTCTTTTTCAGCACCCAGGACCCCTTCACCTTTGAGTGATCTTACTCCCTCG
TACATGCACAAATACACACTCATGCACACACACACTCACACACATGCATACACTTAGGTTTCA 
JS2™r"^^'^^'^^^*=^^^'^^'^^^'^^^^'rGATATTCTGTTCCCTAAGAACTGCTTCTGTGTGC 
CCTGTTTTCATCCCAAGATTTCTCACTTCATCCTCTCCTACCTGGCTCTTTTGTCCCAGGGAG 
GGGTCCTGTTCGGAAGCAGTGGCTGAATTTATCCCCTGAAAGTGGTTTTGGAGGAACCGGGAT 
GGAGGAGGCCTTCCCCTGTGGGAATAGAATCGTCCACTCCTAGCCCTGGTTGCTTCTGATACA 
CAGCCACTGCACACACACACTCACACTCACACTCCCTTGTCTGATGCCCCAAAGCCAATTCCT 
GGGGCACCCTACCCTCTCTTATTTGGAGTTTCCGTTGGTTTACCTGAGTTTTCTCTGGGGTCT 
GCACAGAGGCAGCAGCATGGACATCATGGCCTCTCAGGTCCCTTTTGGTTCTCAGTTTCATTG 
GTTCCTCTTTCTGTTCCCCCATTGACTTCTGTGCCCCACCCTAGCCTTTTCCATAACCTTAGG 
TATTCAGTTTGGAGGGGTTTTTTGTATTTTTGAGGATTCCTGTATTCTGTATCCTCTCCTCGC 
SIS^SP^^^^'^^^^^^^^'^'^'^A*'^'^TGTGCCTTCTGTGAGGAATGGGGGGAACAAGTG 
SSSS™^'^^^^^^'^'^'^^^^^^^^^^<=^'^CCCTCTCCAGGTCCCCCCACAGCAATAAAAG 
CTTCCCCCTGATATCCATCCCTTTGTAGTTTGAACAAATATATTTATATGATATGTAA

FIGURE 16

MEESPLSRAPSRGGVNFLNVARTYIPNTKVECHYTLPPGTMPSASDWIGIFKVEAACVRDYHT 
FVWSSVPESTTDGSPIHTSVQFQASYLPKPGAQLYQFRYVNRQGQVCGQSPPFQFREPRPMDE 
LVTLEEADGGSDILLVVPKATVLQNQLDESQQERNDLMQLKLQLEGQVTELRSRVQELERALA 
TARQEHTELMEQYKGISRSHGEITEERDILSRQQGDHVARILELEDDIQTISEKVLTKEVELD 
RLRDTVKALTREQEKLLGQLKEVQADKEQSEAELQVAQQENHHLNLDLKEAKSWQEEQSAQAQ 
RLKDKVAQMKDTLGQAQQRVAELEPLKEQLRGAQELAASSQQKATLLGEELASTVAAARDRTIA
ELHRSRLEVAEVNGRLAELGLHLKEEKCQWSKERAGLLQSVEAEKDKILKLSAEILRLEKAVQ 
EERTQNQVFKTELAREKDSSLVQLSESKRELTELRSALRVLQKEKEQLQEEKQELLEYMRKLE 
ARLEKVADEKWNEDATTEDEEAAVGLSCPAALTDSEDESPEDMRLPPYGLCERGDPGSSPAGP 
REASPLVVISQPAPISPHLSGPAEDSSSDSEAEDEKSVLMAAVQSGGEEANLLLPELGSAFYD
MASGFTVGTLSETSTGGPATPTWKECPICKERFPAESDKDALEDHMDGHFFFSTQDPFTFE 

Important features:

Casein kinase II phosphorylation sites:

amino acids 28-31, 43-46, 68-71, 72-75, 129-132, 156-159, 208-211,
239-242, 282-285, 305-308, 376-379, 383-383, 468-471, 520-523,
521-524, 537-540, 539-542, 543-546, 593-596, 595-598, 597-600,
612-615, 639-642, 652-655, 667-670, 683-686

N-myristoylation sites:

amino acids 39-44, 107-112, 204-209, 414-419, 561-566, 613-618

Cell attachment sequence:

amino acids 557-559

Leucine zipper pattern sequence:

amino acids 163-184, 475-496, 482-503

FIGURE 17

GCAAGTTGGGAATTTTAGACTGTCACTGCACATGGACCTCTGGGAAGACGTCTGGCGAGAGCT 

AGGCCCACTGGCCCTACAGACGGATCTTGCTGGCTCACCTGTCCCTGTGGAGGTTCCCCTGGG 

AAGGCAAGATGCCCAACAACAGCACTGCTCTGTCATTGGCCAATGTTACCTACATCACCATGG

AAATTTTCATTGGACTCTGCGCCATAGTGGGCAACGTGCTGGTCATCTGCGTGGTCAAGCTGA 

ACCCCAGCCTGCAGACCACCACCTTCTATTTCATTGTCTCTCTAGCCCTGGCTGACATTGCTG 

TTGGGGTGCTGGTCATGCCTTTGGCCATTGTTGTCAGCCTGGGCATCACAATCCACTTCTACA 

GCTGCCTTTTTATGACTTGCCTACTGCTTATCTTTACCCACGCCTCCATCATGTCCTTGCTGG 

CCATCGCTGTGGACCGATACTTGCGGGTCAAGCTTACCGTCAGATTCAGAATTCCTGGGCTCC 

CTGGGTGCATTCTATCATTCCAGTTGAAAGTTTGCTTCCTTCCAGTCATGTGGCTCTTCATTC 

TACTCTCCTTGGCTCTCATTTCAGATGCCATGGTCATGGATGAAAAGGTCAAGAGAAGCTTTG 

TGCTGGACACGGCTTCTGCCATCTGCAACTACAATGCCCACTACAAGAATCACCCCAAATACT 

GGTGCCGAGGCTATTTCCGTGACTACTGCAACATCATCGCCTTCTCCCCTAACAGCACCAATC 

ATGTGGCCCTGAGGGACACAGGGAACCAGCTCATTGTCACTATGTCCTGCCTGACCAAAGAGG 

ACACGGGCTGGTACTGGTGTGGCATCCAGCGGGACTTTGCCAGGGATGACATGGATTTTACAG 

AGCTGATTGTAACTGACGACAAAGGAACCCTGGCCAATGACTTTTGGTCTGGGAAAGACCTAT 

CAGGCAACAAAACCAGAAGCTGCAAGGCTCCCAAAGTTGTCCGCAAGGCTGACCGCTCCAGGA 

CGTCCATTCTCATCATTTGCATACTGATCACGGGTTTGGGAATCATCTCTGTAATCAGTCATT 

TGACCAAAAGGAGGAGAAGTCAAAGGAATAGAAGGGTAGGCAACACTTTGAAGCCCTTCTCGC 

GTGTCCTGACTCCAAAGGAAATGGCTCCTACTGAACAGATGIfi&CTGAAGATTTTTTTAATTT 

AGTTCATAAAGTGATGCTACAACAGAATAATCACCATGACAACTGGCCCACACCTCAGAGACT 

GATTCTGATCTCCCAGGAATTCTGAAGGACCCTCTATCCTTGACAACAATCATTTGCAGCCAG 

GTAGCAACGGCGGTAGTCAGAGGAGCTATGATAGACCACACCCAAGCAAGGCTGCCCTCAAAT 

AACATCTCAAGATCTTAGTTCTTATGCATTCCATCAGTCAGAAGTGAAGAAGAGGTGGAGAAT 

CTGGATTGGGGACCAGGAAATCACTTGTATTTTGTTAGCCAATAAATTCCTAGCCAGTGTTGA 
ATGAAAAAAAAAAAAA 
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FIGURE 18 

MPNNSTALSLANVTYITMEIFIGLCAIVGNVLVICVVKLNPSLQTTTFYFIVSLALADIAVGV 
LVMPLAIVVSLGITIHFYSCLFMTCLLLIFTHASIMSLLAIAVDRYLRVKLTVRFRIPGLPGC
ILSFQLKVCFLPVMWLFILLSLALISDAMVMDEKVKRSFVLDTASAICNYNAHYKNHPKYWCR
GYFRDYCNIIAFSPNSTNHVALRDTGNQLIVTMSCLTKEDTGWYWCGIQRDFARDDMDFTELI 
VTDDKGTLANDFWSGKDLSGNKTRSCKAPKVVRKADRSRTSILIICILITGLGIISVISHLTK 
RRRSQRNRRVGNTLKPFSRVLTPKEMAPTEQM

Important features of the protein: 
Transmembrane domains:

amino acids 16-35, 62-80, 89-101, 134-152, 292-311
N-glycosylation sites. 

amino acids 3-7, 4-8, 12-16, 204-208, 273-277 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 316-320 

N-myristoylation sites.

amino acids 122-128, 125-131, 258-264

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 214-225 



G-protein coupled receptors proteins.

amino acids 29-59, 76-116 
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FIGURE 19

CTCGGGCGCGCACAGGCAGCTCGGTTTGCCCTGCGATTGAGCTGCGGGTCGCGGCCGGCGCCGGCCTCTCCAATG 
^™^^THP'^^°"°^^°°^'^^<^^<^^<=^°'^CTTTCGGCAAAGGCAGTCGAGTGTTTGCAGACCGGGGCGAGTC 
CTGTGAAAGCAGATAAAAGAAAACATTTATTAACGTGTCATTACGAGGGGAGCGCCCGGCCGGGGCTGTCGCACT 
CCCCGCGGAACATTTGGCTCCCTCCAGCTCCGAGAGAGGAGAAGAAGAAAGCGGAAAAGAGGCAGATTCACGTCG 
TTTCCAGCCAAGTGGACCTGATCGATGGCCCTCCTGAATTTATCACGATATTTGATTTATTAGCGATGCCCCCTG 
GTTTGTGTGTTACGCACACACACGTGCACACAAGGCTCTGGCTCGCTTCCCTCCCTCGTTTCCAGCTCCTGGGCG 
^J^^^^^^'^^'^^'^'^TCAACTCTCCGCCGAGGGCGAGCAGGAGCGAGAGTGTGTCGAATCTGCGAGTGAAGAGGG 
ACGAGGGAAAAGAAACAAAGCCACAGACGCAACTTGAGACTCCCGCATCCCAAAAGAAGCACCAGATCAGCAAAA 
AAAGAAGATGGGCCCCCCGAGCCTCGTGCTGTGCTTGCTGTCCGCAACTGTGTTCTCCCTGCTGGGTGGAAGCTC 
^^S^^^^^^^^^^^^^^^^^'^^^'^'^^'^^^^^'^GTTTCAGAGGGACCGCAGGAACATCCGCCCCAACATCATCCT 

^°I^^J^^^<5<^acgaccaggatgtggagctgggttccatgcaggtgatgaacaagacccggcgcatcatSag^^ 

GGGCGGGGCGCACTTCATCAACGCCTTCGTGACCACACCOiTGTGCTGCCCCTCACGCTCCTCCATS^ 
^^S^^^^'^^^^'^'^^^^^^^^^^'^^^CCAACAATGAGftACTGCTCCTCGCCCTCCTGGCAGGCACAGCACGA 

^™^H^^^^^°^°^^^^^^'^®^T°'5^^GTGGGTCGGACTCCTTAAAAACTCCCGCTTTTATAACTACAC 

gctctgtcggaacggggtgaaagagaagcacggctccgactactccaaggattacctcacagacctcatS^^ 

I^^^^n^^^*'*=''^'^''"'^^^^^''^^^^<^^'^'^^'5™CCCGCACAGGCCAGTCCTCATGGT^^^ 

agccccccacggccctgaggattcagccccacaatattcacgcctcttcccaaacgcatctcagcacatcacgcc 

™»^o^S^^^^^°^^^^^°c°^^^cgcttgcagaccctcatgtcggtggacgactccatggagacgattta 
^^^^^^^^^^''^'''^'''^'^'^'^^'^'''^^'''^^^^^^^^^tacatcgtatacaccgccgaccacggttaccacatcg^ 
ccagtttggcctggtgaaagggaaatccatgccatatgagtttgacatcagggtcccgttctacgtgaggggccc 
^^^^^^^?^^^C'^^ct<^tctgaatccccacatcgtcctcaacattgacctggcccccaccatcctggacattgc 

rSSS^^SSS^^^'^^^^^'^^^'^^'^^'^^^'^'^^T^CATCCTCAAGCTGCTGGACACGGAGCGGCCGGTC^ 
Arl^^nS^To^^^^^^'^^^^^'^^'^^^^^^^'^'^^C'rCCTTCTTGGTGGAGAGAGGCAAGCTG^^ 

^n^^^^^^^^^^^^'^'^^'^^^'^^^^'^^'^^'^'^GGGGACTACAAGCTCAGCCTGGCCGGACGCCGGAAAAAACTCTT 
GTACCACGTAGGCCTGGGTGATGCCGCCCAGCCCCGAAACCTCACCAAGCGGCACTGGCCAGGGGCCCCTGAGGA 

o™^S^5?^^'^*^^*^^^^'^^^^T^TGCATCCTTTCAGGAAGGGCCTGCAAGAGAAGGACAAGGTGTGGCTGTT 
,^^°S!^5?^^^i^^^^^^<5^G^CTCCGCAAGCTGCTCAAGCGCCTGCAGAACAACGACACGTGCAGCATGCC 
^S^^n^^^^S^^^^^^^^^^^'^^^^^^^^A^'^^CTGGCAGACGGCGCCTTTCTGGACACTGGGGCCTTTCTGTGC 
.^SS^S^^S ^^^^^'^^^^^'^'^^^'f^^'^'^C^TGAGGACCATCAATGAGACTCACAATrTCCTCTT 

^^^^^^^^^^^TCCTCAACCAGCTACACGTACAGCTCATGGAGCTGAGGAGCTGCAAGGGTTACAAGO^GTCT^ 
CCCCCGGACTCGAAACATGGACCTGGATGGAGGAAGCTATGAGCAATACAGGCAGTTTCAGCGTCGAAAGTGGCC 
^^™™^^^^^^^^^^'^^^^*^^<='^<^<^^^^^CTGTGGGAAGGCTGGGAAGGTIAAGAAACAAC^^^ 
GGACCTCCAAAAACATAGAGGCATCACCTGACTGCACAGGCAATGAAAAACCATGTGGGTGATTTCCAGCAGACC 

GCAGGA.:,CAGAGATAACTTCAG6AAGTCCATTTTTGCCCCTGCTTTTGCTTTGGATTATACCTCACCAGCTGCAC 
TTTAGGGGAACCTAATAAGAAATCCCAATTTTCAGGAGTGGTGGTGTCAATAAACGCTCTGTGGCCAGTGTAAA^ 
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FIGURE 20 

MGPPSLVLCLLSATVFSLLGGSSAFLSHHRLKGRFQRDRRNIRPNIILVLTDDQDVELGSMQV 
MNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGKYVHNHNTYTNNENCSSPSWQAQHESRT 
FAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGSDYSK 
DYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSRLFPNASQHITPSYNY 
APNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMETIYNMLVETGELDNTYIVYT 
ADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGCLNPHIVLNIDLAPTILDIAGLDIP 
ADMDGKSILKLLDTERPVNRFHLKKKMRVWRDSFLVERGKLLHKRDNDKVDAQEENFLPKYQR 
VKDLCQRAEYQTACEQLGQKWQCVEDATGKLKLHKCKGPMRLGGSRALSNLVPKYYGQGSEAC 
TCDSGDYKLSLAGRRKKLFKKKYKASYVRSRSIRSVAIEVDGRVYHVGLGDAAQPRNLTKRHW 
PGAPEDQDDKDGGDFSGTGGLPDYSAANPIKVTHRCYILENDTVQCDLDLYKSLQAWKDHKLH 
IDHEIETLQNKIKNLREVRGHLKKKRPEECDCHKISYHTQHKGRLKHRGSSLHPFRKGLQEKD 
KVWLLREQKRKKKLRKLLKRLQNNDTCSMPGLTCFTHDNQHWQTAPFWTLGPFCACTSANNNT 
YWCMRTINETHNFLFCEFATGFLEYFDLNTDPYQLMNAVNTLDRDVLNQLHVQLMELRSCKGY 
KQCNPRTRNMDLDGGSYEQYRQFQRRKWPEMKRPSSKSLGQLWEGWEG 

Important features:
Signal peptide: 

amino acids 1-17 

Sulfatases signature 1. 

amino acids 86-99

Homologous region to sulfatase:

amino acids 87-106, 133-146, 216-229, 291-320, 365-375 
N-glycosylation sites. 

amino acids 65-69, 112-116, 132-136, 149-153, 171-175, 198-202, 
241-245, 561-565, 608-612, 717-721, 754-758, 764-768 
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FIGURE 21 



GGGCGCGCGAGAGCTGCTAGGGCGGTTTCTCTGCCTCGGGCCTGTTGGGCAGGGCCGGCT 

AAGGTGCGCGTGCTCGCTGGTTCTAACCCTTCTGTTGGGCGTTTCTGCTGAGAGGCGGGA 

GGCGCTGAGAGTCTGTGCGGAGGTCCGTGGACAGACTGCTTTGCTCGTTGTTGCTCTTCG 

GAGGCGGCGATCCCCGAAGGCGAGCTGAAATACGGCTGCAGGCTACAATTTGCAGCCGAC 

GATTATGGAAGACGGAAGCGGGAGAGGTGGCCCACCCTCATGGAGCGCTTGTGCTCGGAT 

GGCTTCGCATTTCCCCAATACCCCATTAAACCGTATCATCTGAAGAGGATCCACAGAGCT 

GTCTTACATGGTAATCTAGAGAAACTGAAGTACCTTCTGCTCACGTATTATGACGCCAAT 

AAGAGAGACAGGAAGGAAAGGACCGCCCTACATTTGGCCTGTGCCACTGGCCAACCGGAA 

ATGGTACATCTCCTGGTGTCCAGAAGATGTGAGCTTAACCTCTGCGACCGTGAAGACAGG 

ACACCTCTGATCAAGGCTGTACAACTGAGGCAGGAGGCTTGTGCAACTCTTCTGCTGCAA 

AATGGCGCCAATCCAAATATTACGGATTTCTTTGGAAGGACTGCTCTGCACTACGCTGTG 

TATAATGAAGATACATCCATGATAGAAAAACTTCTTTCACATGGTACAAATATTGAAGAA 

TGCAGCAAGGTASeaSGTCAACCAATGTTATTTTCAAACTATCTGAAATGAATTTATTTTA 

ACATTGACACATGTAAGGGTCAATTTTTCATATTTGGAAGCTCAAACATTCCTTGAATGA 

AAATATTTTGAAATGCCTTAACTGTCTAAGATTTTACTTTAAATATTGGAACTTTTAAAG 

AAGCATTATAGGGAACAGCCTTTTTTCATGCACTTATGGTAAATAACTATAAAAACAAAT 

GAATTACAATAAATTTATAATTCATGACAACTGAATTTGGGAAAGGTAATAGTTAAGTGT 
TTTTCCACTAAATTACTTTTT 
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FIGURE 22 

MERLCSDGFAFPQYPIKPYHLKRIHRAVLHGNLEKLKYLLLTYYDANKRDRKERTALHLACAT 
GQPEMVHLLVSRRCELNLCDREDRTPLIKAVQLRQEACATLLLQNGANPNITDFFGRTALHYA 
VYNEDTSMIEKLLSHGTNIEECSKV 

Important features of the protein:
N-glycosylation site. 

amino acids 113-117 

N-myristoylation site. 

amino acids 109-115 



Microbodies C-terminal targeting signal. 

amino acids 149-153
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FIGURE 23 

GAGGCAGAAAGGCAGAAAGGAGAAAATTCAGGATAACTCTCCTGAGGGGTGAGCCAAGCCCTG 

CCATGTAGTGCACGCAGGACATCAACAAACACAGATAACAGGAAATGATCCATTCCCTGTGGT 

CACTTATTCTAAAGGCCCCAACCTTCAAAGTTCAAGTAGTGATATGGATGACTCCACAGAAAG 

GGAGCAGTCACGCCTTACTTCTTGCCTTAAGAAAAGAGAAGAAATGAAACTGAAGGAGTGTGT 

TTCCATCCTCCCACGGAAGGAAAGCCCCTCTGTCCGATCCTCCAAAGACGGAAAGCTGCTGGC 

TGCAACCTTGCTGCTGGCACTGCTGTCTTGCTGCCTCACGGTGGTGTCTTTCTACCAGGTGGC 

CGCCCTGCAAGGGGACCTGGCCAGCCTCCGGGCAGAGCTGCAGGGCCACCACGCGGAGAAGCT 

GCCAGCAGGAGCAGGAGCCCCCAAGGCCGGCCTGGAGGAAGCTCCAGCTGTCACCGCGGGACT 

GAAAATCTTTGAACCACCAGCTCCAGGAGAAGGCAACTCCAGTCAGAACAGCAGAAATAAGCG 

TGCCGTTCAGGGTCCAGAAGAAACAGTCACTCAAGACTGCTTGCAACTGATTGCAGACAGTGA 

AACACCAACTATACAAAAAGGATCTTACACATTTGTTCCATGGCTTCTCAGCTTTAAAAGGGG 

AAGTGCCCTAGAAGAAAAAGAGAATAAAATATTGGTCAAAGAAACTGGTTACTTTTTTATATA 

TGGTCAGGTTTTATATACTGATAAGACCTACGCCATGGGACATCTAATTCAGAGGAAGAAGGT 

CCATGTCTTTGGGGATGAATTGAGTCTGGTGACTTTGTTTCGATGTATTCAAAATATGCCTGA 

AACACTACCCAATAATTCCTGCTATTCAGCTGGCATTGCAAAACTGGAAGAAGGAGATGAACT 

CCAACTTGCAATACCAAGAGAAAATGCACAAATATCACTGGATGGAGATGTCACATTTTTTGG 

TGCATTGAAACTGCTGTGACCTACTTACACCATGTCTGTAGCTATTTTCCTCCCTTTCTCTGT

ACCTCTAAGAAGAAAGAATCTAACTGAAAATACCAAAAAAAAAAAAAAAA 
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FIGURE 24 



MDDSTEREQSRLTSCLKKREEMKLKECVSILPRKESPSVRSSKDGKLLAATLLLALLSCCLTV
VSFYQVAALQGDLASLRAELQGHHAEKLPAGAGAPKAGLEEAPAVTAGLKIFEPPAPGEGNSS 
QNSRNKRAVQGPEETVTQDCLQLIADSETPTIQKGSYTFVPWLLSFKRGSALEEKENKILVKE 
TGYFFIYGQVLYTDKTYAMGHLIQRKKVHVFGDELSLVTLFRCIQNMPETLPNNSCYSAGIAK 
LEEGDELQLAIPRENAQISLDGDVTFFGALKLL 

Transmembrane domain: 

amino acids 47-72

N-glycosylation site. 

amino acids 124-127, 242-245 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36, 173-176 

N-myristoylation site.
amino acids 96-101 

TNF family proteins. 

amino acids 172-206 
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FIGURE 25 

CTGCTTGGATACCTCCAGTCCCCAAACTGTGTTCCAGGAGTTTTCTTGGCCGAAGCTGCCCGA 

TGTTTGAGCCTTTTCTTCCCAGAGAAGAAGATGGACTGAAAGCTGCCAGTTGGGGACTTTTTG 

TGATCACGGCGTTGCAGCGTTTTAAAGGAGGTGATGGGGCTTGCGCTGGCTTGTCTTCCCACC 

CAAGTGAAGAGTTGATGTTCACTGGTTATGCTTAGACAATGTGCAGTTTGTGTTAATTTAAAA 

TTTTGGGTGGGATAGGGGCATAGGCTTGTGAAGGGCAGTCCGGATCCGGAGGAACTCGTCTTT 

GTCCCTGGTAGGAGAGACACCCCCAGTCTATCCTCGATGCCGTCAGCCTTGGCCATCTTCACT 

TGCCGCCCGAACTCGCACCCGTTTCAGGAGCGTCATGTCTACCTGGACGAGCCCATCAAAATC 

GGCCGCTCAGTGGCCCGCTGTCGACCAGCGCAGAATAATGCCACTTTTGATTGCAAAGTGCTA 

TCAAGGAACCACGCTCTCGTCTGGTTTGATCACAAGACGGGCAAGTTTTATCTTCAAGACACT 

AAAAGTAGTAATGGTACTTTTATAAATAGCCAGAGATTGAGTCGAGGCTCTGAAGAAAGTCCA 

CCATGTGAAATTCTTTCCGGTGACATTATCCAGTTTGGAGTAGACGTGACAGAGAATACACGG 

AAAGTTACCCATGGGTGTATTGTTTCCACAATAAAACTTTTTCTACCAGATGGTATGGAAGCC

CGGCTCCGCTCAGATGTCATCCATGCACCATTACCAAGTCCTGTTGACAAAGTTGCTGCTAAC 

ACTCCAAGTATGTACTCTCAGGAACTATTCCAGCTTTCTCAGTATCTACAGGAGGCCTTACAT 

CGGGAACAAATGTTGGAACAGAAGTTAGCCACGCTTCAGCGGCTACTAGCCATCACCCAAGAG 

GCTTCAGATACCAGTTGGCAGGCTTTAATAGATGAAGATAGACTCTTATCACGGTTAGAAGTT 

ATGGGAAACCAATTACAGGCATGCTCCAAAAATCAAACAGAAGATAGTTTACGAAAGGAACTT 

ATAGCATTACAAGAGGATAAACATAACTATGAGACAACAGCCAAAGAGTCCCTGAGGCGGGTT 

CTTCAGGAGAAAATTGAAGTGGTTAGAAAACTTTCAGAAGTTGAGCGAAGTCTGAGTAATACT 

GAAGATGAATGTACCCATCTGAAAGAAATGAATGAAAGGACTCAGGAAGAATT7VAGAGAATTA 

GCCAACAAATATAATGGAGCAGTTAATGAGATTAAAGATTTATCTGATAAATTAAAGGTAGCA 

GAGGGAAAACAAGAGGAAATCCAACAGAAGGGACAGGCTGAGAAAAAAGAATTACAACATAAA 

ATAGATGAAATGGAAGAAAAAGAACAGGAGCTCCAGGCAAAAATAGAAGCTTTGCAAGCTGAT 

AATGATTTCACCAATGAAAGGCTAACAGCTTTACAAGTACGGTTAGAACATCTTCAGGAGAAA 

ACTCTTAAAGAATGCAGCAGCTTGGCTGATCGTCGAAGGGCATCTAACCAAAGCGGTAGAAGA 

AACAAAGCTTTCAAAAGGTTTGTTTTCTGTTTTTCTATGTTTTTTGACAGTTCTTTTGGATAA 

TGAAGGTTAGTGTATATTTTCAAGGTTATAGTATTTTAACCATCAGTTTACTTCTTATAGCTC 

ACAAAATAGCAAGCCAGTAACAGTATCAGATAATATATAAAATAATCAGACTTCTGTTTTAAG 

AAGGGTATCGTAACTGGAATGTGTCTTTTTAAGTGGATGTATATTTATGGTTTTTTGAATGTT 

AGTACTTGATATAGGTTTCTTTAGGTATTAAAGATTTGTTGCAATCTCTGTCATTCCCAGCAT 

TAATTTCAGCTTTGATCTCAAATTTTAATCAAACACAATGTAAGTCGTTTGTGATACAACTTA 

AGTGAAACATGCTTGCACTTCTATTTTGGGGGTTACAGTACCTTTAAAATCTCTTATGATGTT 

TAATATTTCCTTAATTTTTGGCATCTCAGTTTGATTTAAACAAAATTAATGACTTTTGTGAAT 

GTAGAATCTTCTTATATTTTATGAGTAGTCCAGTAATTGCCCAAAGTAGTTTATTGTGTTAAT 

TCTGTTACAGTTGTCAGAGAAGAAAAGTGAGTTTTAAAGCACCATATTGTCAAGTCACTTTTA 

TACATAGGGAAATTAGGCAAATAAATTTGGTGGCATGTGTTTATCATAGTAGAACTTTCATTA 

GACTATACCAGTATAAAATTTAAAACTAGATTCACAGTCCTTTTGGCCAATTAAAACATTGAG 

TTACAAAAGTTTGAGATACTTAATTTTAGTACATTCTATTTTATTAAAGTAACTGGATTCATT 

TGACTTTTTTAACCATGTAAGAGGATGGTGTTATTTCAAATATCTCGTGGTTTCCATTCTGAA 

TTTTGTGCACGGCAGATGCCATATTTGGGGAAAAAATGCATAGAATATGCATCATTAATATTG 

TTTTGGCAAACAGGCATTGAGTTTCAGAACAGTGAACTATTTTTAGTACATATGGCAATTTTT 

TTCACCTTATTAAAGTGAGATGAGAACAGACCTTAAAATAGCTTTTACCTCACCATCCAAATA 

CCTATTCAGATTAGTTGGTTGAATAGCCAGCACTTTGAAGTAGAGCCTTAGG 




WO 01/40466 PCT/US00/32678

26/550 

FIGURE 26 

MEARLRSDVIHAPLPSPVDKVAANTPSMYSQELFQLSQYLQEALHREQMLEQKLATLQRLLAI 
TQEASDTSWQALIDEDRLLSRLEVMGNQLQACSKNQTEDSLRKELIALQEDKHNYETTAKESL 
RRVLQEKIEVVRKLSEVERSLSNTEDECTHLKEMNERTQEELRELANKYNGAVNEIKDLSDKL 
KVAEGKQEEIQQKGQAEKKELQHKIDEMEEKEQELQAKIEALQADNDFTNERLTALQVRLEHL 
QEKTLKECSSLADRRRASNQSGRRNKAFKRFVFCFSMFFDSSFG 

Important features of the protein:
N-glycosylation sites.

amino acids 98-102, 271-275 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 138-142, 267-271 

Amidation site. 

amino acids 273-277 

Tropomyosins proteins. 

amino acids 169-217 
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FIGURE 27 



GAACCTGGCGCCGCCGGAACTGATCGCGGCCTAGTCCCGACGCGTGTGTGCTAGTGAGCCGGA 

GCCGGCGACGGCGGCAGTGGCGGCCCGGCCTGCAGGAGCCCGACGGGGTCTCTGCCATGGGGG 

AGTGACGCGCCTGCACCCGCTGTTCCGCGGCAGCGGCGAGACATGAGGAGACCCCGCGACAGG 

GGCAGCGGCGGCGGCTCGTGAGCCCCGGGATGGAGGAGAAATACGGCGGGGACGTGCTGGCCG 

GCCCCGGCGGCGGCGGCGGCCTTGGGCCGGTGGACGTACCCAGCGCTCGATTAACAAAATATA 

TTGTGTTACTATGTTTCACTAAATTTTTGAAGGCTGTGGGACTTTTCGAATCATATGATCTCC 

TAAAAGCTGTTCACATTGTTCAGTTCATTTTTATATTAAAACTTGGGACTGCATTTTTTATGG 

TTTTGTTTCAAAAGCCATTTTCTTCTGGGAAAACTATTACCAAACACCAGTGGATCAAAATAT 

TTAAACATGCAGTTGCTGGGTGTATTATTTCACTCTTGTGGTTTTTTGGCCTCACTCTTTGTG 

GACCACTAAGGACTTTGCTGCTATTTGAGCACAGTGATATTGTTGTCATTTCACTACTCAGTG 

TTTTGTTCACCAGTTCTGGAGGAGGACCAGCAAAGACAAGGGGAGCTGCTTTTTTCATTATTG 

CTGTGATCTGTTTATTGCTTTTTGACAATGATGATCTCATGGCTAAAATGGCTGAACACCCTG 

AAGGACATCATGACAGTGCTCTAACTCATATGCTTTACACAGCCATTGCCTTCTTAGGTGTGG 

CAGATCACAAGGGTGGAGTATTATTGCTAGTACTGGCTTTGTGTTGTAAAGTTGGTTTTCATA 

CAGCTTCCAGAAAGCTCTCTGTCGACGTTGGTGGAGCTAAACGTCTTCAAGCTTTATCTCATC 

TTGTTTCTGTGCTTCTCTTGTGCCCATGGGTCATTGTTCTTTCTGTGACAACTGAGAGTAAAG 

TGGAGTCTTGGTTTTCTCTCATTATGCCTTTTGCAACGGTTATCTTTTTTGTCATGATCCTGG 

ATTTCTACGTGGATTCCATTTGTTCAGTCAAAATGGAAGTTTCCAAATGTGCTCGTTATGGAT 

CCTTTCCCATTTTTATTAGTGCTCTCCTTTTTGGAAATTTTTGGACACATCCAATAACAGACC 

AGCTTCGGGCTATGAACAAAGCAGCACACCAGGAGAGCACTGAACACGTCCTGTCTGGAGGAG 

TGGTAGTGAGTGCTATATTCTTCATTTTGTCTGCCAATATCTTATCATCTCCCTCTAAGAGAG 

GACAAAAAGGTACCCTTATTGGATATTCTCCTGAAGGAACACCTCTTTATAACTTCATGGGTG 

ATGCTTTTCAGCATAGCTCTCAATCGATCCCTAGGTTTATTAAGGAATCACTAAAACAAATTC 

TTGAGGAGAGTGACTCTAGGCAGATCTTTTACTTCTTGTGCTTGAATCTGCTTTTTACCTTTG 

TGGAATTATTCTATGGCGTGCTGACCAATAGTCTGGGCCTGATCTCGGATGGATTCCACATGC 

TTTTTGACTGCTCTGCTTTAGTCATGGGACTTTTTGCTGCCCTGATGAGTAGGTGGAAAGCCA 

CTCGGATTTTCTCCTATGGGTACGGCCGAATAGAAATTCTGTCTGGATTTATTAATGGACTTT 

TTCTAATAGTAATAGCGTTTTTTGTGTTTATGGAGTCAGTGGCTAGATTGATTGATCCTCCAG 

AATTAGACACTCACATGTTAACACCAGTCTCAGTTGGAGGGCTGATAGTAAACCTTATTGGTA 

TCTGTGCCTTTAGCCATGCCCATAGCCATGCCCATGGAGCTTCTCAAGGAAGCTGTCACTCAT 

CTGATCACAGCCATTCACACCATATGCATGGACACAGTGACCATGGGCATGGTCACAGCCACG 

GATCTGCGGGTGGAGGCATGAATGCTAACATGAGGGGTGTATTTCTACATGTTTTGGCAGATA 

CACTTGGCAGCATTGGTGTGATCGTATCCACAGTTCTTATAGAGCAGTTTGGATGGTTCATCG 

CTGACCCACTCTGTTCTCTTTCTACTGCTATATTAATATTTCTCAGTGTTGTTCCACTGATTA 

AAGATGCCTGCCAGGTTCTACTCCTGAGATTGCCACCAGAATATGAAAAAGAACTACATATTG 

CTTTAGAAAAGATACAGAAAATTGAAGGATTAATATCATACCGAGACCCTCATTTTTGGCGTC 

ATTCTGCTAGTATTGTGGCAGGAACAATTCATATACAGGTGACATCTGATGTGCTAGAACAAA 

GAATAGTACAGCAGGTTACAGGAATACTTAAAGATGCTGGAGTAAACAATTTAACAATTCAAG 

TGGAAAAGGAGGCATACTTTCAACATATGTCTGGCCTAAGTACTGGATTTCATGATGTTCTGG 

CTATGACAAAACAAATGGAATCCATGAAATACTGCAAAGATGGTACTTACATCATGTGAGATA 

ACTCAAGAATTACCCCTGGAGAATAAACAATGAAGATTAAATGACTCAGTATTTGTAATATTG 

CCAGAAGGATAAAAATTACACATTAACTGTACAGAAACAGAGTTCCCTACTACTGGATCAAGG 

AATCTTTCTTGAAGGAAATTTAAATACAGAATGAAACATTAATGGTAAAAAAAA 
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FIGURE 28 

MEEKYGGDVLAGPGGGGGLGPVDVPSARLTKYIVLLCFTKFLKAVGLFESYDLLKAVHIVQFI 

FILKLGTAFFMVLFQKPFSSGKTITKHQWIKIFKHAVAGCIISLLWFFGLTLCGPLRTLLLFE 

HSDIVVISLLSVLFTSSGGGPAKTRGAAFFIIAVICLLLFDNDDLMAKMAEHPEGHHDSALTH

MLYTAIAFLGVADHKGGVLLLVLALCCKVGFHTASRKLSVDVGGAKRLQALSHLVSVLLLCPW 

VIVLSVTTESKVESWFSLIMPFATVIFFVMILDFYVDSICSVKMEVSKCARYGSFPIFISALL 

FGNFWTHPITDQLRAMNKAAHQESTEHVLSGGVVVSAIFFILSANILSSPSKRGQKGTLIGYS

PEGTPLYNFMGDAFQHSSQSIPRFIKESLKQILEESDSRQIFYFLCLNLLFTFVELFYGVLTN 

SLGLISDGFHMLFDCSALVMGLFAALMSRWKATRIFSYGYGRIEILSGFINGLFLIVIAFFVF 

MESVARLIDPPELDTHMLTPVSVGGLIVNLIGICAFSHAHSHAHGASQGSCHSSDHSHSHHMH 

GHSDHGHGHSHGSAGGGMNANMRGVFLHVLADTLGSIGVIVSTVLIEQFGWFIADPLCSLSTA 

ILIFLSVVPLIKDACQVLLLRLPPEYEKELHIALEKIQKIEGLISYRDPHFWRHSASIVAGTI

HIQVTSDVLEQRIVQQVTGILKDAGVNNLTIQVEKEAYFQHMSGLSTGFHDVLAMTKQMESMK 
YCKDGTYIM

Important features of the protein:
Signal peptide: 
amino acids 1-46 
Transmembrane domains: 

amino acids 59-77, 101-119, 150-167, 205-223, 239-258, 267-284, 
305-324, 343-360, 421-440, 452-469, 486-505, 522-539, 592-612, 
621-641 

N-glycosylation site. 

amino acids 721-725 
Glycosaminoglycan attachment site. 

amino acids 143-147 

cAMP- and cGMP-dependent protein kinase phosphorylation site.
amino acids 225-229 

Tyrosine kinase phosphorylation sites. 

amino acids 750-758, 756-764 
N-myristoylation sites.

amino acids 14-20, 46-52, 102-108, 112-118, 144-150, 317-323,
347-353, 369-375, 372-378, 437-443, 462-468, 529-535, 549-555,
553-559, 579-585, 582-588, 583-589, 584-590, 605-611, 737-743
Multicopper oxidases protein: 
amino acids 561-569 
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FIGURE 29 

GGCACGAGGGCAGGATATTAGAAATGGCTACTCCCCAGTCAATTTTCATCTTTGCAATCTGCA 

TTTTAATGATAACAGAATTAATTCTGGCCTCAAAAAGCTACTATGATATCTTAGGTGTGCCAA 

AATCGGCATCAGAGCGCCAAATCAAGAAGGCCTTTCACAAGTTGGCCATGAAGTACCACCCTG 

ACAAAAATAAGAGCCCGGATGCTGAAGCAAAATTCAGAGAGATTGCAGAAGCATATGAAACAC 

TCTCAGATGCTAATAGACGAAAAGAGTATGATACACTTGGACACAGTGCTTTTACTAGTGGTA 

AAGGACAAAGAGGTAGTGGAAGTTCTTTTGAGCAGTCATTTAACTTCAATTTTGATGACTTAT 

TTAAAGACTTTGGCTTTTTTGGTCAAAACCAAAACACTGGATCCAAGAAGCGTTTTGAAAATC 

ATTTCCAGACACGCCAGGATGGTGGTTCCAGTAGACAAAGGCATCATTTCCAAGAATTTTCTT 

TTGGAGGTGGATTATTTGATGACATGTTTGAAGATATGGAGAAAATGTTTTCTTTTAGTGGTT 

TTGACTCTACCAATCAGCATACAGTACAGACTGAAAATAGATTTCATGGATCTAGCAAGCACT 

GCAGGACTGTCACTCAACGAAGAGGAAATATGGTTACTACATACACTGACTGTTCAGGACAGT 

aSTTCTTATTCTATTCTCACTAAATCCAACTGGTTGACTCTTCCTCATTATCTTTGATGCTAA 

ACAATTTTCTGTGAACTATTTTGACAAGTGCATGATTTCACTTTAAACAATTTGATATAGCTA 

TTAAATATATTTAAGGGTTTTTTTTTTTGACAAATTCAACATTCAACGAGTAGACAAAATGCT 

AATTATTTCCCTGATTAGGAAAGTTTCTTTAAAAAACACGTAATTTTGCCTAGTGCTTTTTCT 

CTACCTGCCCTTGGGCTCACTAATATCACCAGTATTATTACCAAGAAAATATTGAGTTTACCT 

GATTAAACTTTAAAAGTTAATTGTAGATTTAAATTGTGTGAACCTAATGATTTTTGCAGTGAA 

ACCTTTACTAATTCAAAGTTGCATGTTCTATGACATCTGTGACTTGCGTTGCAGAGTGTACAT 

GAAACTGTATAATTGAGTCATTCAGTAAAGGAGAACAGTATCTTGGTTAATTGCTACTGAAAG 

GTTGAGAAAGGAATGGTTTGATATTTACCACAGCGCTGTGCCTTTCTACAGTAGAACTGGGGT 

AAAGGAAATGGTTTTATTGCCCATAGTCATTTAGGCTGGAAAAAAGTTGAAAACTTAACGAAA 

TATTGCCAAGAGATTGTTATGTGTTTGGTTCCAGCCTAAAAATGATTTTGTAGTGTTGAAATC 

ATAGCTACTTACATAGCTTTTTCATATTTCTTTCTTAGTTGTTGGCACTCTTAGGTCTTAGTA 

TGGATTTATGTGTTTGTGTGTGTGTAGTTTATCCTCTCTCTCATCTTTATCTAGAGATTGACT 

GATACCTCATTCTGTTTGTAAAACCAGCCAGTAATTTCTGTGCAACCTTACTATGTGCAATAT 

TTTTAAATCCTGAGAAATGTGTGCTTTTGTTTTCGGATAGACTTATTTCTTTAGTTCTGCACT 

TTTCCACATTATACTCCATATGAGTATTAATCCTATGGATACATATTAAAACAAGTGTCTCAT 
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FIGURE 30 



MATPQSIFIFAICILMITELILASKSYYDILGVPKSASERQIKKAFHKLAMKYHPDKNKSPDA
EAKFREIAEAYETLSDANRRKEYDTLGHSAFTSGKGQRGSGSSFEQSFNFNFDDLFKDFGFFG 
QNQNTGSKKRFENHFQTRQDGGSSRQRHHFQEFSFGGGLFDDMFEDMEKMFSFSGFDSTNQHT 
VQTENRFHGSSKHCRTVTQRRGNMVTTYTDCSGQ 

Important features of the protein:
Signal peptide: 

amino acids 1-23 

Nt-dnaJ domain signature. 

amino acids 27-59, 66-90 

Glycosaminoglycan attachment site. 

amino acids 96-100 

N-myristoylation sites. 

amino acids 32-38, 99-105, 102-108, 126-132, 211-217 
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FIGURE 31 

AAAGTTACATTTTCTCTGGAACTCTCCTAGGCCACTCCCTGCTGATGCAACATCTGGGTTTGG 

GCAGAAAGGAGGGTGCTTCGGAGCCCGCCCTTTCTGAGCTTCCTGGGCCGGCTCTAGAACAAT 

TCAGGCTTCGCTGCGACTCAGACCTCAGCTCCAACATATGCATTCTGAAGAAAGATGGCTGAG 

ATGGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGGTCAAACTGAGTCTACCAAATG 

CAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCTTTTCATGTGGTTTTTCTACGCA 

TTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGCCTGCCCCTCAGAACCTCTCTGTA 

CTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCAGTGATCGCGCCTGGAGAAACAGTG 

TACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCTGTACACGAGCCACATCTGGATCCCC 

AGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTGATGTCACTGATGACATCACGGCCACT 

GTGCCATACAACCTTCGTGTCAGGGCCACATTGGGCTCACAGACCTCAGCCTGGAGCATCCTG 

AAGCATCCCTTTAATAGAAACTCAACCATCCTTACCCGACCTGGGATGGAGATCACCAAAGAT 

GGCTTCCACCTGGTTATTGAGCTGGAGGACCTGGGGCCCCAGTTTGAGTTCCTTGTGGCCTAC 

TGGAGGAGGGAGCCTGGTGCCGAGGAACATGTCAAAATGGTGAGGAGTGGGGGTATTCCAGTG 

CACCTAGAAACCATGGAGCCAGGGGCTGCATACTGTGTGAAGGCCCAGACATTCGTGAAGGCC 

ATTGGGAGGTACAGCGCCTTCAGCCAGACAGAATGTGTGGAGGTGCAAGGAGAGGCCATTCCC 

CTGGTACTGGCCCTGTTTGCCTTTGTTGGCTTCATGCTGATCCTTGTGGTCGTGCCACTGTTC 

GTCTGGAAAATGGGCCGGCTGCTCCAGTACTCCTGTTGCCCCGTGGTGGTCCTCCCAGACACC 

TTGAAAATAACCAATTCACCCCAGAAGTTAATCAGCTGCAGAAGGGAGGAGGTGGATGCCTGT 

GCCACGGCTGTGATGTCTCCTGAGGAACTCCTCAGGGCCTGGATCTCAia^TTTGCGGAAGG 

GCCCAGGTGAAGCCGAGAACCTGGTCTGCATGACATGGAAACCATGAGGGGACAAGTTGTGTT 

TCTGTTTTCCGCCACGGACAAGGGATGAGAGAAGTAGGAAGAGCCTGTTGTCTACAAGTCTAG 

AAGCAACCATCAGAGGCAGGGTGGTTTGTCTAACAGAACACTGACTGAGGCTTAGGGGATGTG 

ACCTCTAGACTGGGGGCTGCCACTTGCTGGCTGAGCAACCCTGGGAAAAGTGACTTCATCCCT 

TCGGTCCTAAGTTTTCTCATCTGTAATGGGGGAATTACCTACACACCTGCTAAACACACACAC 

ACAGAGTCTCTCTCTATATATACACACGTACACATAAATACACCCAGCACTTGCAAGGCTAGA 

GGGAAACTGGTGACACTCTACAGTCTGACTGATTCAGTGTTTCTGGAGAGCAGGACATAAATG 

TATGATGAGAATGATCAAGGACTCTACACACTGGGTGGCTTGGAGAGCCCACTTTCCCAGAAT 

AATCCTTGAGAGAAAAGGAATCATGGGAGCAATGGTGTTGAGTTCACTTCAAGCCCAATGCCG 

GTGCAGAGGGGAATGGCTTAGCGAGCTCTACAGTAGGTGACCTGGAGGAAGGTCACAGCCACA 

CTGAAAATGGGATGTGCATGAACACGGAGGATCCATGAACTACTGTAAAGTGTTGACAGTGTG 

TGCACACTGCAGACAGCAGGTGAAATGTATGTGTGCAATGCGACGAGAATGCAGAAGTCAGTA 

ACATGTGCATGTTTGTTGTGCTCCTTTTTTCTGTTGGTAAAGTACAGAATTCAGCAAATAAAA 

AGGGCCACCCTGGCCAAAAGCGGTAAAAAAAAAAAAAAAA 




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FIGURE 32 

MQTFTMVLEEIWTSLFMWFFYALIPCLLTDEVAILPAPQNLSVLSTNMKHLLMWSPVIAPGET
VYYSVEYQGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITATVPYNLRVRATLGSQTSAWSI 
LKHPFNRNSTILTRPGMEITKDGFHLVIELEDLGPQFEFLVAYWRREPGAEEHVKMVRSGGIP
VHLETMEPGAAYCVKAQTFVKAIGRYSAFSQTECVEVQGEAIPLVLALFAFVGFMLILVVVPL
FVWKMGRLLQYSCCPVVVLPDTLKITNSPQKLISCRREEVDACATAVMSPEELLRAWIS

Important features: 
Signal peptide: 

amino acids 1-29 

Transmembrane domain: 

amino acids 230-255 

N-glycosylation sites.

amino acids 40-44, 134-138 

Tissue factor proteins. 

amino acids 92-120 



Integrins alpha chain proteins.

amino acids 232-263 
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FIGURE 33 



GAGACACGCGAGCGGGGAGACCTCCAAGGCAGCGAGGCATCGGACATGTGTCAGCACATCTGG 

GGCGCACATCCGTCGAGCCCGAGGGGAGATTTGCCGGAAC7U\TTCAAACTGCGATATTGATCT 

TGGGGGTGACTGTCCCTGGCCGGCTGTCGGGTGGGAGTGCGAGTGTGCACTCGCTCGGAAGTG 

TGTGCGAGTGTGTATGTGTGTGTGCCGTGTCGGGCTCCCCCCTTCCCCCCGTTTTCCCGTCGA 

GTGATGCACTTGGAATGAGAATCAGAGGATGGAAATAGTCTGGGAGGTGCTTTTTCTTCTTCA

AGCCAATTTCATCGTCTGCATATCAGCTCAACAGAATTCACCAAAAATCCATGAAGGCTGGTG 

GGCATACAAGGAGGTGGTCCAGGGAAGCTTTGTTCCAGTTCCTTCTTTCTGGGGATTGGTGAA 

CTCAGCTTGGAATCTTTGCTCTGTGGGGAAACGGCAGTCGCCAGTCAACATAGAGACCAGTCA 

CATGATCTTCGACCCCTTTCTGACACCTCTTCGCATCAACACGGGGGGCAGGAAGGTCAGTGG 

GACCATGTACAACACTGGAAGACACGTATCCCTTCGCCTGGACAAGGAGCACTTGGTCAACAT 

ATCTGGAGGGCCCATGACATACAGCCACCGGCTGGAGGAGATCCGACTACACTTTGGGAGTGA 

GGACAGCCAAGGGTCGGAGCACCTCCTCAATGGACAGGCCTTCTCTGGGGAGGTGCAGCTCAT 

CCACTATAACCATGAGCTATATACGAATGTCACAGAAGCTGCAAAGAGTCCAAATGGATTGGT 

GGTAGTTTCTATATTTATAAAAGTTTCTGATTCATCAAACCCATTTCTTAATCGAATGCTCAA 

CAGAGATACTATCACAAGAATAACATATAAAAATGATGCATATTTACTACAGGGGCTTAATAT 

AGAGGAACTATATCCAGAGACCTCTAGTTTCATCACTTACGATGGGTCGATGACTATCCCACC 

CTGCTATGAGACAGCAAGTTGGATCATAATGAACAAACCTGTCTATATAACCAGGATGCAGAT 

GCATTCCTTGCGCCTGCTCAGCCAGAACCAGCCATCTCAGATCTTTCTGAGCATGAGTGACAA 

CTTCAGGCCTGTCCAGCCACTCAACAACCGCTGCATCCGCACCAATATCAACTTCAGTTTACA 

GGGGAAGGACTGTCCAAACAACCGAGCCCAGAAGCTTCAGTATAGAGTAAATGAATGGCTCCT

CAAGTASGGAACAAAGCCAAGAAGAATCCCACCTCAGTGAAATGCTACAACTGTGAATTGACG 

TAACCTAGAATGTCCCCCTTCTTGCTTCTCTCTCCTTCTTTCCCCCAAGCCTCATTCATTCTT 

GGGATTGGCCCTTTCTTCATGAAAAGTGTCTGCGAAACCATGGCAGAGGAATACATCTCTCAC 

ACATACTCACAAACACACACACAAGCACTTGCACATACATACAAACACATGCAAACATACCTA 

CACACACACACTCTCTTACAACCTCCATCATGGGAAGTCAAGTTTCAGAAACAAAAGTCTCAT 

TCATAAGAGGTCTTAGAAGAAAATAACCAGTTAACCTGATTTCAATTTTGATACCGTTTTCCT 

GAACTAATAAATCTACCCAATGAGACTTTTCAGCCTTTGTACATACAAAATTCTTCCAARAGA 

GAGAGGAGAAAATACAGCTCTGATGGCATCAAACGGACTTTGCATCAAGTAATTTCAGATAGT 

GTCCTAGGATCCTTTGAGGGTGCTGGTAGCAGGTGAGCAGGACAAAGTTGACCAAGGACACTT 

ATTTCTAGATTATGATTCTTCTGTTTACTCAACAATTTACAAAGAAAAAAAGGACAGACATTG 

AAGAGCTACACATTGTATATATATCACCACAGACTATAAGGAAATGGAATTATTTCCCTCTTT 

GTCACATATCTGTAGTAGGATTTGCCAAGATCAGAAATGATCCATTTGCTGTTTCTTGTTTTC 

CAAAGGTCATACATTGTGTTTGGTTATTGTTACCAGCTCAATAAATGTGTTTAACGAGTTAAT 

TTCATTTTTCTGGCTTTGGTCTGTTCTCCTTCCTTACAGGCTAAGCCCTGGCTCCATGCAACT 

GCATTCTTTGATTTCACTTGTTCCTTCATCTACATGTTTTGTTCATTTGCAGCCAGTTTTTAC 

TGAGTTTGTGGCAATCAGGAATGCATTTGCTAAGCAAGTATGACTTTAATTCCACTCCATGGC 

TCAATCATTCACATGAGGTGAGCTTCAGCCTGAGATAGCAGGCGACAGACTTCTTGCGTTTCA 

AAACTGCCATGCCCCCCTGTGATGCTCCCGTGAAGGAATGCACTTTGCCTTGTAAGTTCCTGG 

GAAAGGGGTATGTTTTCTCTCCAGGTGCAGCCAGATCTCACAAAGTACAAAACGAATGCCTTT 

GTTTTCTTGTTTATAATGGTCACTCACTGTGTTTGGTTACTGTCAAGAAATCAATAAATGTGT 
TTAACAAGTTA 



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FIGURE 34 



MEIVWEVLFLLQANFIVCISAQQNSPKIHEGWWAYKEVVQGSFVPVPSFWGLVNSAWNLCSVG 
KRQSPVNIETSHMIFDPFLTPLRINTGGRKVSGTMYNTGRHVSLRLDKEHLVNISGGPMTYSH 
RLEEIRLHFGSEDSQGSEHLLNGQAFSGEVQLIHYNHELYTNVTEAAKSPNGLVVVSIFIKVS
DSSNPFLNRMLNRDTITRITYKNDAYLLQGLNIEELYPETSSFITYDGSMTIPPCYETASWII 
MNKPVYITRMQMHSLRLLSQNQPSQIFLSMSDNFRPVQPLNNRCIRTNINFSLQGKDCPNNRA 
QKLQYRVNEWLLK 

Important features: 
Signal peptide: 

amino acids 1-20 

Eukaryotic-type carbonic anhydrases proteins.

amino acids 126-162, 220-269, 43-91 

N-glycosylation sites. 

amino acids 116-119, 168-171, 302-305 
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FIGURE 35 

GTCGGAACCCCCTCAGGCCACCCTCGGGAGTCCTGGGGTCCAGAGGGGTGTCCCTGTACCCCTTGCAC 

ACAGGACCCTCACTCTGCAGGGATAAGCCAGCTGCGCCTGCAGCCTAGGGTGCCAAGGAGGCTGCTGA 

TTGTGGCCCACAGCCTCATCTGAACGCCAGGAGACCAGGATACCGAGGCACCGGATCCCCTCTCTGTG 

CCCTGGGGAGCCCCAGTGCTGCCCAGTCACCCCAGGGCTGAGGTCTGCGTCCCTAGTGGTGCAAGGCC 

TGGTAGGACCACGGGGCAGGGAATGTGAGCGCCATCCGAGCTCACGGTGTCCTGAGTCGCGGCTTCGT 

GACTTTGGCAGGGGCCTCCGGACCAGTGACCCCAGTCAAACCCAGAGGGTCTTGGGCGGCAGCGACGA 

AGGAGGTATTCAGGCTCCAGGCCAGGTGGGGCCGGACGCCCCCAGCCATCCACCATGGTGGTGGCACA 

CCCCACCGCCACTGCCACCACCACGCCCACTGCCACTGTCACGGCCACCGTTGTGATGACCACGGCCA 

CCATGGACCTGCGGGACTGGCTGTTCCTCTGCTACGGGCTCATCGCCTTCCTGACGGAGGTCATCGAC 

AGCACCACCTGCCCCTCGGTGTGCCGCTGCGACAACGGCTTCATCTACTGCAACGACCGGGGACTCAC 

ATCCATCCCCGCAGATATCCCTGATGACGCCACCACCCTCTACCTGCAGAACAACCAGATCAACAACG 

CCGGCATCCCCCAGGACCTCAAGACCAAGGTCAACGTGCAGGTCATCTACCTATACGAGAATGACCTG 

GATGAGTTCCCCATCAACCTGCCCCGCTCCCTCCGGGAGCTGCACCTGCAGGACAACAATGTGCGCAC 

CATTGCCAGGGACTCGCTGGCCCGCATCCCGCTGCTGGAGAAGCTGCACCTGGATGACAACTCCGTGT 

CCACCGTCAGCATTGAGGAGGACGCCTTCGCCGACAGCAAACAGCTCAAGCTGCTCTTCCTGAGCCGG 

AACCACCTGAGCAGCATCCCCTCGGGGCTGCCGCACACGCTGGAGGAGCTGCGGCTGGATGACAACCG 

CATCTCCACCATCCCGCTGCATGCCTTCAAGGGCCTCAACAGCCTGCGGCGCCTGGTGCTGGACGGTA 

ACCTGCTGGCCAACCAGCGCATCGCCGACGACACCTTCAGCCGCCTACAGAACCTCACAGAGCTCTCG 

CTGGTGCGCAATTCGCTGGCCGCGCCACCCCTCAACCTGCCCAGCGCCCACCTGCAGAAGCTCTACCT 

GCAGGACAATGCCATCAGCCACATCCCCTACAACACGCTGGCCAAGATGCGTGAGCTGGAGCGGCTGG 

ACCTGTCCAACAACAACCTGACCACGCTGCCCCGCGGCCTGTTCGACGACCTGGGGAACCTGGCCCAG 

CTGCTGCTCAGGAACAACCCTTGGTTTTGTGGCTGCAACCTCATGTGGCTGCGGGACTGGGTGAAGGC 

ACGGGCGGCCGTGGTCAACGTGCGGGGCCTCATGTGCCAGGGCCCTGAGAAGGTCCGGGGCATGGCCA 

TCAAGGACATTACCAGCGAGATGGACGAGTGTTTTGAGACGGGGCCGCAGGGCGGCGTGGCCAATGCG 

GCTGCCAAGACCACGGCCAGCAACCACGCCTCTGCCACCACGCCCCAGGGTTCCCTGTTTACCCTCAA 

GGCCAAAAGGCCAGGGCTGCGCCTCCCCGACTCCAACATTGACTACCCCATGGCCACGGGTGATGGCG 

CCAAGACCCTGGCCATCCACGTGAAGGCCCTGACGGCAGACTCCATCCGCATCACGTGGAAGGCCACG 

CTCCCCGCCTCCTCTTTCCGGCTCAGTTGGCTGCGCCTGGGCCACAGCCCAGCCGTGGGCTCCATCAC 

GGAGACCTTGGTGCAGGGGGACAAGACAGAGTACCTGCTGACAGCCCTGGAGCCCAAGTCCACCTACA 

TCATCTGCATGGTCACCATGGAGACCAGCAATGCCTATGTAGCTGATGAGACACCCGTGTGTGCCAAG 

GCAGAGACAGCCGACAGCTATGGCCCTACCACCACACTCAACCAGGAGCAGAACGCTGGCCCCATGGC 

GAGCCTGCCCCTGGCGGGCATCATCGGCGGGGCAGTGGCTCTGGTCTTCCTCTTCCTGGTCCTGGGGG 

CCATCTGCTGGTACGTGCACCAGGCTGGCGAGCTGCTGACCCGGGAGAGGGCCTACAACCGGGGCAGC 

AGGAAAAAGGATGACTATATGGAGTCAGGGACCAAGAAGGATAACTCCATCCTGGAAATCCGCGGCCC 

TGGGCTGCAGATGCTGCCCATCAACCCGTACCGCGCCAAAGAGGAGTACGTGGTCCACACTATCTTCC 

CCTCCAACGGCAGCAGCCTCTGCAAGGCCACACACACCATTGGCTACGGCACCACGCGGGGCTACCGG 

GACGGCGGCATCCCCGACATAGACTACTCCTACACA2^TGCCCGCCCACCCGGGCTGCCCCGCCTCA 

GCCCCAGCTGCCCTGGCGTGGCCATGTGGCTTTGCCCAGCCTGCTGCAATCCAAGAGAGCAAGGAAGA 

GAAATTCCATGGGTGACTTTCCTCCGCAGAAAGCAAAGTTTGGGGAGGGCTGACGATTTTGTAGAACA 

CAACAGTGACAATTTTTTTTAAAAGAATAGAAGGCAGGAGGGGGAATTCGACATTGTTGAAGACATAA 

TTTATACCAAGTTATGCCAGTTGGGGAGGGAAGGACTAAAAATAATATTGCAGGCAGGGCTGGGTTGG 

GTTTTTTTTTTTCCCCCCTGAACTGGAAGGATACTACCTGTACAACATCTGTGGACACCTCATGCTCT 

GTTCAAGGCCATCACAAAGGAACCGCCAGGGAGAAGCAGCCGGCTCTCAAAGCTCCCACGCAGCTCTC 

CCGCCACTGGCCACTCGCTGGCGACCCGATGGAAGGTTTTCAGGCTCCTCACAAAGGAGAGAGGGAAG 

AAAAGATCTTTTGCCCTGGAGATATGGTCCTGAAATCTCTCCCCTGGCTTATTCCATACCATTTCCCT 

TGCAGATTTGCAGAAACATGGCATCTTTCACTGCATTCTTTGAACAATCATGTAGTCGATTAAAAAAA 

AAAACAAACTTTTTTTTCCTAGGCTGAAGCCCTCTTCAGTTCCATGCACCACGCTCCGTAGAAGCCCC 

GGCGGAAGCCGTAGCTTTCCCTGCCACCTGGAGGTGCATCTGTCTGCCTGTCTATCCCTGTCGCGGTG 

TCTCTAAGTACAGATGGGTAGATAGAGCCACATGCACGGTCCTTACCGTTCTTCTTGGGTCAGTTCTT

ACCATTTCCTGAACAATAGAATTGTGAAAGTGTTAAAAA 
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FIGURE 36 



MVVAHPTATATTTPTATVTATVVMTTATMDLRDWLFLCYGLIAFLTEVIDSTTCPSVCRCDNG
FIYCNDRGLTSIPADIPDDATTLYLQNNQINNAGIPQDLKTKVNVQVIYLYENDLDEFPINLP 
RSLRELHLQDNNVRTIARDSLARIPLLEKLHLDDNSVSTVSIEEDAFADSKQLKLLFLSRNHL 
SSIPSGLPHTLEELRLDDNRISTIPLHAFKGLNSLRRLVLDGNLLANQRIADDTFSRLQNLTE
LSLVRNSLAAPPLNLPSAHLQKLYLQDNAISHIPYNTLAKMRELERLDLSNNNLTTLPRGLFD
DLGNLAQLLLRNNPWFCGCNLMWLRDWVKARAAVVNVRGLMCQGPEKVRGMAIKDITSEMDEC
FETGPQGGVANAAAKTTASNHASATTPQGSLFTLKAKRPGLRLPDSNIDYPMATGDGAKTLAI
HVKALTADSIRITWKATLPASSFRLSWLRLGHSPAVGSITETLVQGDKTEYLLTALEPKSTYI
ICMVTMETSNAYVADETPVCAKAETADSYGPTTTLNQEQNAGPMASLPLAGIIGGAVALVFLF
LVLGAICWYVHQAGELLTRERAYNRGSRKKDDYMESGTKKDNSILEIRGPGLQMLPINPYRAK
EEYVVHTIFPSNGSSLCKATHTIGYGTTRGYRDGGIPDIDYSYT

Important features of the protein:
Transmembrane domain: 

amino acids 552-573 

N-glycosylation sites. 

amino acids 249-252, 305-308, 642-645 

Leucine zipper pattern.

amino acids 182-203, 299-320 



Phospholipase A2 aspartic acid active site. 

amino acids 57-67 




FIGURE 37 



GGTGACTGAAGCGAGCCTGGCCTCTTGCATCCTCCGCCTGTGTACCTCCCTCCCCTTTTTTTCCGCCT 

TCTGCCAGCAGAAGCAGCAGCCGCAGCACCTGAGCCGCTACTGCCGCTCACTCAGGACAACGCTATGG 

CTGAGCCTGGGCACAGCCACCATCTCTCCGCCAGAGTCAGGAGAAGAACTGAGAGGCGCATACCCCGG 

CTGTGGCGGCTGCTGCTCTGGGCTGGGACCGCCTTCCAGGTGACCCAGGGAACGGGACCGGAGCTTCA 

TGCCTGCAAAGAGTCTGAGTACCACTATGAGTACACGGCGTGTGACAGCACGGGTTCCAGGTGGAGGG 

TCGCCGTGCCGCATACCCCGGGCCTGTGCACCAGCCTGTCTGACCCCGTCAAGGGCACCGAGTGCTCC 

TTCTCCTGCAACGCCGGGGAGTTTCTGGATATGAAGGACCAGTCATGTAAGCCATGCGCTGAGGGCCG 

CTACTCCCTCGGCACAGGCATTCGGTTTGATGAGTGGGATGAGCTGCCCCATGGCTTTGCCAGCCTCT 

CAGCCAACATGGAGCTGGATGACAGTGCTGCTGAGTCCACCGGGAACTGTACTTCGTCCAAGTGGGTT 

CCCCGGGGCGACTACATCGCCTCCAACACGGACGAATGCACAGCCACACTGATGTACGCCGTCAACCT 

GAAGCAATCTGGCACCGTTAACTTCGAATACTACTATCCAGACTCCAGCATCATCTTTGAGTTTTTCG 

TTCAGAATGACCAGTGCCAGCCCAATGCAGATGACTCCAGGTGGATGAAGACCACAGAGAAAGGATGG 

GAATTCCACAGTGTGGAGCTAAATCGAGGCAATAATGTCCTCTATTGGAGAACCACAGCCTTCTCAGT 

ATGGACCAAAGTACCCAAGCCTGTGCTGGTGAGAAACATTGCCATAACAGGGGTGGCCTACACTTCAG 

AATGCTTCCCCTGCAAACCTGGCACGTATGCAGACAAGCAGGGCTCCTCTTTCTGCAAACTTTGCCCft 

GCCAACTCTTATTCAAATAAAGGAGAAACTTCTTGCCACCAGTGTGACCCTGACAAATACTCAGAGAA 

AGGATCTTCTTCCTGTAACGTGCGCCCAGCTTGCACAGACAAAGATTATTTCTACACACACACGGCCT 

GCGATGCCAACGGAGAGACACAACTCATGTACAAATGGGCCAAGCCGAAAATCTGTAGCGAGGACCTT 

GAGGGGGCAGTGAAGCTGCCTGCCTCTGGTGTGAAGACCCACTGCCCACCCTGCAACCCAGGCTTCTT 

CAAAACCAACAACAGCACCTGCCAGCCCTGCCCATATGGTTCCTACTCCAATGGCTCAGACTGTACCC 

GCTGCCCTGCAGGGACTGAACCTGCTGTGGGATTTGAATACAAATGGTGGAACACGCTGCCCACAAAC 

ATGGAAACGACCGTTCTCAGTGGGATCAACTTCGAGTACAAGGGCATGACAGGCTGGGAGGTGGCTGG 

TGATCACATTTACACAGCTGCTGGAGCCTCAGACAATGACTTGATGATTCTCACTCTGGTTGTGCCAG 

GATTTAGACCTCCGCAGTCGGTGATGGCAGACACAGAGAATAAAGAGGTGGCCAGAATCACATTTGTC 

TTTGAGACCCTCTGTTCTGTGAACTGTGAGCTCTACTTCATGGTGGGTGTGAATTCTAGGACCAACAC 

TCCTGTGGAGACGTGGAAAGGTTCCAAAGGCAAACAGTCCTATACCTACATCATTGAGGAGAACACTA 

CCACGAGCTTCACCTGGGCCTTCCAGAGGACCACTTTTCATGAGGCAAGCAGGAAGTACACCAATGAC 

GTTGCCAAGATCTACTCCATCAATGTCACCAATGTTATGAATGGCGTGGCCTCCTACTGCCGTCCCTG 

TGCCCTAGAAGCCTCTGATGTGGGCTCCTCCTGCACCTCTTGTCCTGCTGGTTACTATATTGACCGAG 

ATTCAGGAACCTGCCACTCCTGCCCCCCTAACACAATTCTGAAAGCCCACCAGCCTTATGGTGTCCAG 

GCCTGTGTGCCCTGTGGTCCAGGGACCAAGAACAACAAGATCCACTCTCTGTGCTACAATGATTGCAC 

CTTCTCACGCAACACTCCAACCAGGACTTTCAACTACAACTTCTCCGCTTTGGCAAACACCGTGACTC 

TTGCTGGAGGGCCAAGCTTCACTTCCAAAGGGTTGAAATACTTCCATCACTTTACCCTCAGTCTCTGT 

GGAAACCAGGGTAGGAAAATGTCTGTGTGCACCGACAATGTCACTGACCTCCGGATTCCTGAGGGTGA 

GTCAGGGTTCTCCAAATCTATCACAGCCTACGTCTGCCAGGCAGTCATCATCCCCCCAGAGGTGACAG 

GCTACAAGGCCGGGGTTTCCTCACAGCCTGTCAGCCTTGCTGATCGACTTATTGGGGTGACAACAGAT 

ATGACTCTGGATGGAATCACCTCCCCAGCTGAACTTTTCCACCTGGAGTCCTTGGGAATACCGGACGT 

GATCTTCTTTTATAGGTCCAATGATGTGACCCAGTCCTGCAGTTCTGGGAGATCAACCACCATCCGCG 

TCAGGTGCAGTCCACAGAAAACTGTCCCTGGAAGTTTGCTGCTGCCAGGAACGTGCTCAGATGGGACC 

TGTGATGGCTGCAACTTCCACTTCCTGTGGGAGAGCGCGGCTGCTTGCCCGCTCTGCTCAGTGGCTGA 

CTACCATGCTATCGTCAGCAGCTGTGTGGCTGGGATCCAGANGACTACTTACGTGTGNCGAGAACCCA 

AGCTATGCTCTGGTGGCATTTCTCTGCCTGAGCAGAGAGTCACCATCTGCAAAACCATAGATTTCTGG 

CTGAAAGTGGGCATCTCTGCAGGCACCTGTACTGCCATCCTGCTCACCGTCTTGACCTGCTACTTTTG 

GAAAAAGAATCAAAAACTAGAGTACAAGTACTCCAAGCTGGTGATGAATGCTACTCTCAAGGACTGTG 

ACCTGCCAGCAGCTGACAGCTGCGCCATCATGGAAGGCGAGGATGTAGAGGACGACCTCATCTTTACC 

AGCAAGAAGTCACTTTTTGGGAAGATCAAATCATTTACCTCCAAGAGGACTCCTGATGGATTTGACTC 

AGTGCCGCTGAAGACATCCTCAGGAGGCCCAGACATGGACCTGTAGAGGCACTGCCTGCCTCACCTG 

CCTCCTCACCTTGCATAGCACCTTTGCAAGCCTGCGGCGATTTGGGTGCCAGCATCCTGCAACACCCA 

CTGCTGGAAATCTCTTCATTGTGGCCTTATCAGATGTTTGAATTTCAGATCTTTTTTTATAGAGTACC 

CAAACCCTCCTTTCTGCTTGCCTCAAACCTGCCAAATATACCCACATTTTTTTTTAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAT^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 38 



MAEPGHSHHLSARVRRRTERRIPRLWRLLLWAGTAFQVTQGTGPELHACKESEYHYEYTACDS 

TGSRWRVAVPHTPGLCTSLSDPVKGTECSFSCNAGEFLDMKDQSCKPCAEGRYSLGTGIRFDE 

WDELPHGFASLSANMELDDSAAESTGNCTSSKWVPRGDYIASNTDECTATLMYAVNLKQSGTV 

NFEYYYPDSSIIFEFFVQNDQCQPNADDSRWMKTTEKGWEFHSVELNRGNNVLYWRTTAFSVW 

TKVPKPVLVRNIAITGVAYTSECFPCKPGTYADKQGSSFCKLCPANSYSNKGETSCHQCDPDK 

YSEKGSSSCNVRPACTDKDYFYTHTACDANGETQLMYKWAKPKICSEDLEGAVKLPASGVKTH 

CPPCNPGFFKTNNSTCQPCPYGSYSNGSDCTRCPAGTEPAVGFEYKWWNTLPTNMETTVLSGI 

NFEYKGMTGWEVAGDHIYTAAGASDNDFMILTLVVPGFRPPQSVMADTENKEVARITFVFETL 

CSVNCELYFMVGVNSRTNTPVETWKGSKGKQSYTYIIEENTTTSFTWAFQRTTFHEASRKYTN 

DVAKIYSINVTNVMNGVASYCRPCALEASDVGSSCTSCPAGYYIDRDSGTCHSCPPNTILKAH 

QPYGVQACVPGGPGTKNNKIHSLCYNDCTFSRNTPTRTFNYNFSALANTVTLAGGPSFTSKGL 

KYFHHFTLSLCGNQGRKMSVCTDNVTDLRIPEGESGFSKSITAYVCQAVIIPPEVTGYKAGVS 

SQPVSLADRLIGVTTDMTLDGITSPAELFHLESLGIPDVIFFYRSNDVTQSCSSGRSTTIRVR 

CSPQKTVPGSLLLPGTCSDGTCDGCNFHFLWESAAACPLCSVADYHAIVSSCVAGIQXTTYVX 

REPKLCSGGISLPEQRVTICKTIDFWLKVGISAGTCTAILLTVLTCYFWKKNQKLEYKYSKLV 

MNATLKDCDLPAADSCAIMEGEDVEDDLIFTSKKSLFGKIKSFTSKRTPDGFDSVPLKTSSGG 
PDMDL 



Important features of the protein: 
N-glycosylation sites: 

amino acids 153-156, 390-393, 391-394, 404-407, 544-547, 576-579, 
672-675, 717-720, 947-950 

cAMP- and cGMP-dependent protein kinase phosphorylation sites: 

amino acids 15-18, 563-566, 709-712 
Casein kinase II phosphorylation sites: 

amino acids 42-45, 59-62, 81-84, 146-149, 168-171, 282-285, 
331-334, 340-343, 431-434, 449-452, 465-468, 523-526, 557-560, 
761-764, 780-783, 835-838, 860-863, 893-896, 949-952 
Tyrosine kinase phosphorylation sites: 
amino acids 50-56, 109-116 

N-myristoylation sites: 

amino acids 77-82, 88-93, 152-157, 268-273, 288-293, 320-325, 

400-405, 405-410, 414-419, 463-468, 599-604, 616-621, 634-639, 

644-649, 839-844, 874-879, 912-917, 916-921 

Amidation site: 

amino acids 707-710 

Cell attachment sequence: 

amino acids 162-164 
FIGURE 39 

GGGAAGGGGTTCTGGGCTGCCGCAGGCACACAGGCCAGAGCTTCGTGGATACCTGCAGGGCCC 

AAAGGTCCCTCCCTGTTTTGAAGAGTGAGTGATGGCTATGAGGTAGCGGCCAGGCTGATCACC 

CCTGCGTTGGCTGGAGGCAGAATTCTGTAAATCCTCGCCAAGTCTTTCTCCAGGCCACTGGTT 

AGCTCATCTCAGCCTCCTCTGGGAGCATCAACACCAACATGGCACAGGGGACTGCAGTGGTGT 

GCTTTGGACCTGTGTACCCACCCAAGGCTAAAGGCAGAGCCAGGTGACTTTGCGGGGGTCTCT 

TCTCTAGGATTATCTGTACTTCCCCTCTGTCCTCTTTTACTACGGGAGATCGAGCTAGCTATA 

ACCCACCTTCTTTCATGAGAACCACACTAAATTGCAAAAATTATCCCAGTGCTGGAGGAGGGC 

AGCAGGTTGAGATTATGTTGGCAGGAAGAATGTTGGCATTGATTGGCACGCAGGGGACGAGAG 

CTGCTTTGTGCTTTAAAGGAGCCAAGTTACACCCTGTTTAACCCTGCCTTCAAAGGGACGACT 

CTGTAAGATTCTCTGCTACTTATTCAAGTTGACACGATGCCCTTCACACTCCACCTGAGGTCC 

CGCCTTCCCTCTGCCATAAGGAGTTTGATTCTACAAAAGAAACCAAACATCAGAAATACATCC 

AGCATGGCTGGAGAGCTCCGACCAGCCAGCCTGGTGGTCCTGCCCAGGTCCCTTGCTCCAGCT 

TTTGAAAGATTCTGCCAGGTCAACACTGGTCCTCTACCCCTGCTGGGCCAGAGTGAGCCAGAA 

AAGTGGATGCTGCCCCCTCAAGGTGCTATCTCAGAGACCAGGATGGGCCATCCCCAGTTCTGG 

AAATACGAGTTCGGTGCCTGCACCGGTAGCCTGGCTTCGCTGGAGCAGTACTCGGAGCAGCTG 

AAGGACATGGTGGCCTTCTTCCTGGGCTGCAGCTTCTCCCTGGAGGAGGCCTTGGAGAAAGCG 

GGGCTCCCCAGAAGAGACCCAGCAGGTCACAGCCAGGCGGGTGCATACAAGACAACAGTGCCT 

TGTGTTACCCATGCTGGCTTCTGCTGCCCTCTGGTGGTCACGATGAGGCCCATTCCCAAGGAC 

AAGCTGGAAGGGCTGGTGCGGGCCTGCTGCTCCCTCGGAGGTGAGCAGGGGCAACCTGTTCAC 

ATGGGCGACCCAGAACTGTTGGGAATCAAAGAGCTTTCCAAACCTGCCTACGGGGATGCCATG 

GTGTGTCCCCCAGGGGAGGTTCCAGTGTTCTGGCCTTCTCCGCTGACCAGTCTCGGAGCTGTC 

AGCAGCTGTGAGACCCCACTGGCTTTTGCCAGCATCCCAGGCTGCACAGTTATGACTGACCTG 

AAGGATGCAAAGGCTCCACCTGGTTGTCTCACCCCAGAGAGAATTCCAGAGGTCCATCACATT 

TCCCAAGATCCTCTGCACTACAGCATCGCGTCAGTCTCTGCTTCTCAGAAGATCAGAGAACTA 

GAGTCTATGATCGGCATAGACCCAGGGAACCGGGGGATTGGGCACCTGCTCTGTAAAGATGAG 

CTGCTGAAGGCCTCTCTCTCGCTGTCCCATGCCCGCTCAGTGCTCATCACCACTGGGTTCCCC 

ACACATTTCAATCATGAGCCTCCAGAAGAGACAGATGGCCCACCAGGAGCTGTTGCTCTGGTT 

GCCTTCCTGCAGGCCTTGGAGAAGGAGGTCGCCATAATCGTTGACCAGAGAGCCTGGAACTTG 

CACCAGAAGATTGTTGAAGATGCTGTTGAGCAAGGTGTTCTGAAGACGCAGATCCCGATATTA 

ACTTACCAAGGTGGATCAGTGGAAGCTGCTCAGGCATTCCTGTGCAAAAATGGGGACCCGCAG 

ACACCTAGATTTGACCACCTGGTGGCCATAGAGCGTGCCGGAAGAGCTGCTGATGGCAATTAC 

TACAATGCAAGGAAGATGAACATCAAGCACTTGGTTGACCCCATTGACGATCTTTTTCTTGCT 

GCGAAGAAGATTCCTGGAATCTCATCAACTGGAGTCGGTGATGGAGGCAACGAGCTTGGGATG 

GGTAAAGTCAAGGAGGCTGTGAGGAGGCACATACGGCACGGGGATGTCATCGCCTGCGACGTG 

GAGGCTGACTTTGCCGTCATTGCTGGTGTTTCTAACTGGGGAGGCTATGCCCTGGCCTGCGCA 

CTCTACATCCTGTACTCATGTGCTGTCCACAGTCAGTACCTGAGGAAAGCAGTCGGACCCTCC 

AGGGCACCTGGAGATCAGGCCTGGACTCAGGCCCTCCCGTCGGTCATTAAGGAAGAAAAAATG 

CTGGGCATCTTGGTGCAGCACAAAGTCCGGAGTGGCGTCTCGGGCATCGTGGGCATGGAGGTG 

GATGGGCTGCCCTTCCACAACACCCACGCCGAGATGATCCAGAAGCTGGTGGACGTCACCACG 

GCACAGGTGTAACCGTCCATGTTCCGTGTGAGCAGAGTCCCTACCAACGGGCAGGTCTGCATC 

CGGGGAGAATGCAGCTGCTTCTGGCGACAATCCTGCTAGTAAACACTGGTCTTCGGTGAGCAA 

CGAACACTCGCCTGGCCTGGGAAACTGCATGCCCACTTTCTGGGAGGGGTTAGTGCAGGTGCC 

GTGGACAAAGGACAACATTTCTCTGGGGCTTTTTAACTTTTATTCCTAAGACTCTAAAGGCGT 

TGATTTCAACCCTCCTTCACTCTGGCTTCTTCAGGCAACCCACGTGGTCTCCTATGAGAATCT 

TCTCGACAGTTACTTATGGGGACACTTGTGAACAATTAACTGCCAGGGCAGAGCATGAGAACA 

AACATTCCCAGGCCATGTAGGATAGGATACTCCAGACTCCAGTCATCCTCCCCCATCCATGGT 

TTCTGTTACTCATGGTTTCAGTTACTCATAGCCAACTGCAGACCGAAAATACTAAATGAAAAA 
TTTCAGAAATAAACAACTCTTAAGTTTTAAAAAAAAA 
FIGURE 40 



MPFTLHLRSRLPSAIRSLILQKKPNIRNTSSMAGELRPASLVVLPRSLAPAFERFCQVNTGPL 
PLLGQSEPEKWMLPPQGAISETRMGHPQFWKYEFGACTGSLASLEQYSEQLKDMVAFFLGCSF 
SLEEALEKAGLPRRDPAGHSQAGAYKTTVPCVTHAGFCCPLVVTMRPIPKDKLEGLVRACCSL 
GGEQGQPVHMGDPELLGIKELSKPAYGDAMVCPPGEVPVFWPSPLTSLGAVSSCETPLAFASI 
PGCTVMTDLKDAKAPPGCLTPERIPEVHHISQDPLHYSIASVSASQKIRELESMIGIDPGNRG 
IGHLLCKDELLKASLSLSHARSVLITTGFPTHFNHEPPEETDGPPGAVALVAFLQALEKEVAI 
IVDQRAWNLHQKIVEDAVEQGVLKTQIPILTYQGGSVEAAQAFLCKNGDPQTPRFDHLVAIER 
AGRAADGNYYNARKMNIKHLVDPIDDLFLAAKKIPGISSTGVGDGGNELGMGKVKEAVRRHIR 
HGDVIACDVEADFAVIAGVSNWGGYALACALYILYSCAVHSQYLRKAVGPSRAPGDQAWTQAL 
PSVIKEEKMLGILVQHKVRSGVSGIVGMEVDGLPFHNTHAEMIQKLVDVTTAQV 

Signal peptide: 

amino acids 1-17 



Transmembrane domain: 

amino acids 358-378, 517-539 



N-glycosylation site. 

amino acids 28-32 



Tyrosine kinase phosphorylation site. 

amino acids 444-452 



N-myristoylation site. 

amino acids 98-104, 102-108, 123-129, 149-155, 181-187, 190-196, 
238-244, 308-314, 399-405, 413-419, 448-454, 477-483, 482-488, 
487-493 



Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 233-244, 531-542 




FIGURE 41 

CTTTCCTGTTTTATCCGCAGCCCTTTTCTTCTTTGAGTTAGTAAAGATTTATTCTGTAACCTG 

ACACTCATCTGGCCCTTTGCAGTTTGCCAGCCATATTCCCATGTGATTTCCCACTGGATCCAG 

GCCCCCATCCGGCTGGCAGGAGGGGGCTCTGACGTACAGGTTGGAAATCAGAAGTCTGTGAGA 

GCGCGGGAGTGCATGGCAGCTCTGGGTCCCAGACCTGGCCCGACCCCTCTGCTTCACCTeCAG 

CTCTGCTGCTCCTCTACTCTTGGGTCGAGATCCCTTTGGAGCCACAGCGAGGAACCCTGTGGT 

CCTCAGGCAGGTGTACCTTGAGTCAGCCAGGAGCCCTCTTTTCCTGTGTCAAAGCCTGCCCTC 

GGGCTCTGCTCACCTCTGGTGACCCTCCAAGATGCCCCTGCCCTCAGTTTCCCCTCATGATCT 

GGCCTCTGCCCCCTTCTCTAGCCACAGCCTCTAGTACACTTTAGCAATACCACCAGACTAGTT 

AGAGTTCCCCACTCACCAAGCAAGACATGCAGTTTCATGCCTCTGTGCCTTCGCTCATGCTGT 

TTCTTCCGACTGGAATGCCTTCCCCTGCTCCTCCTGCCTTGTCTGCCTGGCAAGTTCATCTCT 

CACGATCCCCTCAAAGGCCCCCTCCTCCAGGAAGGCAACCCCTGTGCCCCTCCCCTCCAGGCT 

ACCTCTGCACTTTGTCAATGCTTCTCTTGTGGCACTTATCACACTGTATTTTACTTGTTTACA 

TGTTTGTCTCCCCTTCTAGACTGTGAATCCTTAAGGGCATGGACTGTATCTTATGCATCTCTG 

TATTTCTGCGCCTAGCACGGTGCCTAGCACACAGTAGGCGCTCAATAAATGTTGAATGAATGA 
ATGATTT 
FIGURE 42 

MQFHASVPSLMLFLPTGMPSPAPPALSAWQVHLSRSPQRPPPPGRQPLCPSPPGYLCTLSMLL 
LWHLSHCILLVYMFVSPSRL 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 



Microbodies C-terminal targeting signal. 

amino acids 81-83 
FIGURE 43 

GTTTCCAACAAGGATGATATGAAGACTTCCCTGAAGAAAGTTGTGAAGGGACCTCCTACGAGa 

TCATGATGCAGTGTGTGTCCCGCATGTTGGCCCACCCCCTGCATGTCATCTCAATGCGCTGCA 

TGGTCCAGTTTGTGGGACGGGAGGCCAAGTACAGTGGTGTGCTGAGCTCCATTGGGAAGATTT 

TCAAAGAGGAAGGGCTGCTGGGATTCTTCGTTGGATTAATCCCTCACCTCCTGGGCGATGTGG 

TTTTCTTGTGGGGCTGTAACCTGCTGGCCCACTTCATCAATGCCTACCTGGTGGATGACAGCT 

TCAGCCAGGCCCTGGCCATCCGGAGCTATACCAAGTTCGTGATGGGGATTGCAGTGAGCATGC 

TGACCTACCCCTTCCTGCTAGTTGGCGACCTCATGGCTGTGAACAACTGCGGGCTGCAAGCTG 

GGCTCCCCCCTTACTCCCCAGTGTTCAAATCCTGGATTCACTGCTGGAAGTACCTGAGTGTGC 

AGGGCCAGCTCTTCCGAGGCTCCAGCCTGCTTTTCCGCCGGGTGTCATCAGGATCATGCTTTG 

CCCTGGAG^aACCTGAATCATCTAAAAAACACGGTCTCAACCTGGCCACTGTGGGTGAGGCCT 

GACCACCTTGGGACACCTGCAAGACGACTCCAACCCAACAACAACCAGATGTGCTCCAGCCCA 

GCCGGGCTTCAGTTCCATATTTGCCATGTGTCTGTCCAGATGTGGGGTTGAGCGGGGGTGGGG 

CTGCACCCAGTGGATTGGGTCACCCGGCAGACCTAGGGAAGGTGAGGCGAGGTGGGGAGTTGG 

CAGAATCCCCATACCTCGCAGATTTGCTGAGTCTGTCTTGTGCAGAGGGCCAGAGAATGGCTT 

ATGGGGGCCCAGGTTGGATGGGGAAAGGCTAATGGGGTCAGACCCCACCCCGTCTACCCCTCC 

AGTCAGCCCAGCGCCCATCCTGCAGCTCAGCTGGGAGCATCATTCTCCTGCTTTGTACATAGG 

GTGTGGTCCCCTGGCACGTGGCCACCATCATGTCTAGGCCTATGCTAGGAGGCAAATGGCCAG 

GCTCTGCCTGTGTTTTTCTCAACACTACTTTTCTGATATGAGGGCAGCACCTGCCTCTGAATG 

GGAAATCATGCAACTACTCAGAATGTGTCCTCCTCATCTAATGCTCATCTGTTTAATGGTGAT 

GCCTCGCGTACAGGATCTGGTTACCTGTGCAGTTGTGAATACCCAGAGGTTGGGCAGATCAGT 

GTCTCTAGTCCTACCCAGTTTTAAAGTTCATGGTAAGATTTGACCTCATCTCCCGCAAATAAA 
TGTATTGGTGATTTGGAAAAAAAAAAAAAAAAA 
FIGURE 44 



MMMQCVSRMLAHPLHVISMRCMVQFVGREAKYSGVLSSIGKIFKEEGLLGFFVGLIPHLLGDV 
VFLWGCNLLAHFINAYLVDDSFSQALAIRSYTKFVMGIAVSMLTYPFLLVGDLMAVNNCGLQA 
GLPPYSPVFKSWIHCWKYLSVQGQLFRGSSLLFRRVSSGSCFALE 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

Transmembrane domains: 

amino acids 51-72, 97-114 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 160-163 

N-myristoylation sites. 

amino acids 34-39, 100-105, 123-128, 165-170 
FIGURE 45 

GCTCACTCTTTGGGTCCACACTGCCTTTATGAGCTGTAACACTCACTGGGAATGTCTGCAGCT 

TCACTCCTGAAGCCAGCGAGACCACGAACCCACCAGGAGGAACAAACAACTCCAGACGCGCAG 

CCTTAAGAGCTGTAACACTCACCGCGAAGGTCTGCAGCTTCACTCCTGAGCCAGCCAGACCAC 

GAACCCACCAGAAGGAAGAAACTCCAAACACATCCGAACATCAGAAGGAGCAAACTCGTGACA 

CGCCACCTTTAAGAACCGTGACACTCAACGCTAGGGTCCGCGGCTTCATTCTTGAAGTCAGTG 

AGACCAAGAACCCACCAATTCCGGACACGGCAAAGTAACATCCTAGACATGGCTTTAGAGATC 

CACATGTCAGACCCCATGTGCCTCATCGAGAACTTTAATGAGCAGCTGAAGGTTAATCAGGAA 

GCTTTGGAGATCCTGTCTGCCATTACGCAACCTGTAGTTGTGGTAGCGATTGTGGGCCTCTAT 

CGCACTGGCAAATCCTACCTGATGAACAAGCTGGCTGGGAAGAACAAGGGCTTCTCTGTTGCA 

TCTACGGTGCAGTCTCACACCAAGGGAATTTGGATATGGTGTGTGCCTCATCCCAACTGGCCA 

AATCACACATTAGTTCTGCTTGACACCGAGGGCCTGGGAGATGTAGAGAAGGCTGACAACAAG 

AATGATATCCAGATCTTTGCACTGGCACTCTTACTGAGCAGCACCTTTGTGTACAATACTGTG 

AACAAAATTGATCAGGGTGCTATCGACCTACTGCACAATGTGACAGAACTGACAGATCTGCTC 

AAGGCAAGAAACTCACCTGACCTTGACAGGGTTGAAGATCCTGCTGACTCTGCGAGCTTCTTC 

CCAGACTTAGTGTGGACTCTGAGAGATTTCTGCTTAGGCCTGGAAATAGATGGGCAACTTGTC 

ACACCAGATGAATACCTGGAGAATTCCCTAAGGCCAAAGCAAGGTAGTGATCAAAGAGTTCAA 

AATTTCAATTTGCCCCGTCTGTGTATACAGAAGTTCTTTCCAAAAAAGAAATGCTTTATCTTT 

GACTTACCTGCTCACCAAAAAAAGCTTGCCCAACTTGAAACACTGCCTGATGATGAGCTAGAG 

CCTGAATTTGTGCAACAAGTGACAGAATTCTGTTCCTACATCTTTAGCCATTCTATGACCAAG 

ACTCTTCCAGGTGGCATCATGGTCAATGGATCTCGTCTAAAGAACCTGGTGCTGACCTATGTC 

AATGCCATCAGCAGTGGGGATCTGCCTTGCATAGAGAATGCAGTCCTGGCCTTGGCTCAGAGA 

GAGAACTCAGCTGCAGTGCAAAAGGCCATTGCCCACTATGACCAGCAAATGGGCCAGAAAGTG 

CAGCTGCCCATGGAAACCCTCCAGGAGCTGCTGGACCTGCACAGGACCAGTGAGAGGGAGGCC 

ATTGAAGTCTTCATGAAAAACTCTTTCAAGGATGTAGACCAAAGTTTCCAGAAAGAATTGGAG 

ACTCTACTAGATGCAAAACAGAATGACATTTGTAAACGGAACCTGGAAGCATCCTCGGATTAT 

TGCTCGGCTTTACTTAAGGATATTTTTGGTCCTCTAGAAGAAGCAGTGAAGCAGGGAATTTAT 

TCTAAGCCAGGAGGCCATAATCTCTTCATTCAGAAAACAGAAGAACTGAAGGCAAAGTACTAT 

CGGGAGCCTCGGAAAGGAATACAGGCTGAAGAAGTTCTGCAGAAATATTTAAAGTCCAAGGAG 

TCTGTGAGTCATGCAATATTACAGACTGACCAGGCTCTCACAGAGACGGAAAAAAAGAAGAAA 

GAGGCACAAGTGAAAGCAGAAGCTGAAAAGGCTGAAGCGCAAAGGTTGGCGGCGATTCAAAGG 

CAGAACGAGCAAATGATGCAGGAGAGGGAGAGACTCCATCAGGAACAAGTGAGACAAATGGAG 

ATAGCCAAACAAAATTGGCTGGCAGAGCAACAGAAAATGCAGGAACAACAGATGCAGGAACAG 

GCTGCACAGCTCAGCACAACATTCCAAGCTCAAAATAGAAGCCTTCTCAGTGAGCTCCAGCAC 

GCCCAGAGGGCTGTTAATAACGATGATCCATGTGTTTTACTCSAaAGTGCTAAATATGGGAGT 

TTCCTTTTTTTACTCTTTGTCACTGATGACACAACAGAAAAGAAACTGTAGACCTTGGGACAA 
TCAACATTTAAATAAACTTTATAATTATTAAA 
FIGURE 46 

MALEIHMSDPMCLIENFNEQLKVNQEALEILSAITQPVVVVAIVGLYRTGKSYLMNKLAGKNK 
GFSVASTVQSHTKGIWIWCVPHPNWPNHTLVLLDTEGLGDVEKADNKNDIQIFALALLLSSTF 
VYNTVNKIDQGAIDLLHNVTELTDLLKARNSPDLDRVEDPADSASFFPDLVWTLRDFCLGLEI 
DGQLVTPDEYLENSLRPKQGSDQRVQNFNLPRLCIQKFFPKKKCFIFDLPAHQKKLAQLETLP 
DDELEPEFVQQVTEFCSYIFSHSMTKTLPGGIMVNGSRLKNLVLTYVNAISSGDLPCIENAVL 
ALAQRENSAAVQKAIAHYDQQMGQKVQLPMETLQELLDLHRTSEREAIEVFMKNSFKDVDQSF 
QKELETLLDAKQNDICKRNLEASSDYCSALLKDIFGPLEEAVKQGIYSKPGGHNLFIQKTEEL 
KAKYYREPRKGIQAEEVLQKYLKSKESVSHAILQTDQALTETEKKKKEAQVKAEAEKAEAQRL 
AAIQRQNEQMMQERERLHQEQVRQMEIAKQNWLAEQQKMQEQQMQEQAAQLSTTFQAQNRSLL 
SELQHAQRAVNNDDPCVLL 

Important features of the protein: 
Transmembrane domains: 

amino acids 31-49, 114-131 

N-glycosylation sites. 

amino acids 90-94, 144-148, 287-291, 563-567 

N-myristoylation sites. 

amino acids 45-51, 283-289 

Prenyl group binding site. 

amino acids 583-588 

ATP/GTP-binding site motif A (P-loop). 

amino acids 45-53 
FIGURE 47 

CACTCATTCATTCCAAAGGGTCTCTCAAGGCAATGGTAATGTGCAAGGAGGTGATACCTAAAT 

GAATGACCAAAAGAACATGCTTCTGCTTTTGTGTGTCTCCTACATTTTAGACATTTGTTTGTT 

TCTCTTGGTAGCCTTTAAATTCCTTGAAGCCCAGGACCATGTCTCACTTACCTTTGTGTTTCC 

ACTAACTAGTCTACCTCCTGGAATTGGCAGATACTCAGTGAAAGCCTGTGAAATAAGTGATGT 

CTATTTCTAGCATATTATTCTGAGATTTAATGATAGATTTAGTGATTGAATGAGATTTCCATT 

TTCAAATACAGCAAAAGCATAACTATTTTCATTCATTCATATTCATTCAACTTCATTCTCAAA 

ATTAGGTCCTGAGTTAACTAATAATTACCTTTGAAATGTGTGGGTTATTTGAGGCAATCAGGT 

GGTGACATTGAGCTCTCAGCCAGAGTTTGTTTCTGGAATTGATTCAGTTCCATTGCATTGATT 

TTTGTTCTCAGAAGCCAAGGTTTCCCATGAAAAATCATTCCCACTTGAATTGGGCTGTGATTC 

TTGCTGCGTTTAAGTAAAGGAAGCCTCTTGGTTCTAGTTCTGCAAACTTACACACTGAACTGG 

GACAAGTTTTTGTTTAGAGTAATGGCTGGGAAAAGAGGAACCTTTCATTTTATTCAGAAGTCA 

AAAACAAAGGCCTCCCAGCCACCTGGAGATGTTTTGTTGCAGACACCAGCCTGGCTCTGTCTT 

TATGCCTAACAATTGAGCATCCAGTCTTCTTTGTGCTGGGACCATTGCTCAGCTCTGCAAGGG 

GAAAAGAGGGAGAAAGCCAGAGCTGCCAGGCTTCTTGCACTGGGGCCGGGGGAGGGTTCCTGG 

GAAGCAGGTGCTCTCTGGCTTCTTGGTACGTGAGGCTCTCGGAGCTGCCTCTCCTCTGACCCT 

caggtcctcaccgagtttgctccaggagtatattgaaaAcatacccagtgctctctcaagcac 

CCACTGCTTAGAGGGCCCAGATTTCTTTTCCTTCTTTCCCTTGCAGAGCTGGAGACTGCATCG 

ggcatctggtgtttaaactaaacaggaaaactgactaaaggtccacagtgctcattgtgtaga 

CTAGCTGCCCTCCGATGGGTGCTCTGATTATCAGTGGTTCCAGTGCAGGGCCTGTCACTAAAC 

aggcctcacttcctccttgggggctttcccatgggaggtgtggctttttactctacatggaaa 

TGACTCTCTGCAGCCACAGAACACAGTGATTTTCTGAATTATCCCAGTCTCTCATGCGCCCTG 

gattcctccagatgccttatatctcttgtgcaaagttgtctaaaatttggttcccagcttcca 

AGCCTTGCCTTTTGGCCTTCCTGGAAGTATTTTTGTTGATGAGTCGTCTGTCATTATTCTCTA 
AAATGATTTGCTTTTTGTTTCTTTCATTCCTATTTCCACCCCACATATACACACATGCTTCT2 
aACTTAGGGGATTACATGCCAATAAATCTATTGTTGAAAATGCACTAATACTATCGCAAAGAC 
GAAAATTCACAGGCTGAACCGTTGTAAGTCCATATGCTCCTCAACTTACATGTGTGATGGAGT 
TATGCCCAAATAAGTCCATCGTCAAGTTGAAAAATCAAAATCAAGCCATCTTAGGTTGAGGAC 
CATTTGTTTGTACCTCCAAAGATGTCATATCTTTAAACATACTCCCTAGCTTTTCTTTTTACT 
TTTTATTTTGAAGTAATTATAGAATCACAGAAAGTTGCAAAAAA 
FIGURE 48 



MGALIISGSSAGPVTKQASLPPWGLSHGRCGFLLYMEMTLCSHRTQSFSELSQSLMRPGFLQM 
PYISCAKLSKIWFPASKPCLLAFLEVFLLMSRLSLFSKMICFLFLSFLFPPHIYTHAS 

Important features of the protein: 
Signal peptide: 

amino acids 1-41 

Transmembrane domain: 

amino acids 88-107 

Casein kinase II phosphorylation site. 

amino acids 47-50 

N-myristoylation site. 

amino acids 24-29 
FIGURE 49 

GGCTTCTACAGTCCACAACACCCACCAGCCCCAGGCCCAGCAGAATGAGCCCAGTGAGTGCCGGGGCTCCCAGTT 
CTACCCCGTGCGGTGCCTGCTGCCCAGTGCCCATGGCTCTTGTGCAGACTGGGCTGCCCGCTGGTACTTCGTTGC 

ctctgtgggccaatgtaaccgcttctggtatggcggctgccatggcaatgccaataaSttg^ 

GTGCATGAGCAGCTGCCAGGGATCTCTCCATGGGCCCCGTCGTCCCCAGCCTGGGGCTTCTGGAAGGAG^^^ 

fn»^^^^S^™'^^^^'^^^°^^^^*^^^°'^^<^^^C^G<2^CCCAGCCAGCACAGGACAGGGGCCGCGGTGCAGAG 

AAAGCCCTGGCCTTCTGGTGGTCTCTGGCGGCAAGACCAACAGCCTGGGCCAGGGGAGGCCCCCCACACCCAGGC 

^^S^^^S^^^^^C^'^CTCCTCCTACAGATCTCACTTCCCACCTCTCCAGGATTAGCTTGGCAGGTGTGGAGCCCT 
CGTTGGTGCAGGCAGCCCTGGGGCAGTTGGTGCGGCTCTCCTGCTCAGACGACACTGCCCCGGAATCC^GGCTT 

cctggcagaaagatggccagcccatctcctctgacaggcaovggctgSgttS^^^ 
ccctgcaggcagaggacgcgggcacctacagctgtggcagcacccggccaggccgcg^tcSagaJSt^^ 
tccgcattatagggggtgacatggccgtgctgtctgaggctgagctgagccgcttccctcSc^^ 
ctcaggactttggccaagcgggggctgctgggcccctgggggccatcccctcttcacacSaSSS^^ 

tgcgggccagggatgagggctcctacatgtgcagtgcctaccaggggagccagggagtcagccgcagcaccgagg 
ccaactgtgatttgatcctgcaggcccagctttgtggcaatgagtattactccagcttctgctgtgccagctgtt 
catagggctagggagaaaggaagatggactcttggcttcctctctctggctggcaaagggagttatcttctggaa 

^^^JT^S^^^^^^^^ttc^ctttataatttgttacacaggaatagttaaatgcatttgtttgtS^^ 
acggagtttcactcttgttgcccaggctggagggcaatggcgcgatctcagctcactgcaacctccgtctcctgg 

»«^^™^^^^^^^^°^*^*^*^°^^'^^^<^<5CATGAGCCaCCACGCCCAGCCATCAATGCATTTTTTTTATTTT 

ttttttgagacagagtttcgcacttcttgcccaggctggagtacaatggtgcgatcttggctcactgcaacctcc 

S^^^^^^^^^^"^^^°*^^*^°^<^<=°^'^<^T'rTCTCCATGTTGGTCAGACTGGTCTTGAACTCCCGACCTCAGG 

T^I^^^S»^^S^''''*''=^''^^^^^^^^^<^^TTAGAGGTGTGAGCCACTGTGCCCAGCCCATCAA^^ 

J™™»^^°^^^°^^'^'^^^^^'^'^^'^TTAAAGCTGGGCAGGGAGATGTGTAATGATTTCAAAGTTAACACC 

^^^^^^^^^^^'''^^'^'^^^'^'^^^^^''^^^^t^tatcagggaagtattctgtgctaaaatcagcgatS^^ 

I!^9IS^^SJ^^^'''^'^^^^^^'^'^^^'''^^'^^*^'^'^C'^'^CAGTCAGCTCTAAATCTGGTGCAGAGGGT 

^SS^^S^I^^ '^'^^^^'^^'^^'^'^'^^^'^^^"^tcaaatagggatttgggatgaaacagctgcagttagcact 

^^^^^^^^^^^'^'^^^■^T^^'^TCGAGATGTTGGACCTTGAAGCCCTCCTGAGGCCAAC 
^I^^™S!'^^'"*^''^'^^^''^^^'^C^<^^^^^CT'^TGTAAAGCTGACCAGCCTGCTCTGTACAGTGACAAT6^^ 
^'^^^^■^'^^^^^'^^^'^C'^^^TGAAGCAAAATGTTTGCTGCCAAAGACAAATCAGACTGTCAGTCA 

ttaaaaacagcattagcaggatgaggatagcaatggggaagggttgtgggcaatgcagtaacaSSa^ 
cagaaatggtttgagttggaagacaacattcttcatctctcaggacttctaattccttgatS^ 

^^rrS^I^S^^^^'^^^^^^^^^^^^^^C^^TTAACCTTCTACAAATCTTTCA^^^ 

^SS^^'^^^^^^^^^^^^^^^^^^^^'^cgggcattacagggaccaaagctctgaaaggtgacttttattac^^ 
FIGURE 50 

MGPVVPSLGLLEGAPTRMVAAAVLQASRNPASTGQGPRCRESPGLLVVSGGKTNSLGQGRPPT 
PRPLENGHGGRSLGPGPLDWVEMPDHQRHPSTAPPTDLTSHLSRISLAGVEPSLVQAALGQLV 
RLSCSDDTAPESQAAWQKDGQPISSDRHRLQFDGSLIIHPLQAEDAGTYSCGSTRPGRDSQKI 
QLRIIGGDMAVLSEAELSRFPQPRDPAQDFGQAGAAGPLGAIPSSHPQPANRLRLDQNQPRW 
DASPGQRIRMTCRAEGFPPPAIEWQRDGQPVSSPRHQLQPDGSLVISRVAVEDGGFYTCVAFN 
GQDRDQRWVQLRVLGELTISGLPPTVTVPEGDTARLLCWAGESVNIRWSRNGLPVQADGHRV 
HQSPDGTLLIYNLRARDEGSYMCSAYQGSQAVSRSTEVKVVSPAPTAQPRDPGRDCVDQPELA 
NCDLILQAQLCGNEYYSSFCCASCSRFQPHAQPIWQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

Tyrosine kinase phosphorylation site. 

amino acids 392-400 

N-myristoylation sites. 

amino acids 9-15, 50-56, 112-118, 146-152, 173-179, 195-201, 
220-226, 229-235, 280-286, 306-312, 336-342, 397-403 



Myelin P0 protein. 

amino acids 153-182 
FIGURE 51 



CAGGCAGAAGCGAACAAAGACCCAGCAAGAGAAGGCAGAGGCTAAGACCCATCCCGTATCTGC 

TCTCCTGAAATAATTCTGGAGTCATGCCTGAAATGCCAGAGGACATGGAGCAGGAGGAAGTTA 

ACATCCCTAATAGGAGGGTTCTGGTTACTGGTGCCACTGGGCTTCTTGGCAGAGCTGTACACA 

AAGAATTTCAGCAGAATAATTGGCATGCAGTTGGCTGTGGTTTCAGAAGAGCAAGACCAAAAT 

TTGAACAGGTTAATCTGTTGGATTCTAATGCAGTTCATCACATCATTCATGATTTTCAGCCCC 

ATGTTATAGTACATTGTGCAGCAGAGAGAAGACCAGATGTTGTAGAAAATCAGCCAGATGCTG 

CCTCTCAACTTAATGTGGATGCTTCTGGGAATTTAGCAAAGGAAGCAGCTGCTGTTGGAGCAT 

TTCTCATCTACATTAGCTCAGATTATGTATTTGATGGAACAAATCCACCTTACAGAGAGGAAG 

ACATACCAGCTCCCCTAAATTTGTATGGCAAAACAAAATTAGATGGAGAAAAGGCTGTCCTGG 

AGAACAATCTAGGAGCTGCTGTTTTGAGGATTCCTATTCTGTATGGGGAAGTTGAAAAGCTCG 

AAGAAAGTGCTGTGACTGTTATGTTTGATAAAGTGCAGTTCAGCAACAAGTCAGCAAACATGG 

ATCACTGGCAGCAGAGGTTCCCCACACATGTCAAAGATGTGGCCACTGTGTGCCGGCAGCTAG 

CAGAGAAGAGAATGCTGGATCCATCAATTAAGGGAACCTTTCACTGGTCTGGCAATGAACAGA 

TGACTAAGTATGAAATGGCATGTGCAATTGCAGATGCCTTCAACCTCCCCAGCAGTCACTTAA 

GACCTATTACTGACAGCCCTGTCCTAGGAGCACAACGTCCGAGAAATGCTCAGCTTGACTGCT 

CCAAATTGGAGACCTTGGGCATTGGCCT^CGAACACCATTTCGAATTGGAATCAAAGAATCAC 

TTTGGCCTTTCCTCATTGACAAGAGATGGAGACAAACGGTCTTTCATTMTTTATTTGTGTTG 

GGTTCTTTTTTTTTTTAAATGAAAAGTATAGTATGTGGCACTTTTTAAAGAACAAAGGAAATA 

GTTTTGTATGAGTACTTTAATTGTGACTCTTAGGATCTTTCAGGTAAATGATGCTCTTGCACT 

AGTGAAATTGTCTAAAGAAACTAAAGGGCAGTCATGCCCTGTTTGCAGTAATTTTTCTTTTTA 

TCATTTTGTTTGTCCTGGCTAAACTTGGAGTTTGAGTATAGTAAATTATGATCCTTAAATATT 

TGAGAGTCAGGATGAAGCAGATCTGCTGTAGACTTTTCAGATGAAATTGTTCATTCTCGTAAC 

CTCCATATTTTCAGGATTTTTGAAGCTGTTGACCTTTTCATGTTGATTATTTTAAATTGTGTG 

AAATAGTATAAAAATCATTGGTGTTCATTATTTGCTTTGCCTGAGCTCAGATCAAAATGTTTG 

AAGAAAGGAACTTTATTTTTGCAAGTTACGTACAGTTTTTATGCTTGAGATATTTCAACATGT 

TATGTATATTGGAACTTCTACAGCTTGATGCCTCCTGCTTTTATAGCAGTTTATGGGGAGCAC 

TTGAAAGAGCGTGTGTACATGTATTTTTTTTCTAGGCAAACATTGAATGCAAACGTGTATTTT 

TTTAATATAAATATATAACTGTCCTTTTCATCCCATGTTGCCGCTAAGTGATATTTCATATGT 

GTGGTTATACTCATAATAATGGGCCTTGTAAGTCTTTTCACCATTCATGAATAATAATAAATA 

TGTACTGCTGGCATGTAATGCTTAGTTTTCTTGTATTTACTTCTTTTTTTAAATGTTUVGGACC 

AAACTTCTAAACTAATTGTTCTTTTGTTGCTTTAATTTTTAAAAATTACATTCTTCTGATGTA 

ACATGTGATACATACAAAAGAATATAGTTTAATATGTATTGAAATAAAACACAATAAAATT 
FIGURE 52 



MPEMPEDMEQEEVNIPNRRVLVTGATGLLGRAVHKEFQQNNWHAVGCGFRRARPKFEQVNLLD 
SNAVHHIIHDFQPHVIVHCAAERRPDVVENQPDAASQLNVDASGNLAKEAAAVGAFLIYISSD 
YVFDGTNPPYREEDIPAPLNLYGKTKLDGEKAVLENNLGAAVLRIPILYGEVEKLEESAVTVM 
FDKVQFSNKSANMDHWQQRFPTHVKDVATVCRQLAEKRMLDPSIKGTFHWSGNEQMTKYEMAC 
AIADAFNLPSSHLRPITDSPVLGAQRPRNAQLDCSKLETLGIGQRTPFRIGIKESLWPFLIDK 
RWRQTVFH 

Signal peptide: 
amino acids 1-30 

Transmembrane domain: 

amino acids 105-127 

N-glycosylation site. 

amino acids 197-201 

N-myristoylation site. 

amino acids 303-309 



Short-chain dehydrogenases/reductases family proteins. 

amino acids 18-30 
FIGURE 53 

TGGGCTCCCTCCAGCACTGCTGTTGCCTGCTGCCTAAGATGGGTGACACTTGGGCCCAGCTTCCCTGGCCTGGGC 

CACCCCACCCAGCAATCCTGCTGATCTCCCTCCTCTTGGCAGCCGGGTTGATGCACTCGGATGCCGGCACCAGCT 

GCCCCGTCCTTTGCACATGCCGTAACCAGGTGGTGGATTGTAGCAGCCAGCGGCTATTCTCCGTGCCCCCAGACC 

TGCCAATGGACACCCGAAACCTCAGCCTGGCCCACAACCGCATCACAGCAGTGCCGCCTGGCTACCTCACATGCT 

ACATGGAGCTCCAGGTGCTGGATTTGCACAACAACTCCTTAATGGAGCTGCCCCGGGGCCTCTTCCTCCATGCCA 

AGCGCTTGGCACACTTGGACCTGAGCTACAACAATTTCAGCCATGTGCCAGCCGACATGTTCCAGGAGGCCCATG 

GGCTAGTCCACATCGACCTGAGCCACAACCCCTGGCTGCGGAGGGTGCATCCCCAGGCCTTTCAGGGCCTCATGC 

AGCTCCGAGACCTGGACCTCAGTTATGGGGGCCTGGCCTTCCTCAGCCTGGAGGCTCTTGAGGGCCTACCGGGGC 

TGGTGACCCTGCAGATCGGTGGCAATCCCTGGGTGTGTGGCTGCACCATGGAACCCCTGCTGAAGTGGCTGCGAA 

ACCGGATCCAGCGCTGTACAGCAGATTCTCAGCTGGCTGAGTGCCGGGGCCCTCCTGAAGTCGAGGGCGCCCCGC 

TCTTCTCACTCACTGAGGAGAGCTTCAAGGCCTGCCACCTGACCCTGACCCTGGATGATTACCTATTCATTGCGT 

TCGTGGGCTTCGTGGTCTCCATTGCTTCTGTGGCCACCAACTTCCTCCTGGGCATCACTGCCAACTGCTGCCACC 

GCTGGAGCAAGGCCAGTGAAGAGGAAGAGATCTGACATGCCTGCCTCTCATCCCTCCATGCTGCTGACCGCCACA 

GCTGCTGGCCACCAGACGCCCTCCCTGATTGCTCACTCTGGTTCCATGGTGACCTGGCTGCCTCAGTCATGGTTC 

AAGCAAGGTGGGGACACTCATTTTGTATGAGCATCTGCTTTGGGCCAGGCGGCACGCTAGGAATTGGGAACATCA 

GATGAACTGACTCAGTCCCTGCCCTCAAGGCACTTCCCTCTGGTCAAGGAGAGAGATCCAAAAACTATTCCCTTT 

AAGACTATATGTCAGGACTCTGAGCACGTCATTATGGAGGCCCAGAGGAGGAGCCATCATCTGTATCTAGCAATG 

TCCATGAGAATTATAAGATTAGAGTGATTTGTGAACTGGGTCATCAGGAAATATCTACTTTGTCAGGTAGGCAAA 

GAAGGGTGTCTGCACATGGCAGAGGCCAGAATATGCATAGTGTGCTGTGTTGAGAAGAGTGAACAGTTCCTGGTC 

ACTTACTTGTATAGAGGGGGTGTGGCACAGAACTCAAACCTACCCCTCACCTCCTGACACCAAAACTGTCAGCTC 

TCAGCAATGCCAGCCACTGCCTACAGGGAGTAAGAACACCTCTATGACAGCCCCTGGCCTCCTTCCACCAGCAGC 

TACCAGGTGAGACCACCTCCCAGTGACTGCCCCCATATGACCAAATGTCACCAGTTGGTGAGGTCCCAGGCAGCA 

GGCTGAGGATGGACACTTTCAATGCCCTTGCTCCTGCCTCTCACTCAAGTTTTGCTTCAGAAGAGAGAGGCAGGA 

GGCCCAGCAACTGGGGCAGCAAGAGTCCTGGCACCTTGGGATCCTAATCATGTGACTGTTCTTGCCACAGTGCTC 

ATGCCACAGGGTCTCACCAGGAAAGTGCACTGTGGGCCACAGACCCACAGCCTGGCAGCACCCAGAGCTAAAAGG 

GGACAAAGGCAGCACAGTTAT.GACCATATGAGGCTTTGCATTTTCTTCTAAGCAACTTACCCACGTTAAGCATGA 

GGGTGAGAGAGCTATTAAATACTAAGCCCTTGCCAGTGTCAGGTACTTTGAAAAGCTCTCTGCACAAACCATTCC 

CTTTGACACACACACACACAAATCTTTTGAGGTGAACGCTGTTGTTCCCATTTTACGGATGAGGCAACTAAGGCT 

CAGAGAGGTTAAAGTCACATGCCACTATGAGCAAGATAAAGTCTGTGCTCTTTCTACTGCCCCATCCAAGTTGGG 

GAACATCACCATTCCCTCTAGAGTTATATAAATTCAAATTCAACTAGAGCTGACAAAGTTCCTCATAAGGTCCAG 

GCACTCCTCTGGGCACTTTTATATCTATTGACTCACTTCTTTCAATTCTCACAGCAACACTGCCTGGTGGTTTTT 

ATTATCCCCATTTGACAGATGAATTAATCGTAGAGAGTTGAGTGACTTACCCAAGGTTGTCTGGATAAGCCCTAG 

AAGGAAGGCGGTAGGCAGCTCCATTCAGGGAAACTGCATCTAATCAGTCAGTCAAAAATCAAGTAACTTTACGAG 

CAAAGCACAATTATCATCATCGTGGTCTTCTTCATCAGTTTCGTCAGCAGCATCATTATCTTCCCTCTATTTGTT 

CAGCACCGGATAGTTCATGAGTATTTTTGCATCATTCTCCTTGACTTTTCACATCCCTGTGCAGGAGGTAAATCA 

AACATCAGTAATCCTGTTTTACAGATGGGGAAAAAAGTCTCAAGGTTGGATATGACTTGCTATGTGGCAAGGTTG 

GGGCTCAACCCTAACACAGTTCTCTTTCCAGTGCTTTCTCAAGTGCTTGGGGAAGAGAATGCCTCAGAAGGCTGG 

GTAGTGGGGCCCTGGAATTCAGCATCCATGAATGTGCTAGTGGATAAGCTAAATAGAAGGCAGCCAAACCCATCT 

GCTGTACAGATTGAACTATGCTCACGGTAGGGCAAATTGCAGGCTCTGAAACAGAGACTACACAGGTAACACCTG 

AATAGGAGACTCCTGCTTTACAATGTGTAGATAAAACATCAGCAATGGTGGCCATGGTGGCAGTCATGTGAAAAG 

TAAGATCTTTGGGAATCAAGAAAGGAAGCTGTGTTAACCACTCCTGCTCAAGCCCTGCTGCGTGTGTTGCAAGAG 

ATACTAAGAGAGCAAGAAAGCTATAGGTGAGAACCTCTGCAGTTTAGGAGAAGAACATCAAGGCACAGTCCAACA 

TGCTGATAAGTCTGGCCAGGAGGAGAATTAAAACAGGGGCTTTCCACACCTCCCTTGCCCCAAGCTCCAGCGGTA 

TTCTATCAGCCCATCCTCCTGGAAAGCCTGAAAGGAATGAAGGAGGCTAATAAGTCATCTTCCAGGAAGGCATCC 

CTCACTCGTGCTTCCCTGAGCTAGTCAACCAAAAGAGTCTTCAGAAACTTTGCTAGACCTGAAGTACTTGAACCT 

GTGTCCCCTGAATCTTTCTTACAACATCTGGGACAAATCCCTGGTCCTGTGACATCCGAAGCAGAACTGTGCCCT 

GCTCTCTCCTTCTGTGATGACCAAGGATGGTGAACTCAAGTTGTTCTCTACAAGCCAGGCCAGCAACCTAAATAC 

TTGGAGAGGAACTTTTAGAAACTATAATCCTGACAAAATAGAAAAGTTTCCCATAGGGGCATACCATAATACTAT 

AATAACCTCCCAGGAACTATTGTTTGCCAAAATGTAGTTAATATATTTTAAGATATATGCTTTTTTGCATAGGAC 

TAGAACCAGAAAAGACACCAAATGCCCCCTTGACATCAATGTCCTTTCTAGTGGGACAATTTGGTCTCCATTAAT 

GCCAAACCTTTCTGAACAGGATACATGGCTTTTAAAGGACAGATGTTTCTCCTGCTGCTAGAAGTTCCTCAGTTT 

ACTAGAGCACAATGAGGAAAGTATTCAACCTCCCTACTGCCAAGGAATTCCCTGCTTCTCCCCCACCGCCATCAT 

CTTGTCCAAGCTATCAGAAGCAACCTTCTAGAGATAATCTAACAATCCTGATTAGAATTGCTCCCATATCCCTGG 

TGACCACAGGCTTCATTCAAATTGTCCAAACTGGTTAACATGTATGTGATGGGGTATCTCTGCATCTGTATGTCT 

GTCTGCGAGGTTCCTTGTATATTGGCTGTCCGCTGACTTGGGACAGATCTCTCTAGAACTTGGGTTCAGTTCTCT 

GACATAGTCCACTCAGCCAXAGGCTGAGTGGCTAAATATGCATAAATAAGCATGCCTAAATAGGCATATATAGGT 

TGGTGCAAAAGTAATTGCGGTTTTTGCCATTAAAAATGATGGCAAAAATCCCAATTACTTTTGCGTCAATCTAAT 

ATTACATTGCTTGATAGATTAAGATGGAATCCCACCAGGTTTAGGGTAGGACTGGATGCTCAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAA 
FIGURE 54 



MLLISLLLAAGLMHSDAGTSCPVLCTCRNQVVDCSSQRLFSVPPDLPMDTRNLSLAHNRITAV 
PPGYLTCYMELQVLDLHNNSLMELPRGLFLHAKRLAHLDLSYNNFSHVPADMFQEAHGLVHID 



LSHNPWLRRVHPQAFQGLMQLRDLDLSYGGLAFLSLEALEGLPGLVTLQIGGNPWVCGCTMEP 

LLKWLRNRIQRCTADSQLAECRGPPEVEGAPLFSLTEESFKACHLTLTLDDYLFIAFVGFVVS 
IASVATNFLLGITANCCHRWSKASEEEEI 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 241-260 

N-glycosylation sites. 

amino acids 52-55, 81-84, 107-110 

Tyrosine kinase phosphorylation site. 

amino acids 148-154 

N-myristoylation sites. 

amino acids 11-15, 263-268 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 175-185 






Leucine zipper pattern. 

amino acids 77-98 
FIGURE 55 

GGCTGCGCCCAGGCCGGCGGGCCCAGCAGCTGCGAACCGCCGGCGCACCACCTGTTTCCGCGC 

CCGGGGACTTCCCCGGCGGGGCTCAGAAGTGTGGGGTCGGTCGCTTGGCTTCCCCTGGCGTCA 

GCGACCCAGGGTAACCTCCTCCACTGCTGCGTGCCGTGCAGGCCTGCCTGTGTGAGAGCCACG 

TGTGCCGCGCTCTGGGCACAGCCTTGGAAAGTCAGGACCGCGACGGCAGCAGAGCAGAAACCT 

TACAGAAACATGAAGCCCTCAACCATCTGCTACTCAGTTATTCGGGGCTGACGGCGGCTTCTA 

GAACATCCAGGTGTTCTGCAGATGCGAGAACTCATCCTGTAGTCACCAGATGGAGTCCCAAAC 

AGCCAAGCAGATGTAAGGCCTGTGCTGTGGCTCTGAGGCCCTGAATACAGAAGGGTCACTTTC 

TTAGTGGCCAAAGAGCAGTTGTTGACATTGATGTCTAATTATTGAACACGACCAGTCATTTTA 

CTGAGCTGCAGTGAGGAAACACTGACCATAGAAGATCAAGCCAAATGAGGGATTGCAAATTTC 

CTGATTCTTTTGAATTAGGATTCCAGATGGGGGCCTCATTTCTACAGCCCCCAACATTCCTAT 

AGCCGTTATCACTGCCATCACCACTGCCACCAGCATCTTCTTGCAGATTCCACCCCTGCTCCC 

CAGAGACTTCCTGCTTTGAAAGTGAGCAGAAAGGAAGCTCTCAGAAAAATCTCTAGTGGTGGC 

TGCCGTCGCTCCAGACAATCGGAATCCTGCCTTCACCACCATGGGCTGGCTTTTTCTAAAGGT 

TTTGTTGGCGGGAGTGAGTTTCTCAGGATTTCTTTATCCTCTTGTGGATTTTTGCATCAGTGG 

GAAAACAAGAGGACAGAAGCCAAACTTTGTGATTATTTTGGCCGATGACATGGGGTGGGGTGA 

CCTGGGAGCAAACTGGGCAGAAACAAAGGACACTGCCAACCTTGATAAGATGGCTTCGGAGGG 

AATGAGGTTTGTGGATTTCCATGCAGCTGCCTCCACCTGCTCACCCTCCCGGGCTTCCTTGCT 

CACCGGCCGGCTTGGCCTTCGCAATGGAGTCACACGCAACTTTGCAGTCACTTCTGTGGGAGG 

CCTTCCGCTCAACGAGACCACCTTGGCAGAGGTGCTGCAGCAGGCGGGTTACGTCACTGGGAT 

AATAGGCAAATGGCATCTTGGACACCACGGCTCTTATCACCCCAACTTCCGTGGTTTTGATTA 

CTACTTTGGAATCCCATATAGCCATGATATGGGCTGTACTGATACTCCAGGCTACAACCACCC 

TCCTTGTCCAGCGTGTCCACAGGGTGATGGACCATCAAGGAACCTTCAAAGAGACTGTTACAC 

TGACGTGGCCCTCCCTCTTTATGAAAACCTCAACATTGTGGAGCAGCCGGTGAACTTGAGCAG 

CCTTGCCCAGAAGTATGCTGAGAAAGCAACCCAGTTCATCCAGCGTGCAAGCACCAGCGGGAG 

GCCCTTCCTGCTCTATGTGGCTCTGGCCCACATGCACGTGCCCTTACCTGTGACTCAGCTACC 

AGCAGCGCCACGGGGCAGAAGCCTGTATGGTGCAGGGCTCTGGGAGATGGACAGTCTGGTGGG 

CCAGATCAAGGACAAAGTTGACCACACAGTGAAGGAAAACACATTCCTCTGGTTTACAGGAGA 

CAATGGCCCGTGGGCTCAGAAGTGTGAGCTAGCGGGCAGTGTGGGTCCCTTCACTGGATTTTG 

GCAAACTCGTCAAGGGGGAAGTCCAGCCAAGCAGACGACCTGGGAAGGAGGGCACCGGGTCCC 

AGCACTGGCTTACTGGCCTGGCAGAGTTCCAGTTAATGTCACCAGCACTGCCTTGTTAAGCGT 

GCTGGACATTTTTCCAACTGTGGTAGCCCTGGCCCAGGCCAGCTTACCTCAAGGACGGCGCTT 

TGATGGTGTGGACGTCTCCGAGGTGCTCTTTGGCCGGTCACAGCCTGGGCACAGGGTGCTGTT 

CCACCCCAACAGCGGGGCAGCTGGAGAGTTTGGAGCCCTGCAGACTGTCCGCCTGGAGCGTTA 

CAAGGCCTTCTACATTACCGGTGGAGCCAGGGCGTGTGATGGGAGCATGGTGCCTGAGCTGCA 

GCATAAGTTTCCTCTGATTTTCAACCTGGAAGACGATACCGCAGAAGCTGTGCCCCTAGAAAG 

AGGTGGTGCGGAGTACCAGGCTGTGCTGCCCGAGGTCAGAAAGGTTCTTGCAGACGTCCTCCA 

AGACATTGCCAACGACAACATCTCCAGCGCAGATTACACTCAGGACCCTTCAGTAACTCCCTG 

CTGTAATCCCTACCAAATTGCCTGCCGCTGTCAAGCCGCATAACAGACCAATTTTTATTCCAC 

GAGGAGGAGTACCTGGAAATTAGGCAAGTTTGCTTCCAAATTTCATTTTTACCCTCTTTACAA 

ACACACGCTTTAGTTTAGTCTTGGAGTTTAGTTTTGGAGTTAGCCTTGCATATCCCTTCTGTA 

TCCTGTCCCCCCTCCACGCCGACCCGAGAGCAGCTGAGCTGCGCTGGCTCTGGGCAGGGAGTG 

TGCCTTAATGGGAAGCACACGGGCTTTGGAGTCAGGCACAGGTGCCAGCTCCAGCTTTTGAAC 

TTGGGCAATTGTTTAACCTAACCTGCAAGTTGATTTTGAGGGTTAAATAAAGGCATACATGAA 
AATGCCTGGCAACTTTAAAAAAAA2y\AA 
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FIGURE 56 

MGWLFLKVLLAGVSFSGFLYPLVDFCISGKTRGQKPNFVIILADDMGWGDLGANWAETKDTAN

LDKMASEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGVTRNFAVTSVGGLPLNETTLAEVLQ 

QAGYVTGIIGKWHLGHHGSYHPNFRGFDYYFGIPYSHDMGCTDTPGYNHPPCPACPQGDGPSR 

NLQRDCYTDVALPLYENLNIVEQPVNLSSLAQKYAEKATQFIQRASTSGRPFLLYVALAHMHV 

PLPVTQLPAAPRGRSLYGAGLWEMDSLVGQIKDKVDHTVKENTFLWFTGDNGPWAQKCELAGS 

VGPFTGFWQTRQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFPTVVALAQA 

SLPQGRRFDGVDVSEVLFGRSQPGHRVLFHPNSGAAGEFGALQTVRLERYKAFYITGGARACD 

GSMVPELQHKFPLIFNLEDDTAEAVPLERGGAEYQAVLPEVRKVLADVLQDIANDNISSADYT 
QDPSVTPCCNPYQIACRCQAA 

Important features of the protein:
Signal peptide: 

amino acids 1-16 

Transmembrane domain:

amino acids 353-373 

N-glycosylation sites. 

amino acids 117-120, 215-218, 356-359, 497-500
N-myristoylation sites. 

amino acids 12-17, 33-38, 52-57, 97-102, 101-106, 113-118, 158-163,
328-333, 388-393, 418-423, 435-440, 436-441

Amidation site. 

amino acids 382-385 

Sulfatases signature 2. 

amino acids 129-138 
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FIGURE 57 

TGGACAAGACACCTCCAGGAGCCCAGCTCACAGCCACCGGTACCTTCTTCCAGGACAAGCTGG 

GGGCCTCCATGGGCGCCTGAGGGCCAGGCGCCAGGGCCGTGGGCACGAGTATGGTGAGACACC 

AGCCCCTGCAGTACTACGAGCCACAGCTGTGCCTCTCCTGCCTCACGGGCATCTACGGCTGCC 

GTTGGAAGCGCTACCAGCGCTCCCATGATGATACCACACCGGGCACAGCGCCATTCCTGCATG 

TGGGGGCTGTGGCAGCAGTCACCATGCTCTCCTGGATCGTGGCAGGACAGTTCGCCCGTGCAG 

AGCGGACCTCCTCCCAGGTGACCATTCTCTGTACCTTCTTCACCGTGGTGTTTGCCCTCTACC 

TGGCCCCTCTCACCATCTCCTCTCCCTGCATCATGGAGAAGAAAGACCTCGGCCCCAAGCCTG 

CTCTCATTGGCCACCGCGGGGCCCCCATGCTGGCTCCAGAGCACACGCTCATGTCCTTCCGGA 

AGGCCCTCGAGCAGAAGCTGTACGGGCTCCAGGCTGACATTACCATCAGCCTGGACGGCGTGC 

CCTTCCTCATGCATGACACCACCCTGCGGCGCACCACCAACGTGGAGGAGGAGTTCCCGGAGC 

TGGCCCGCAGGCCTGCCTCCATGCTTAACTGGACCACCCTGCAGAGACTCAACGCTGGCCAGT 

GGTTCCTGAAGACTGACCCCTTCTGGACAGCCAGCTCCCTGTCACCCTCCGACCACAGAGAGG 

CCCAGAACCAGTCCATCTGCAGCCTGGCAGAGCTCCTGGAGCTGGCCAAGGGCAATGCCACAC 

TGCTGCTCAACCTGCGTGACCCGCCCCGGGAGCACCCCTACCGCAGCAGTTTTATCAACGTGA

CTCTGGAGGCCGTGCTGCACTCCGGCTTCCCCCAGCACCAGGTCATGTGGCTGCCTAGCAGGC 

AGAGGCCCCTGGTGCGGAAGGTGGCTCCCGGCTTCCAACAGACATCAGGCTCCAAGGAGGCAG 

TCGCCAGCCTGCGGAGAGGCCACATCCAGCGGCTGAACCTGCGCTACACTCAGGTGTCCCGCC 

AGGAGCTCAGGGACTACGCGTCCTGGAACCTGAGTGTGAACCTCTACACAGTCAACGCACCGT 

GGCTCTTCTCCCTGCTGTGGTGTGCGGGGGTCCCATCCGTCACCTCTGACAACTCCCACACCC 

TGTCCCAGGTGCCTTCCCCCCTCTGGATCATGCCCCCGGACGAGTACTGTCTCATGTGGGTCA 

CTGCCGACCTGGTCTCCTTCACCCTCATCGTGGGCATCTTCGTGCTCCAGAAGTGGCGCCTGG 

GTGGCATACGGAGCTACAACCCTGAGCAGATCATGCTGAGTGCTGCGGTGCGCCGGACCAGCC 

GGGACGTCAGCATCATGAAGGAGAAGCTTATTTTCTCAGAGATCAGCGATGGTGTAGAGGTCT 

CCGATGTGCTCTCCGTATGTTCAGACAACAGTTATGACACATATGCCAACAGCACCGCCACCC 

CTGTGGGCCCCCGAGGGGGTGGCAGCCACACCAAGACCCTCATAGAGCGGAGTGGGCGTTAGC

TGAAGACATGTCTGTCCCACCTGTACCTGACACAGAAGCTGGGGAGCCTAGGAGAGCTGGTGG 

AAGTGTGTCTGAACTCGGAGTGCTCTGGGAGCGGGCTCCACAGCCTCCTTGTGGGCTCCAGCC 

CCTTGTCAGCCGCAGCCTCTCTTGAGGGGGACTCCCTGTCTCCTGAGGCCCAGCTGGGCCAGG 

ACTCCATCCTTTCAGATGCCCCTGCAGGCCTGGGGCTCCTTCTGGGAAGTATGGGGCCTAGGG 

CTTGGTCCCCCTCTTCTGAGGCCCTCTCCTGTATCCCGACCTGGAAGCTTTGATGGGTCATGG 

GCCATGCCATACCCCCTGTGGCAATGGAGTGTGTGGATGCTCACCTGTGCCATCTGTCCTCCT 

GTCTGTGCCAGGAGGCACCTGAGTTCTCTGCTGTTATCCTGCCCCAAGGGCCTGGGCCGAGCC 

TCTACCTGAAGCAACTCTGCTCTTCCTGTCAGTCTCAAAGCACAAGGAGGTTCAGCCCAGGAG 

GAAGCCAGCTGCAATGTGGAGACACGTCCTCCTCCCCAACCCACCTCATGCCACCGCCAACCC 

CCTGCCCCAGGAGCGGGCCTGAGCCACGTCCCCTAGGAGCAGCTGGAGATGGCCAAAAGAGTG 

AGCTCAGGACTACTGGATCCCATGCCCAGGTGTCCAGCAGACCTCAAGGCAGAAGGGTCACCT 

AACCCAGGAGTCCACAGACTGATGTGACCTCAGGTTCCCACATCAGTGGCCACAGGGCAGGGC 

CCACCTGGTAGAAGTGTTCTGGATATGGCCAGGGTGGGTGTGTGGCTAAGTGGGCCTGAACAG 

AGGGAACCTAGGGCCCTTGGCCAATGTGATTAAAGCTGCCATCTTGAAA 
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FIGURE 58 

MVRHQPLQYYEPQLCLSCLTGIYGCRWKRYQRSHDDTTPGTAPFLHVGAVAAVTMLSWIVAGQ 
FARAERTSSQVTILCTFFTVVFALYLAPLTISSPCIMEKKDLGPKPALIGHRGAPMLAPEHTL
MSFRKALEQKLYGLQADITISLDGVPFLMHDTTLRRTTNVEEEFPELARRPASMLNWTTLQRL 
NAGQWFLKTDPFWTASSLSPSDHREAQNQSICSLAELLELAKGNATLLLNLRDPPREHPYRSS 
FINVTLEAVLHSGFPQHQVMWLPSRQRPLVRKVAPGFQQTSGSKEAVASLRRGHIQRLNLRYT 
QVSRQELRDYASWNLSVNLYTVNAPWLFSLLWCAGVPSVTSDNSHTLSQVPSPLWIMPPDEYC 
LMWVTADLVSFTLIVGIFVLQKWRLGGIRSYNPEQIMLSAAVRRTSRDVSIMKEKLIFSEISD 
GVEVSDVLSVCSDNSYDTYANSTATPVGPRGGGSHTKTLIERSGR 

Important features of the protein:
Signal peptide: 

amino acids 1-24 

Transmembrane domains:

amino acids 47-61, 77-93, 335-350, 380-399 
N-glycosylation sites. 

amino acids 182-186, 217-221, 233-237, 255-259, 329-333, 462-466 

Tyrosine kinase phosphorylation site. 

amino acids 130-139 

N-myristoylation sites. 

amino acids 21-27, 48-54, 294-300, 404-410, 442-448, 473-479 
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FIGURE 59 

CCTGAGCAAACACAGCAGCCCGAGTGTTCCCAAGGCCAAAATGCTGAGAACGTCCACTCCTAA 

TCTGTGTGGTGGTCTGCATTGCCGGGCCCCCTGGCTCTCTTCTGGCATTCTCTGCCTCTGCCT 

CATATTCTTGTTAGGCCAGGTGGGCTTGCTGCAGGGACACCCCCAGTGCCTGGATTACGGGCC 

CCCTTTCCAGCCCCCTCTGCACCTTGAGTTTTGCTCTGACTATGAGTCCTTCGGCTGCTGTGA 

TCAGCACAAGGACCGCCGCATCGCTGCCCGGTACTGGGACATCATGGAATATTTTGATCTGAA 

GAGACATGAGCTGTGTGGAGATTACATTAAAGACATCCTTTGCCAGGAGTGCTCGCCCTACGC 

AGCCCACCTCTACGACGCCGAAAACACCCAGACGCCTCTCCGGAATCTCCCGGGCCTCTGCTC 

TGATTACTGCTCTGCCTTCCATTCTAACTGTCACTCAGCCATTTCCCTGCTGACCAATGACCG 

CGGCCTCCAGGAGTCTCATGGAAGGGACGGTACCCGCTTCTGCCACCTCCTGGACCTTCCTGA 

CAAGGACTATTGCTTCCCTAATGTCCTGAGGAACGACTATCTCAACCGCCACCTGGGCATGGT 

GGCCCAAGATCCTCAGGGCTGCCTGCAGCTCTGCCTGAGCGAGGTGGCCAACGGGCTGAGGAA 

CCCCGTCTCCATGGTCCATGCTGGGGACGGCACCCATCGCTTCTTTGTTGCCGAGCAGGTAGG 

AGTGGTGTGGGTCTACCTCCCTGATGGGAGTCGCCTGGAGCAACCCTTCCTGGACCTCAAGAA 

CATCGTGTTGACCACCCCATGGATCGGGGATGAGAGAGGCTTCTTGGGGTTGGCTTTTCACCC 

CAAATTCCGCCACAATCGCAAGTTCTATATTTATTATTCGTGCCTGGACAAGAAGAAGGTAGA 

AAAGATCCGAATTAGTGAGATGAAGGTTTCTCGGGCTGATCCTAACAAAGCTGACCTGAAATC 

AGAGAGGGTCATCTTGGAGATTGAAGAACCAGCCTCAAACCATAATGGCGGACAACTTCTTTT 

TGGCCTGGATGGCTATATGTACATATTCACTGGGGACGGGGGACAGGCTGGAGATCCCTTTGG 

CCTGTTTGGAAATGCTCAGAACAAAAGTTCCCTGCTGGGAAAAGTTTTAAGGATCGATGTGAA 

CAGGGCAGGCTCACATGGCAAGCGGTACCGAGTCCCCTCGGACAATCCATTTGTTTCTGAGCC 

AGGGGCCCACCCCGCCATCTATGCCTATGGGATCAGGAACATGTGGCGTTGTGCTGTGGACCG 

AGGGGACCCCATCACGCGCCAGGGCCGAGGCCGGATATTCTGTGGGGACGTGGGCCAGAACAG 

GTTTGAAGAGGTTGACCTCATTTTGAAAGGTGGAAACTATGGCTGGAGAGCAAAGGAAGGGTT 

TGCATGTTATGACAAAAAACTTTGTCACAATGCCTCTTTGGATGATGTTCTGCCAATCTATGC

TTATGGCCATGCAGTGGGGAAGTCAGTCACTGGAGGTTATGTCTATCGTGGTTGTGAATCCCC 

AAATCTCAATGGCCTGTATATCTTTGGAGACTTCATGAGTGGTCGACTTATGGCTTTGCAGGA 

AGATAGAAAAAACAAGAAATGGAAGAAGCAGGATCTTTGCCTGGGCAGCACCACGTCCTGTGC 

CTTCCCAGGGCTGATCAGCACCCATAGCAAGTTCATCATCTCCTTTGCTGAAGATGAAGCAGG 

GGAGCTGTATTTCCTGGCGACCTCTTACCCAAGTGCCTATGCACCACGTGGATCTATTTACAA 

GTTTGTTGACCCCTCAAGGCGAGCACCCCCAGGCAAGTGCAAATACAAGCCAGTGCCCGTGAG 

AACCAAGAGTAAGCGGATCCCGTTCAGACCACTCGCCAAGACAGTCTTGGACTTGCTAAAGGA 

ACAATCAGAGAAAGCTGCTAGAAAATCTTCCAGTGCAACCTTAGCTTCTGGCCCAGCCCAGGG 

TTTGTCTGAGAAAGGCTCCTCCAAGAAGCTGGCTTCTCCTACAAGCAGCAAGAATACATTGCG 

AGGGCCTGGTACAAAGAAGAAAGCCAGAGTGGGGCCCCACGTCCGCCAGGGCAAGAGGAGGAA 

GAGCCTGAAAAGCCACAGTGGCAGGATGAGGCCATCAGCAGAGCAGAAGCGAGCTGGCAGAAG 

TCTCCCTTGACCTATTGGTCAAGGTGGCCGACAGGGTGACGTGAGAGAGGAGAGCCACCTCAT

CAAATGAAAGTCACTGCTGAATAAAGACCTTAGAAGTCTGGGAAGCCAGGGTAGAGGTGGGGC 

AGGGCGGTTTTCCTCTCCCTGGGAAATCTTGCTGTCTACTGAATAAATAAATGCACCTTCTCT 

GTATGCAGTGCTTCTGTGGGAGACCATATCCCAGATTGCTGGTGCACCTGGGTTATGGTAAGC

ACTAGTCCATGAGCCTGCTTGGAATCACACTGGATGTCTCCGTTTTGTCTTGTAAATGCCTAC 

AACCTGAGGTAATAAATCAACATTTGCTCA 
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FIGURE 60 

MLRTSTPNLCGGLHCRAPWLSSGILCLCLIFLLGQVGLLQGHPQCLDYGPPFQPPLHLEFCSD 
YESFGCCDQHKDRRIAARYWDIMEYFDLKRHELCGDYIKDILCQECSPYAAHLYDAENTQTPL 
RNLPGLCSDYCSAFHSNCHSAISLLTNDRGLQESHGRDGTRFCHLLDLPDKDYCFPNVLRNDY 
LNRHLGMVAQDPQGCLQLCLSEVANGLRNPVSMVHAGDGTHRFFVAEQVGVVWVYLPDGSRLE
QPFLDLKNIVLTTPWIGDERGFLGLAFHPKFRHNRKFYIYYSCLDKKKVEKIRISEMKVSRAD 
PNKADLKSERVILEIEEPASNHNGGQLLFGLDGYMYIFTGDGGQAGDPFGLFGNAQNKSSLLG 
KVLRIDVNRAGSHGKRYRVPSDNPFVSEPGAHPAIYAYGIRNMWRCAVDRGDPITRQGRGRIF
CGDVGQNRFEEVDLILKGGNYGWRAKEGFACYDKKLCHNASLDDVLPIYAYGHAVGKSVTGGY 
VYRGCESPNLNGLYIFGDFMSGRLMALQEDRKNKKWKKQDLCLGSTTSCAFPGLISTHSKFII 
SFAEDEAGELYFLATSYPSAYAPRGSIYKFVDPSRRAPPGKCKYKPVPVRTKSKRIPFRPLAK 
TVLDLLKEQSEKAARKSSSATLASGPAQGLSEKGSSKKLASPTSSKNTLRGPGTKKKARVGPH 
VRQGKRRKSLKSHSGRMRPSAEQKRAGRSLP 

Important features of the protein:
Signal peptide: 

amino acids 1-41
Transmembrane domain:
amino acids 17-36 
N-glycosylation sites. 

amino acids 372-376, 480-484 

cAMP- and cGMP-dependent protein kinase phosphorylation site.
amino acids 645-649, 699-703
Tyrosine kinase phosphorylation site.
amino acids 81-89
N-myristoylation sites.

amino acids 11-17, 37-43, 156-162, 165-171, 357-363, 365-371, 

368-374, 408-414, 459-465, 548-554, 557-563 

Amidation sites. 

amino acids 391-395, 696-700 

Cell attachment sequence. 

amino acids 428-431 

Leucine zipper pattern. 

amino acids 25-47 
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FIGURE 61 



CTCCATTAAACCACCACCAGCTCCCCAAGCCACCCCTTCAGCCATGAAGTTCCTGCTCCTGGT 

CTTGGCAGCCCTCGGATTCCTGACCCAGGTGATCCCAGCCAGTGCAGGTGGGTCAAAATGTGT 

GAGTAACACCCCAGGATACTGCAGGACATGTTGCCACTGGGGGGAGACAGCATTGTTCATGTG 

CAACGCTTCCAGAAAATGCTGCATCAGCTACTCCTTCCTGCCGAAGCCTGACCTACCACAGCT 

CATCGGTAACCACTGGCAATCAAGGAGAAGAAACACACAAAGGAAAGACAAGAAGCAACAAAC 

GACCGTAACATCATAATAACCACTGCTATCGCCTCCACCAACTCAGAGAAATATCATTTCCAC 

AGTTCCAATTCCTCCTACATTGCTGAGTACTAGCCAAGGCTCCTCTTTATGGGGCAGATATCT 

ATAGCCAACCCCAAAACTTCTGTCTTCTATCATTCTGTCATTCATCTAGTAACTAATTTGGAG 

TTTGTATCTATCTTACGAGAACAATCATCATGCAGATTCGTCCACAGGGGATCTGTCAGTTTG 

GGTCCTCCAAATGAAAAATGTCAAGACAGAATTGGACATGCAAAAGATTGACTGGGAGAACAC 

ACCTCTGATGGACAAAGGTGAGACAGAGCAGCCACAGGCAGGGAGAGCCTTCAGACTGCAACG 

CTGGCCTGATACGTGTCAAAGGAGAGAGGGATAGAGGAGGATTGAATAGAAGGAGACTAAGAC 

TGCAGCTCTAAGAAAGTCTCAGCCAAACAGATGGGGAGGCCCAAAGCAAGGCTTGCCCCTCAG 

AGGAGCTCACGCAGGGCAGGAATAGCCAGGTTCTCATATCCCAGGGGTTCAGACTTGGCTGAG 

AACAGCCCCTGGAGAACATGGGGTGACTGCTACCATAGGTCTGGAAGTATGAGGCTGTCCACC 

AACTATCCCCTTGAAGCAAGTTCTCTTGAAAGGAAATCTAAACAGTGCACCCCCATGGCTGCC 

ACGGAGTATAAGGAGGGAGAGAAAGGAGCTGAAAGTCTAGGTTTGGCCAGCTAGGTAGACTGA 

CTTGTGAGGTATTTATTTATTCATTTGAGTT^CAAAGCAGACAGAATACATAGCCACCATTGG 

TAGTACACCCCAAAAGCAAGGATGGCATGATGCTGGTGACTCAAACGTGCCTACTCATGGTGT 

CAAATTGGCATAATCCTCTTGGGAAGCTGTGTGGAAATAAGCACAGAGAAGCAGAACTCTAAT 

TGCTTAATCCACTAAACATTACTTCTGGGAATTGGCTCATCATAAATTATCCAAGAGAAAGCA 

CAAAGTTATGGGCACAAAGGTTTTCCATATAATATTATTTAAAATGCTGAGAAAATGAAAAAA 

TCTAAATGGTGAAATATATACTAATGCCATCTATAAATACAAACAAATAGAATGTTTATAGAA 

TAATGGAACATAATAACATTATTCAAAATTGCATTTATGCTATAGTTGTCAAAATTGTCTCCT 

TATATGATACAAAACTCATGAAAATTATGACTTTTTTGTTTGGTTGGAAAGCAGAATTATGCA 

TAAATTTCCTCTTACAGTTCGATGCCCATTAGTTTTATATAACATTTATTTGACACGTACTGA 

CTTCTATCTGAGAAGAACAAACCAAAACACTCAGGCCTAAATAATTAAAAACGGTCCTAAAAA 

CTAGCAAACCAGATAAGAAAAGATGTTAATGCCCATTCCCTAACTTATGTCTTAGACCAAAAT 

TAATTCTAGATGGTTTTAAAATGACAGTGTAAAAGTAAAGTATTAAAAGATTGTGTGGTCAAA 

TATTCAATTTT^GAGCAAGGAAATTCTTATAAATATAACAATAGAGGCAGAACTCATGTAAGA 

ATAAATTGATTAGGTGGTATTAAATATTAAGTTCTTATGTATGTCAAAAGATATCATTTTGAA 

ATTCATCCATCTTATTGGGTATTGCAGGAGTTCATTCCTTTTTGTTTATAAATACTCTTCCGT 

CATATGAATAGTATTCATTTGTATACTGGTTTGTTGATGGACATTTGGGTTGTTCCCAGTTTA 

TGGCTATTACAAATAAAGCTTCTATGAACATTTATGTACA 
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FIGURE 62 



MKFLLLVLAALGFLTQVIPASAGGSKCVSNTPGYCRTCCHWGETALFMCNASRKCCISYSFLP 
KPDLPQLIGNHWQSRRRNTQRKDKKQQTTVTS 

Important features of the protein:
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 1-22 

N-glycosylation site. 

amino acids 50-53 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 79-82 

N-myristoylation site. 

amino acids 23-28 
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FIGURE 63 

GCGGAGCGCCTGGGAGAGGAGAAGGAGCCGACCTGCCGAGATGGAGGCGACCGGCACCTGGGC

GCTGCTGCTGGCGCTGGCGCTGCTCCTGCTGCTGACGCTGGCGCTGTCCGGGACCAGGGCCCG 

AGGCCACCTGCCCCCCGGGCCCACGCCGCTACCACTGCTGGGAAACCTCCTGCAGCTACGGCC 

CGGGGCGCTGTATTCAGGGCTCATGCGGCTGAGTAAGAAGTACGGACCGGTGTTCACCATCTA 

CCTGGGACCCTGGCGGCCTGTGGTGGTCCTGGTTGGGCAGGAGGCTGTGCGGGAGGCCCTGGG 

AGGTCAGGCTGAGGAGTTCAGCGGCCGGGGAACCGTAGCGATGCTGGAAGGGACTTTTGATGG 

CCATGGGGTTTTCTTCTCCAACGGGGAGCGGTGGAGGCAGCTGAGGAAGTTTACCATGCTTGC 

TCTGCGGGACCTGGGCATGGGGAAGCGAGAAGGCGAGGAGCTGATCCAGGCGGAGGCCCGGTG 

TCTGGTGGAGACATTCCAGGGGACAGAAGGACGCCCATTCGATCCCTCCCTGCTGCTGGCCCA 

SSSS^^^'^^^^^'^'^^^'^^'^^^'^^^^'^^'r^CTCTTTGGCCTCCGCTTCTCCTATGAGGATAAGGA 

GTTCCAGGCCGTGGTCCGGGCAGCTGGTGGTACCCTGCTGGGAGTCAGCTCCCAGGGGGGTCA 

GACCTACGAGATGTTCTCCTGGTTCCTGCGGCCCCTGCCAGGCCCCCACAAGCAGCTCCTCCA 

CCACGTCAGCACCTTGGCTGCCTTCACAGTCCGGCAGGTGCAGCAGCACCAGGGGAACCTGGA 

TGCTTCGGGCCCCGCACGTGACCTTGTCGATGCCTTCCTGCTGAAGATGGCACAGGAGGAACA 

AAACCCAGGCACAGAATTCACCAACAAGAACATGCTGATGACAGTCATTTATTTGCTGTTTGC 

TGGGACGATGACGGTCAGCACCACGGTCGGCTATACCCTCCTGCTCCTGATGAAATACCCTCA 

TGTCCAAAAGTGGGTACGTGAGGAGCTGAATCGGGAGCTGGGGGCTGGCCAGGCACCAAGCCT 

AGGGGACCGTACCCGCCTCCCTTACACCGACGCGGTTCTGCATGAGGCGCAGCGGCTGCTGGC 

GCTGGTGCCCATGGGAATACCCCGCACCCTCATGCGGACCACCCGCTTCCGAGGGTACACCCT 

GCCCCAGGGCACGGAGGTCTTCCCCCTCCTTGGCTCCATCCTGCATGACCCCAACATGTTCAA 

GCACCCAGAAGAGTTCAACCCAGACCGTTTCCTGGATGCAGATGGACGGTTCAGGAAGCATGA 

GGCGTTCCTGCCCTTCTCCTTAGGGAAGCGTGTCTGCCTTGGAGAGGGCCTGGCAAAAGCGGA 

GCTCTTCCTCTTCTTCACCACCATCCTACAAGCCTTCTCCCTGGAGAGCCCGTGCCCGCCGGA 

CACCCTGAGCCTCAAGCCCACCGTCAGTGGCCTTTTCAACATTCCCCCAGCCTTCCAGCTGCA 

AGTCCGTCCCACTGACCTTCACTCCACCACGCAGACCAGATGAAGGAAGGCAACTTGGAAGTG 

GTGGGTGCCCAGGACGGTGCCTCCAGCCTCAACAGTGGGCATGGACAGGGTTAATGTCTCCAG 

AGTGTACACTGCAGGCAGCCACATTTACACGCCTGCAGTTGTTTTCCGGAGTCTGTCCCACGG 

CCCACACGCTCACTTGACTCATGCTGCTAAGATGCACAACCGCACACCCATACACAACTACAA 

GGGCCACAAAGCAACTGCTGGGTTAGCTTTCCACAGACATAAATATAGTCCATCTGCAATCAC 

AAGCACATAGCCAGGTAACCCACCAACTCCCCTGGATCTGCAGCCCACACGTGGGAGTCTGGC 

TGTCACCTTCACAAGCCACAGAAACGGCCACACATGTTCACAGCTCACACGCCCTCTCCATTC 

ATCGAACTTCTCAGTGTCCCTGTCCCTGGTGCCTGGCACAGGGAACAGCATGCCCCCTCCGGG 

GTCATGCCACCCAGAGACTGTCGCTGTCTATGGCCCCAACTCATGCTCCCTCTCTTGGCTACA 

CCACTCTCCCAGCCTGTGACCACCGATGTCCACACACCCCCAACCACTTGTCCACACAGCTAC 

CCACGTACAACATCGTCCTGGCTCCCCAGAGTATCTTCCCACTGAGACACGCCGCCCCCACAG 

AGGCACAGTCCCCAGCCACCTCTGCAACTGCAGCCCTCAGTCACCCCTTTTTAAGCACCCTGA 

TTCTACCAAATGCAAACACATCTGGGTCTGCGATTATGCACAGAGACTTTGGACATACGAGGA 

CCCTCAGACCGGAGGAACACCTGCCCAACCCCAACACGTGCTTATGTAACCACGTGGAAAGCG 

GCCCCTGCTGCCCCTCCACACACACATACACACTCACTGATCTACAGCCCCTGTTCGGCGTCA 

GAGTCCCCACTAGACCCAGTGGAAGGGGTTAGAGACCAAGTAGGGGCCAGTTTCCAATTCACC 

CTGTCAGGGAGTGAGCCGGATCTGACGTTCCTTGTGACTTAAGGGTCCGGCTTGGGAATTAAA 

GTTTGTTTCTGGCCTTTAGCCTAAAAAAAAAAAAAAAAAA 
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FIGURE 64 



MEATGTWALLLALALLLLLTLALSGTRARGHLPPGPTPLPLLGNLLQLRPGALYSGLMRLSKK 
YGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGHGVFFSNGERWRQ
LRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTEGRPFDPSLLLAQATSNWCSLLFGL 
RFSYEDKEFQAVVRAAGGTLLGVSSQGGQTYEMFSWFLRPLPGPHKQLLHHVSTLAAFTVRQV
QQHQGNLDASGPARDLVDAFLLKMAQEEQNPGTEFTNKNMLMTVIYLLFAGTMTVSTTVGYTL 
LLLMKYPHVQKWVREELNRELGAGQAPSLGDRTRLPYTDAVLHEAQRLLALVPMGIPRTLMRT 
TRFRGYTLPQGTEVFPLLGSILHDPNIFKHPEEFNPDRFLDADGRFRKHEAFLPFSLGKRVCL 
GEGLAKAELFLFFTTILQAFSLESPCPPDTLSLKPTVSGLFNIPPAFQLQVRPTDLHSTTQTR 

Important features of the protein:
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 294-313 

Glycosaminoglycan attachment site. 

amino acids 99-103 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 128-132 

N-myristoylation sites. 

amino acids 51-57, 109-115, 115-121, 188-194, 207-213, 257-263, 
284-290, 339-345, 370-376, 444-450 

Amidation sites. 

amino acids 140-144, 435-439 

Leucine zipper pattern. 

amino acids 32-54, 39-61

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 433-443 
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FIGURE 65 

CGGACGCGTGGGGCCGTATGCGCGGCTCTGTGGAGTGCACCTGGGGTTGGGGGCACTGTGCCC 

CCAGCCCCCTGCTCCTTTGGACTCTACTTCTGTTTGCAGCCCCATTTGGCCTGCTGGGGGAGA 

AGACCCGCCAGGTGTCTCTGGAGGTCATCCCTAACTGGCTGGGCCCCCTGCAGAACCTGCTTC 

ATATACGGGCAGTGGGCACCAATTCCACACTGCACTATGTGTGGAGCAGCCTGGGGCCTCTGG 

CAGTGGTAATGGTGGCCACCAACACCCCCCACAGCACCCTGAGCATCAACTGGAGCCTCCTGC 

TATCCCCTGAGCCCGATGGGGGCCTGATGGTGCTCCCTAAGGACAGCATTCAGTTTTCTTCTG 

CCCTTGTTTTTACCAGGCTGCTTGAGTTTGACAGCACCAACGTGTCCGATACGGCAGCAAAGC 

CTTTGGGAAGACCATATCCTCCATACTCCTTGGCCGATTTCTCTTGGAACAACATCACTGATT 

CATTGGATCCTGCCACCCTGAGTGCCACATTTCAAGGCCACCCCATGAACGACCCTACCAGGA 

CTTTTGCCAATGGCAGCCTGGCCTTCAGGGTCCAGGCCTTTTCCAGGTCCAGCCGACCAGCCC 

AACCCCCTCGCCTCCTGCACACAGCAGACACCTGTCAGCTAGAGGTGGCCCTGATTGGAGCCT 

CTCCCCGGGGAAACCGTTCCCTGTTTGGGCTGGAGGTAGCCACATTGGGCCAGGGCCCTGACT 

GCCCCTCAATGCAGGAGCAGCACTCCATCGACGATGAATATGCACCGGCCGTCTTCCAGTTGG 

ACCAGCTACTGTGGGGCTCCCTCCCATCAGGCTTTGCACAGTGGCGACCAGTGGCTTACTCCC 

AGAAGCCGGGGGGCCGAGAATCAGCCCTGCCCTGCCAAGCTTCCCCTCTTCATCCTGCCTTAG

CATACTCTCTTCCCCAGTCACCCATTGTCCGAGCCTTCTTTGGGTCCCAGAATAACTTCTGTG 

CCTTCAATCTGACGTTCGGGGCTTCCACAGGCCCTGGCTATTGGGACCAACACTACCTCAGCT 

GGTCGATGCTCCTGGGTGTGGGCTTCCCTCCAGTGGACGGCTTGTCCCCACTAGTCCTGGGCA 

TCATGGCAGTGGCCCTGGGTGCCCCAGGGCTCATGCTGCTAGGGGGCGGCTTGGTTCTGCTGC 

TGCACCACAAGAAGTACTCAGAGTACCAGTCCATAAATTAAGGCCCGCTCTCTGGAGGGAAGG 

ACATTACTGAACCTGTCTTGCTGTGCCTCGAAACTCTGGAGGTTGGAGCATCAAGTTCCAGCC 

GGCCCCTTCACTCCCCCATCTTGCTTTTCTGTGGAACCTCAGAGGCCAGCCTCGACTTCCTGG 

AGACCCCCAGGTGGGGCTTCCTTCATACTTTGTTGGGGGACTTTGGAGGCGGGCAGGGGACAG 

GGCTATTGATAAGGTCCCCTTGGTGTTGCCTTCTTGCATCTCCACACATTTCCCTTGGATGGG 

ACTTGCAGGCCTAAATGAGAGGCATTCTGACTGGTTGGCTGCCCTGGAAGGCAAGAAAATAGA 

TTTATTTTTTTTCACAGGGAAAAAAAAAAAA 
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FIGURE 66 

MRGSVECTWGWGHCAPSPLLLWTLLLFAAPFGLLGEKTRQVSLEVIPNWLGPLQNLLHIRAVG 
TNSTLHYVWSSLGPLAVVMVATNTPHSTLSINWSLLLSPEPDGGLMVLPKDSIQFSSALVFTR 
LLEFDSTNVSDTAAKPLGRPYPPYSLADFSWNNITDSLDPATLSATFQGHPMNDPTRTFANGS
LAFRVQAFSRSSRPAQPPRLLHTADTCQLEVALIGASPRGNRSLFGLEVATLGQGPDCPSMQE 
QHSIDDEYAPAVFQLDQLLWGSLPSGFAQWRPVAYSQKPGGRESALPCQASPLHPALAYSLPQ 
SPIVRAFFGSQNNFCAFNLTFGASTGPGYWDQHYLSWSMLLGVGFPPVDGLSPLVLGIMAVAL 
GAPGLMLLGGGLVLLLHHKKYSEYQSIN 

N-glycosylation sites:

amino acids 65-69, 95-99, 134-138, 159-163, 187-191, 230-234, 
333-337 

cAMP- and cGMP-dependent protein kinase phosphorylation site:

amino acids 397-401 

Casein kinase II phosphorylation sites: 

amino acids 151-155, 249-253, 255-259 

N-myristoylation sites:

amino acids 3-9, 63-69, 235-241, 273-279, 292-298, 324-330 

Leucine zipper pattern. 

amino acids 371-393 
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FIGURE 67 

CGGGACAGGCGCGTGAGGCCACAACACATGCGTGTATCTTGCTTGGGCTATCTTCCCTGCTCTGCCACGCCGGGT 
CTGGAGAAGGGGTTTCAGCCCCAGGACATTTACTGAGAGTCGGCGAATATTGGGAGCCGCGATGTTCCCCCTTCG 
°^?£^'^^^*^'^^^T<='^^?<^CGCTTCTAGGAGTGGCCGGATCATGCCCGGAGCCGTGCGCCTGCGTGGACAAGTA 
^^^^^^^'^^^^^^^^^^'^'^^^^^'^^^^^^GTTGCGTGAGGTGCCGGAAGGACTGCCTGCCAACGTGACGAC 
^?T^^S^^^^^^^°CGAACAAGATCACTGTGCTGCGGCGCGGGGCCTTCGCCGACGTCACACAGGTCACGTCGCT 
GTGGCTGGCGCACAATGAGGTGCGCACCGTGGAGCCAGGCGCACTGGCCGTGCTGAGTCAGCTCAAGAACCTCGA 
TCTGAGCCACAACTTCATATCCAGCTTTCCGTGGAGCGACCTGCGCAACCTGAGCGCGCTGCAGCTGCTCAAAAT 
GAACCACAACCGCCTGGGCTCTCTGCCCCGGGACGCACTCGGTGCGCTACCCGACCTGCGTTCCCTGCGCATCAA 
CAACAACCGGCTGCGTACGCTGGCGCCTGGCACCTTCGACGCGCTTAGCGCGCTGTCACACTTGCAACTCTATCA 
CAATCCCTTCCACTGCGGCTGCGGCCTTGTGTGGCTGCAGGCCTGGGCCGCGAGCACCCGGGTGTCCTTACCCGA 

^^^^^^^^'^^*^^°CC^SCCACCGCTTGAAGCACCCGGCACCCCACTGCGCGCAGGACTGGC 

gttcgtgttacactgcatcgccgacggccaccctacgcctcgcctgcaatggcaacttcagatccccggtggSc 

$r^^IS3^™^^'^^^°^^'^^^^^'^^^^^^C^^CGGGGTTGGGGCGGAGGAAGGAGAGGGAG^^ 

^o™^r°^^^^^°^°^^^^°CC^CGCCGACTCCAGCACCCGCTTGGCCGGCGCCCCCAGCCACACC 

^™^»^?^^^^'^^°^^^*^'^^^^'^^'f'r«5TGCCCCTCCTGAGTGCCAAGGAGGCGGGCGTCTACACTTG 

^^^^^^-^^'^^^'^^'^^^^'^'^^^^^^''^C^TC^TACGCGTGGCGGTGGCAGCAACCGGGCCCCCAAAACA 

^^^^^oSSHS^*^*^^^^^^^^^'^^'^^^^^*^^"^C<^<=G^CCTCTGAGCGCAAGTCCACAGCCAAGGGCCGGGG 

^^^^J^'^^^^^^^^^^^^^^^'^^^^^^TCAAAGGCCAAGGCCTGGCCAAGGTCAGCATTCTCGGGGA 

GACCGAGACGGAGCCGGAGGAGGACACAAGTGAGGGAGAGGAGGCCGAAGACCAGATCCTCGCGGACCCGGCGGA 

GGAGCAGCGCTGTGGCAACGGGGACCCCTCTCGGTACGTTTCTAACCACGCGTTCAACCAGAGCGCAGAGCTCAA 

GCCGCACGTCTTCGAGCTGGGCGTCATCGCGCTGGATGTGGCGGAGCGCGAGGCGCGGGTGCAGCTGACTCCGCT 

GGCTGCGCGCTGGGGCCCTGGGCCCGGCGGGGCTGGCGGAGCCCCGCGACCCGGGCGGCGACCCCTGCGCCTACT 

^3;^]^S^'^^^^*^^^°<^'^°'^*^<^'5C'5<^CGCGGCAGTGCAGTGGTCCCGCGTAGAGGAAGGCGTCAACGCCTACTGGTT 

™S^S^^'^^^^^^°^^^^^^^^*^^'='^^<=^T^TGCCTGGCGCTGGCGGGCGAAGCCTGCCACGTGCAAGTGGT 

GTTTTCCACCAAGAAGGAGCTCCCATCGCTGCTGGTCATAGTGGCAGTGAGCGTATTCCTCCTGGTGCTGGCCAC 

™o^^^*^^^^'^'^^'^^<^'^<^'=^°C'fGCCATCTGCTGGCTAAAavCCCGGGCAAGCCCTACCGTCTGATCCTGCG 

GCCTCAGGCCCCTGACCCTATGGAGAAGCGCATCGCCGCAGACTTCGACCCGCGTGCTTCGTACCTCGAGTCCGA 

GAAAAGCTACCCGGCAGGCGGCGAGGCGGGCGGCGAGGAGCCAGAGGACGTGCAGGGGGAGGGCCTTGATGAAGA

^SoS^^SS^S^^^*^^^*^^°°°*^^C^T<^^^'^<5AGAGGAGAGCCTGGCGGCCTGCTCACTGGTGGAGTCCC^ 

GTCCAAGGCCAACCAAGAGGAGTTCGAGGCGGGCTCTGAGTACAGCGATCGGCTGCCCCTGGGCGCCGAGGCGGT 

CAACATCGCCCAGGAGATTAATGGCAACTACAGGCAGACGGCAGGCT^CCTCCGCCCGTCCGGCCCGCCCATT 

CCCGACCTCCACCTAGGGTGCCTGGGAGCAGCAGTCTAGGGCTGGCAGGACTTATGTCCCCCGTCCCCAACCTTC 

^S^I^^^^^^^^^'^^^'^^^T'^^^^^'^'^TTGACTACCAGGGACTTCTATTAGGGAGTGGGCCGATTTCACCA 

GTCCCTGCTACCCACGGCTGCCATTCTCCCTGCGGGCTGAATCCCCTTCCCCGCCAAGCACAGTGTTTATCTTAC 

CCCATGCAAGACTCCACCCGCAGACGGTGGGCGATATCTATGTCCCTCCATTCCCGTCGCGATTATCTGCGAAAT 

^^^^?SS^^^^^*^^'^*^^^'^^^^^^^'^°^'^<^'^^'5AGCCAGAGGAAACGAGCGAAGACTTTGGAAACCTCGCGGTAA 

CGCGGTGGTTTCGGGGGCCAGCCAAGGCCAGTGGAGTGCTGTGGGGTCCCACCTCGACCCCTCCTCCTCCCTTTC 

TTTCTTTCCTTTTTTTTTATTTTTTAATTTTATTTATTTATTTATTTATTTTTTGACGGAGTCTTGGTCTGTCGC 

CAGGCTGGAGTGCAGTGGCGCGATCTCGGCTCACTGCATCTTCCGCCTCCCGGGTTCAAGCGATTCTCCTGCCTC 

AGCCTGCCTAGTAGCTGGGACTACAGGCGCGCGCCACCACGACCAGCTAATTTCTTCTATTTTTAGTAGAGACGG 

GGTTTCACCATGTTGGCCAGGATGGTCTGGATCTCTTGACCTCAGGTGATCCATCTGCCTCGGCCTCTCAAAGTG 

CTGGGATTACAGGCGTGAGGCACCGCGCCCGGCCrCTCCTCCCTTTCAATCCCTACTCCCAGAAGCCGGGATTCG 

^S^S^S'^^^'^^'^'^^^^^^'^^'^^^^^C^^CCTGCCGGCAGGGAACCAAATCCTTCTGTCC^^^ 

CCACTTCTGGCCAGTTGGAGTCCAGCCCGGTGCCTGGGGCGCCTTTCAGCTCCGCGCTCAGATTTTCCTGTTTTC 

S^5P'^^^'^*^'^*^**^^*'^^^'^'"C®^TCTGGTGCTAACACCCCCTTCCCAGCCTCTGGGAAAATCGAGT 

^''^'^^'^'''^^^''''''^"^'^C^CGGGCCAATTTCGTTGTCCGTGTGTTGGGCTTTC^ 
^,^^^^°^°^^°^'^^^*^^'^^<^^^^^CCGACCCAGACCCTAGCTGGAAAGCGCCGGAGGCGGAGGAAGCT 
^^^S^S ^^^'^'^'^^^°^^'^^'^^^^<^^^^^<^<=TCGCGCCCTAGTTCGCACAAAGCCTGCTCGTGACTGTGC 
^^S^°TS»S^^°°*^^^^^'^*^^'^*^^'^<^^C^^<^C'=CCTCCGTCCTCGCGTCTCGGTCCTCGCGTCGCCCCGCCCCAC 
S^S^H^ ^^^^^°^^°<^'^^'^^^'^<^TTTGCCCGGCGTGTAGTCCCTGACAAGCGTGCCCTGTAGGAGAAAAGTC 
^^J^T^S^^^^^^'^'^TG^CCGTGTAGTGTAGGGGGGCGGGCGGGGGGGCGGATGGGCGGGGAGGGAGGGAAGGG 
GAGGGGCGCGGCGCGGCGACTCGGGGCGGGGTTCTTTTTTCCATTTTGAAAGAAAGCGTCGGGGTTGGGGTGGGG 
™o™S;!^o^'^^^°^^'^^^^^*^*^'^^'^^<^^^<^^°CGCAGCACAAGCGCGGGCCTGGGACGGAG^^^ 
GGAGCCCGTGCCCTTTTCTAAACGCGTCTGTATGCAGTCAATAAAACAATCGATTTGAAA 
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MFPLRALWLVWALLGVAGSCPEPCACVDKYAHQFADCAYKELREVPEGLPANVTTLSLSANKI 
TVLRRGAFADVTQVTSLWLAHNEVRTVEPGALAVLSQLKNLDLSHNFISSFPWSDLRNLSALQ 
LLKMNHNRLGSLPRDALGALPDLRSLRINNNRLRTLAPGTFDALSALSHLQLYHNPFHCGCGL
VWLQAWAASTRVSLPEPDSIACASPPALQGVPVYRLPALPCAPPSVHLSAEPPLEAPGTPLRA 
GLAFVLHCIADGHPTPRLQWQLQIPGGTWLEPPVLSGEDDGVGAEEGEGEGDGDLLTQTQAQ 
TPTPAPAWPAPPATPRFLALANGSLLVPLLSAKEAGVYTCRAHNELGANSTSIRVAVAATGPP 
KHAPGAGGEPDGQAPTSERKSTAKGRGNSVLPSKPEGKIKGQGLAKVSILGETETEPEEDTSE 
GEEAEDQILADPAEEQRCGNGDPSRYVSNHAFNQSAELKPHVFELGVIALDVAEREARVQLTP 
LAARWGPGPGGAGGAPRPGRRPLRLLYLCPAGGGAAVQWSRVEEGVNAYWFRGLRPGTNYSVC
LALAGEACHVQVVFSTKKELPSLLVIVAVSVFLLVLATVPLLGAACCHLLAKHPGKPYRLILR
PQAPDPMEKRIAADFDPRASYLESEKSYPAGGEAGGEEPEDVQGEGLDEDAEQGDPSGDLQRE
ESLAACSLVESQSKANQEEFEAGSEYSDRLPLGAEAVNIAQEINGNYRQTAG 

Important features of the protein: 
Signal peptide: 
amino acids 1-19 
Transmembrane domain:
amino acids 587-610 
N-glycosylation sites.

amino acids 52-55, 121-124, 337-340, 364-367, 474-477, 563-566 
cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 397-400 

Casein kinase II phosphorylation sites. 

amino acids 19-23, 202-205, 289-292, 246-249, 411-414, 431-434, 
433-436, 440-443, 544-547, 583-586, 650-653, 700-703 
N-myristoylation sites.

amino acids 15-20, 48-53, 165-170, 296-301, 351-356, 362-367,
390-395, 419-424, 514-519, 536-541, 557-562, 561-566, 610-615,
661-666, 716-721
Amidation site. 

amino acids 522-525 

Prokaryotic membrane lipoprotein lipid attachment sites. 

amino acids 10-20, 603-613 
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GGCGGCGGGAGCAGCGAAGGGGGCGGCAGGGATCCTCCAGGCTGCCGGCTGGGAAGGCGTGGG 

CGACCCGGTGTGTGGCGCGCCCAGAGCCCCGCGTTTCAGCCCTAGGGAAGGAAGCCAGTTGAG 

GGAAGTTCTCCATGAATGTACGTCACAATGATGATGACCGACCAAATCCCTCTGGAACTGCCA

CCATTGCTGAACGGAGAGGTAGCCATGATGCCCCACTTGGTGAATGGAGATGCAGCTCAGCAT 

GTTATTCTCGTTCAAGTTAATCCAGGTGAGACTTTCACAATAAGAGCAGAGGATGGAACACTT 

CAGTGCATTCAAGGACCTGCTGAAGTTCCCATGATGTCACCCAATGGATCCATTCCTCCCATT 

CATGTGCCTCCAGGTTATATCTCACAGGTGATTGAAGATAGTACTGGAGTCCGCCGGGTGGTG 

GTCACACCCCAGTCTCCTGAGTGTTATCCCCCAAGCTACCCCTCAGCCATGTCTCCAACCCAT 

CATCTCCCTCCCTATCTGACTCACCATCCACATTTTATTCATAACTCACACACGGCTTACTAC 

CCACCTGTTACCGGACCTGGAGATATGCCGCCTCAGTTTTTTCCCCAGCATCATCTTCCCCAC 

ACAATATATGGTGAGCAAGAAATTATACCATTTTATGGAATGTCAAGCTACATCACCCGAGAA 

GACCAGTACAGCAAGCCTCCGCACAAAAAACTGAAAGACCGCCAGATCGATCGCCAGAACCGC 

CTCAACAGCCCTCCTTCTTCTATCTACAAAAGCAGCTGCACAACAGTATACAATGGCTATGGG 

AAGGGCCATAGTGGTGGAAGTGGCGGAGGCGGCAGCGGTAGTGGTCCCGGAATTAAGAAAACA 

GAGCGACGAGCAAGAAGCAGCCCAAAGTCGAATGATTCAGACTTGCAAGAATATGAGTTGGAA 

GTAAAGAGGGTGCAAGACATTCTTTCGGGAATAGAGAAACCACAGGTTTCTAATATTCAGGCA 

AGAGCAGTTGTGTTGTCCTGGGCTCCCCCTGTTGGACTTTCCTGTGGACCCCACAGTGGTCTT 

TCCTTCCCCTACAGTTACGAGGTGGCCTTATCAGACAAAGGACGAGATGGAAAATACAAGATA 

ATTTACAGTGGAGAAGAATTAGAATGTAACCTGAAAGATCTTAGACCAGCAACAGATTATCAT 

GTGAGGGTGTATGCCATGTACAATTCCGTAAAGGGATCCTGCTCCGAGCCTGTTAGCTTCACC 

ACCCACAGCTGTGCACCCGAGTGTCCTTTCCCCCCTAAGCTGGCACATAGGAGCAAAAGTTCA 

CTAACCCTGCAGTGGAAGGCACCAATTGACAACGGTTCAAAAATCACCAACTACCTTTTAGAG 

TGGGATGAGGGAAAAAGAAATAGTGGTTTCAGACAGTGCTTCTTCGGGAGCCAGAAGCACTGC 

AAGTTGACAAAGCTTTGTCCGGCAATGGGGTACACATTCAGGCTGGCCGCTCGAAACGACATT 

GGCACCAGTGGTTATAGCCAAGAGGTGGTGTGCTACACATTAGGAAATATCCCTCAGATGCCT 

TCTGCACCAAGGCTGGTTCGAGCTGGCATCACATGGGTCACGTTGCAGTGGAGTAAGCCAGAA 

GGCTGTTCACCCGAGGAAGTGATCACCTACACCTTGGAAATTCAGGAGGATGAAAATGATAAC 

CTTTTCCACCCAAAATACACTGGAGAGGATTTAACCTGTACTGTGAAAAATCTCAAAAGAAGC 

ACACAGTATAAATTCAGGCTGACTGCTTCTAATACGGAAGGAAAAAGCTGTCCAAGCGAAGTT 

CTTGTTTGTACGACGAGTCCTGACAGGCCTGGACCTCCTACCAGACCGCTTGTCAAAGGCCCA 

GTTACATCTCATGGCTTTAGTGTCAAATGGGATCCCCCTAAGGACAATGGTGGTTCAGAAATC 

CTCAAGTACTTGCTAGAGATTACTGATGGAAATTCTGAAGGTGAAGTTTTTGGCAATTGTTTT 

ATTCAAATCCAATAGCAAGCTCTGTTTTCTAATATAGTAAATGTCTTTATAGTAATAGTGAGT

AATCATTAATTCTAAAGATAGAATTATTATTACAATAAACAAACTTTAGTCACATATTGGCAG 

TTTTTCTATTTCAAACACAGCACCAGAGATCAGAGTCTACTTGAAACTTACATTTGTGTTATT 

TAACAATTTTTCTGTATCTTTTTCATTGGTGTTTTGTTTTGTTTATCTTTTGTTTTTGTTTCT 

TTGGTTTGGTTTGTTTTTGTTTTGTTTTTTGAGATACGATCTCTGTCACACAGGCTGGAGGGC 

AGTGGCACAGACATGGCCCATTGCAGTCTCAGACTCCTGGGCTTAAGTGACTCTTCTGCCACA 

GAAGATGAGGAAGAATACATTTTTCATAGTGATGGGGTCTCACTATGTTATCTAGGCTGGTCT 

CAAACTCCTGGCCTCAAGCAACCCTCCACCTTGGCCTCCCAAAGTGCTGGGACTATAGACATG 

AATCACCACACTCAGCTTCCATGTCTTTTTATGAACTAGGGTTCCTAATTAATCAGATAAATT 

TGGTATTTTCATCTCCTAACTTGCCATATGTTTTCTGGAAATTCTTATAAGCAGCCGAGAGTG 

GTGGCTCACGCTGTAGTCCCAGCACTTTGGGAGGCTGAGGTGGGTGGTCAGGAGATCAAGACC 

ATCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGTGTGGTG 

GCAGGCACCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGAAGAATTGCTTGAACCCAGCAG 

GCGGAGGTTGCAGTGAGCTGAGATTGCACCACTGCACTCCAGCCTGGTGACAGAGTGAGACTC 
TGTCTCAAAAAAAAAAAA 



FIGURE 70 



MMMTDQIPLELPPLLNGEVAMMPHLVNGDAAQHVILVQVNPGETFTIRAEDGTLQCIQGPAEV
PMMSPNGSIPPIHVPPGYISQVIEDSTGVRRVVVTPQSPECYPPSYPSAMSPTHHLPPYLTHH 
PHFIHNSHTAYYPPVTGPGDMPPQFFPQHHLPHTIYGEQEIIPFYGMSSYITREDQYSKPPHK 
KLKDRQIDRQNRLNSPPSSIYKSSCTTVYNGYGKGHSGGSGGGGSGSGPGIKKTERRARSSPK 
SNDSDLQEYELEVKRVQDILSGIEKPQVSNIQARAVVLSWAPPVGLSCGPHSGLSFPYSYEVA
LSDKGRDGKYKIIYSGEELECNLKDLRPATDYHVRVYAMYNSVKGSCSEPVSFTTHSCAPECP 
FPPKLAHRSKSSLTLQWKAPIDNGSKITNYLLEWDEGKRNSGFRQCFFGSQKHCKLTKLCPAM 
GYTFRLAARNDIGTSGYSQEVVCYTLGNIPQMPSAPRLVRAGITWVTLQWSKPEGCSPEEVIT
YTLEIQEDENDNLFHPKYTGEDLTCTVKNLKRSTQYKFRLTASNTEGKSCPSEVLVCTTSPDR 
PGPPTRPLVKGPVTSHGFSVKWDPPKDNGGSEILKYLLEITDGNSEGEVFGNCFIQIQ 

Important features of the protein: 
N-glycosylation sites. 

amino acids 69-73, 254-258, 401-405 
Glycosaminoglycan attachment sites. 

amino acids 229-233, 234-238, 236-240 

cAMP- and cGMP-dependent protein kinase phosphorylation sites.
amino acids 416-420, 535-539
Tyrosine kinase phosphorylation site.
amino acids 319-326
N-myristoylation sites.

amino acids 52-58, 227-233, 228-234, 230-236, 231-237, 232-238, 
235-241, 239-245, 402-408, 610-616 
Amidation site.
amino acids 414-418 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 290-301 

ATP/GTP-binding site motif A (P-loop).

amino acids 546-554 

CUB domain proteins profile. 

amino acids 294-301. 
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AAGTCATTCAGTGGATGTGATCTTGGCTCACAGGGGACGMGTCAAGCTCTTCCTGGCTCCTTCTCAGCCTTGTT 

GCTGTAACTGCTGCTCAGTCCACCATTGAGGAACAGGCCAAGACATTTTTGGACAAGTTTAACCACGAAGCCGAA 

GACCTGTTCTATCAAAGTTCACTTGCTTCTTGGAATTATAACACCAATATTACTGAAGAGAATGTCCAAAACATG 

AATAATGCTGGGGACAAATGGTCTGCCTTTTTAAAGGAACAGTCCACACTTGCCCAAATGTATCCACTACAAGAA 

ATTCAGAATCTCACAGTCAAGCTTCAGCTGCAGGCTCTTCAGCAAAATGGGTCTTCAGTGCTCTCAGAAGACAAG 

AGCAAACGGTTGAACACAATTCTAAATACAATGAGCACCATCTACAGTACTGGAAAAGTTTGTAACCCAGATAAT 

CCACAAGAATGCTTATTACTTGAACCAGGTTTGAATGAAATAATGGCAAACAGTTTAGACTACAATGAGAGGCTC 

TGGGCTTGGGAAAGCTGGAGATCTGAGGTCGGCAAGCAGCTGAGGCCATTATATGAAGAGTATGTGGTCTTGAAA 

AATGAGATGGCAAGAGCAAATCATTATGAGGACTATGGGGATTATTGGAGAGGAGACTATGAAGTAAATGGGGTA 

GATGGCTATGACTACAGCCGCGGCCAGTTGATTGAAGATGTGGAACATACCTTTGAAGAGATTAAACCATTATAT 

GAACATCTTCATGCCTATGTGAGGGCAAAGTTGATGAATGCCTATCCTTCCTATATCAGTCCAATTGGATGCCTC 

CCTGCTCATTTGCTTGGTGATATGTGGGGTAGATTTTGGACAAATCTGTACTCTTTGACAGTTCCCTTTGGACAG 

AAACCAAACATAGATGTTACTGATGCAATGGTGGACCAGGCCTGGGATGCACAGAGAATATTCAAGGAGGCCGAG 

AAGTTCTTTGTATCTGTTGGTCTTCCTAATATGACTCAAGGATTCTGGGAAAATTCCATGCTAACGGACCCAGGA 

AATGTTCAGAAAGCAGTCTGCCATCCCACAGCTTGGGACCTGGGGAAGGGCGACTTCAGGATCCTTATGTGCACA 

AAGGTGACAATGGACGACTTCCTGACAGCTCATCATGAGATGGGGCATATCCAGTATGATATGGCATATGCTGCA 

CAACCTTTTCTGCTAAGAAATGGAGCTAATGAAGGATTCCATGAAGCTGTTGGGGAAATCATGTCACTTTCTGCA

S^^S'^'^^^^'^'^'^'^^^'^^^^'^'^'^^'^^TTCTGTCACCCGATTTTCAAGAAGACAATGAAACAGAAATAAAC 

TTCCTGCTCAAACAAGCACTCACGATTGTTGGGACTCTGCCATTTACTTACATGTTAGAGAAGTGGAGGTGGATG 

GTCTTTAAAGGGGAAATTCCCAAAGACCAGTGGATGAAAAAGTGGTGGGAGATGAAGCGAGAGATAGTTGGGGTG 

GTGGAACCTGTGCCCCATGATGAAACATACTGTGACCCCGCATCTCTGTTCCATGTTTCTGATGATTACTCATTC 

ATTCGATATTACACAAGGACCCTTTACCAATTCCAGTTTCAAGAAGCACTTTGTCAAGCAGCTAAACATGAAGGC 

CCTCTGCACAAATGTGACATCTCAAACTCTACAGAAGCTGGACAGAAACTGTTGTAAGAAATACCTCAAAATGTT 

GAACCTCTCCTAGTATTCAGTATTACTCATTTCCATGCCTAGGTTTGTATTTGATTTCTTTGTTCTAAAAAGAAA 

ATTTTATGGCCTCAAAATGTCCTCATTTACAAACCAAACATTTAATTTGTGGTCAGACAGGAACCTAGACCATAC 

aacaattgggtgggccacctcttttctccctatcataactacagccctctcttcctggtaattggaaggaaagag

CGGTTTAGGGTGGAATATATCTGTTAATATGCATTCTTTTCTTATCTGCCAGAAGCAAATTTAGCCAAGTCAAAG 

agaagaaaccatagatcatagatgtaaatatatgtacatctggaacccctcaaaaggccctgaaccccctttttt 

TGTGTAGCAATATGCTGAGGCTTGGAAAATCAGAACCCTGGACCCTAGCATTGGAAAATGTTGTAGGAGCAAGAA 

catgaatgtaaggccactgctcaactactttgagcccttatttacctggctgaaagaccagaacaagaattcttt 

IS3^o *^^*^^^^^*^^*^^^'^'^^^^™'^'^C^^^CCCAAAGCATCAAAGTGAGGATAAGCCTAAAATC^^ 

ttggagataaagcatatgaatggaacgacaatgaaatgtacctgttccgatcatctgttgcatatgctatgaggc 

AGTACTTTTTAAAAGTAAAAAATCAGATGATTCTTTTTGGGGAGGAGGATGTGCGAGTGGCTAATTTGAAACCAA 

^^IS^^^^'^'^^^'^'^'^^^'^^^'^^^'^'^^'^^ctaaaaatgtgtctgatatcattcctagaactgaagttgaaaagg 

CCATCAGGATGTCCCGGAGCCGTATCAATGATGCTTTCCGTCTGAATGACAACAGCCTAGAGTTTCTGGGGATAC 

agccaacacttggacctcctaaccagccccctgtttccatatggctgattgtttttggagttgtgatgggagtga 

'^*^^^'^'^*^^*^^^^^^°^'^^^T<^^C^<^<^G^TCAGAGATCGGAAGAAGAAAAATAAAGCAAGAAGTGGAG 

aaaatccttatgcctccatcgatattagcaaaggagaaaataatccaggattccaaaacactgatgatgttcaga 

CCTCCTTTTAGAAAAATCTATGTTTTTCCTCTTGAGGTGATTTTGTTGTATGTAAATGTTAATTTCATGGTATAG 

aaaatataagatgataaagatatcattaaatgtcaaaactatgactctgttcagaaaaaaaattgtccaaagaca 
acatggccaaggagagagcatcttcattgacattgctttcagtatttatttctgtctctggatttgacttctgtt 
ctgtttcttaataaggattttgtattagagtatattagggaaagtgtgtatttggtctcacaggctgttcaggga 

TAATCTAAATGTAAATGTCTGTTGAATTTCTGAAGTTGAAAACAAGGATATATCATTGGAGCAAGTGTTGGATCT 

tgtatggaatatggatggatcacttgtaaggacagtgcctgggaactggtgtagctgcaaggattgagaatggca 

TGCATTAGCTCACTTTCATTTAATCCATTGTCAAGGATGACATGCTTTCTTCACAGTAACTCAGTTCAAGTACTA 

tggtgatttgcctacagtgatgtttggaatcgatcatgctttcttcaaggtgacaggtctaaagagagaagaatc 

CAGGGAACAGGTAGAGGACATTGCTTTTTCACTTCCAAGGTGCTTGATCAACATCTCCCTGACAACACAAAACTA 

gagccaggggcctccgtgaactccccagagcatgcctgatagaaactcatttctactgttctctaactgtggagt 

GAATGGAAATTCCAACTGTATGTTCACCCTCTGAAGTGGGTACCCAGTCTCTTAAATCTTTTGTATTTGCTCACA 

gtgtttgagcagtgctgagcacaaagcagacactcaataaatgctagatttacaaaa 
FIGURE 72

MSSSSWLLLSLVAVTAAQSTIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEENVQNMN
NAGDKWSAFLKEQSTLAQMYPLQEIQNLTVKLQLQALQQNGSSVLSEDKSKRLNTILNTMSTI
YSTGKVCNPDNPQECLLLEPGLNEIMANSLDYNERLWAWESWRSEVGKQLRPLYEEYVVLKNE
MARANHYEDYGDYWRGDYEVNGVDGYDYSRGQLIEDVEHTFEEIKPLYEHLHAYVRAKLMNAY 
PSYISPIGCLPAHLLGDMWGRFWTNLYSLTVPFGQKPNIDVTDAMVDQAWDAQRIFKEAEKFF 
VSVGLPNMTQGFWENSMLTDPGNVQKAVCHPTAWDLGKGDFRILMCTKVTMDDFLTAHHEMGH 
IQYDMAYAAQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEINFLLK 
QALTIVGTLPFTYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEPVPHDETYCDPASLF 
HVSDDYSFIRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLL. 

Important features of the protein:
Signal peptide:

amino acids 1-17

N-glycosylation sites.

amino acids 53-57, 90-94, 103-107, 322-326, 432-438, 546-550

N-myristoylation sites.

amino acids 260-266, 286-292, 395-401

Cell attachment sequence.

amino acids 204-207

Neutral zinc metallopeptidases, zinc-binding region signature.

amino acids 371-381 
FIGURE 73

CCCACGCGTCCGAGCGGGGTGGACAAGTGGCGTGTGTGCTGCGACCCCGAGGGAAGATGAACG

GGACGCGGAACTGGTGTACCCTGGTGGACGTGCACCCAGAGGACCAGGCGGCGGCGGGCAGGA 

AGACCTATGCCATGGTGTCCAGCCACTCAGCTGGTCATTCTCTGGCTTCAGAACTGGTGGAGT 

CCCATGATGGACATGAGGAGATCATTAAGGTGTACTTGAAGGGGAGGTCTGGAGACAAGATGA 

TTCACGAGAAGAATATTAACCAGCTGAAGAGTGAGGTCCAGTACATCCAGGAGGCCAGGAACT 

GCCTACAGAAGCTCCGGGAGGATATAAGTAGCAAGCTTGACAGGAACCTAGGAGATTCTCTCC 

ATCGACAGGAGATACAGGTGGTGCTAGAAAAGCCAAATGGCTTTAGTCAGAGTCCCACAGCCC 

TGTACAGCAGCCCACCTGAGGTGGACACCTGTATAAATGAGGATGTTGAGAGCTTGAGGAAGA 

CGGTGCAGGACTTGCTGGCCAAGCTTCAGGAGGCCAAGCGGCAACACCAGTCAGACTGTGTGG 

CTTTTGAGGTCACACTCAGCCGGTACCAGAGGGAAGCAGAACAAAGTAATGTGGCCCTTCAGA 

GAGAGGAGGACAGATGTCCAGAGTGATTGGAGT^TGTCCTGGGGGAATGAAGTTCCTTCCACA

AACACAGCTCAGTTCTTAGCAACAAACTGTTTGTTTTTCTACTTGCTCCATCTGCAGCCTACG 

CTGCCCTGGCCTCCTGCAGACAGATAGTGGGGTTACCTGGCAAGGCCTGGTGAGAGCCAGTGA 

ACCTAAGCTTTGACTGGGTGGCCTTGTCTTTCTGGGGAGGAGGGAATGTACATTCAGGGAGTA 

GCCTTTTGCGGAAAAATTCTCTAGGGCTACAGACAGTCATGTGTGACTTCTCTCTGCTGTGAA 

AACTCCCAGAGTCTCTTTAGGGATTTTCCCTAAGGTGTACCACCAGGCACACCTCAGTCTTCT 

TGACCCAGAGCCTGAAAACTGTTTTCACTGGGTTCCACCAGTCCCAGCAAAATCCTCTTTGTA 

TTTATTTTGCTAAGTTATTGGTGGTTTTGCTTACATCTCATGATTGATATAATACCAAAGTTC 

TATAGCCTTCTCTTGCAGTATTTGGATTTGCTTGAAACCGGGAAAACTGTTCCCATTAGGCTT 

GTTAATGTCAGAGTGACACTATTATGAATCTTTCTCTCCCTTTCCTCTGCCTGTTTCTTCTCT 

CTTTCTCCTTCAAACTTGCTCTGCAGCTAAGGAAGGTGAGTCTACTTTCCCTGAGGCTTTGGG 

GTCAGAGTATATGTTGTTTGGAGAAAGAGGGCAATCAGGACTCTTCTGGGACCCAGATGAGTT 

CTTCACTAGCCCTTCTGAACCCCTTGCTCCATAATTGGTCTTTTATCCTGGCTCTGAATGACC 

CTGCAGGTCATCATGGTTTTCTTTTTTTATTGTTTTTTTTTTTTTCTGAGACAGAGTCTCACT 

CTGTCACCCAGGCTGGAGTGCAGTGGCGCGATCTCAGCTCACTGCAACCTCTGCCTCCCGGAT 

TTAAGCGATTCTTCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGTGTGCCACCACGCCTG 

GCTGATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATACTGGCTAGGCTGGTCTCGAATT 

CCTGACCTCAGGTGATCCACCCACCTCGGCTTCCCAAAGTGCTAGGATTATAGGCTTGAGCTA 

CTGCGCCCGGCCCATGGTGTTTTTCTTTAGGGCTCTTCCTACAGCCTTGAGAAGTAGATAGGC 

ATCAGAGTATGGTACTATAGGAATCAGAAAAATTCAAAACAAATGTGGATTAAGTGTTTAGGC 

TCTATGTGGCTCACGCAGCCAGAATCCTTAAGTCTGTGTGTTTCTGTGTCTCAAGACTGGGCT 

CACATTCTGGCTTTGTCCATAACAATGCTCTGGGATTTCAGGGAGTTCCCTCATTTGTAAAAT 

GAGGGGGTCAGAGCAGGTGATATCCATGTTTCTTCCCTTTCTGATATTGTTGTCTGTGGCATA 

TTCTTTGTATGGCGAATTTAATAAATTATATTAATGTGTCA 
FIGURE 74

MNGTRNWCTLVDVHPEDQAAAGRKTYAMVSSHSAGHSLASELVESHDGHEEIIKVYLKGRSGD 
KMIHEKNINQLKSEVQYIQEARNCLQKLREDISSKLDRNLGDSLHRQEIQVVLEKPNGFSQSP
TALYSSPPEVDTCINEDVESLRKTVQDLLAKLQEAKRQHQSDCVAFEVTLSRYQREAEQSNVA 
LQREEDRCPE 

Important features of the protein: 
Signal peptide: 
amino acids 1-39 

N-glycosylation site. 

amino acids 2-6 



Amidation site. 

amino acids 21-25 
FIGURE 75

^^^^^^^^^'^T^^GGCCGTGGCCAGAGTAGCAAGGAAGGTCAACGAGACGGTGGAGAGCGGCTCTGACK^ 
CTGGACCTGGCCGAGTGCAAGCTGGTCTCCTTTCCCATTGGCATCTACAAGGTCCTGCGGAATGTCT 

atccacctcatcaccctggctaacaacgagcttaagtccctcaccagcaagttcatgaccStt^ 

^^^^'^^^'^^^''^'^'''^'^^^^'^^^C'^'rCCTACACCGCCTCCCCAGCGAGGTCAGTGCCCTGCA^^ 

^^^^^^^'^'^'''''''''^^^^^''t^^^^g^cttccctgagcagcttaccgccctgccgSgctg^^ 

^^^^^^^^^^gcccccctaccttaggccaccctcctcatgcccacccagcaag^^^ 

ggtggggcctgcagcctggactcggcactcaovgctgctgtgcaaactcaggcagatctcctcccctctct 

cttgtcacttgaaaaaaacaggaccctttccctcctttgggctccctggagctttttaa^ 

agttacctccagatcagcaggcacaggtgggcattgccaggtattttctgagcccctgcgSS^^ 

twtagtgctgagagccagttgctgccctgagaagagaagacaacSSSctS 

^.^^^''^''^''^''''^^^^^^'^'^^^^'^'^cttcagtcctgtggtccctggactggtgggS^^ 

rS^^™f'^''^^'^^'^^''^^°^'=^^^<^^C'^°CTGCAGGGCAAGAACATTCATGSGC^^ 

ttgtcatggcctcctgggggcagaaggcgatgtggaccacgggatttgtagccagccagctcccagScScgcc 

caaagccctgatgacctggttcttctgaggccctcaacctggcatcttaggctatggtSgcS^g^^ 

agctgtcctggtttcccaggacatggaactttcaatgctaaaactgggaStcaccSgSIgtcSSm 

gtcccctaccaggagagggcctggggctcttgcttcccgagaacgcctgSgcttgaagScS^^ 

tcctcaggtatctacctcccaccttctcctcatctgtggagcaagcSactcactScc^^ 

tgcatctttgtttgctccagagacacctgaggccccagagcttgaggcaaaSSggSct^^ 

^^^^^^''^^^^'''^^^^ctttactactcctaaggctaagatgttgagagctcagaSctSca^^ 

o^^ISS^'^'^'^'^^^^^^^c^gttccctcgtccctgcccaggtctcttgtctcttgggSgSS^^ 

gccattgttgggccatcactgagcgctcagtatctcaagagactctgttcattctgctcgtmSca^^^ 

n^r^J™^'^''^^°'^^*'^^"^^^^C'^<^S*TGAGGAGGTCAAGACAGGATGTCCA^^^^ 
^J™*^^^®^°^^°'^^<^'^'^^C'^<^^GTCCTGTGCAGGCTGCGCTGGCCCCACCTGSTGTGC?a^^^ 

ggggtttctccttgttgaagaagaggcatccttctctgatgtgcacaaacacaatgtatSccSS 

»^^™^^^^^*^^^'^^'^^^'^^'^^c'^'=cgagcagagggtgaagtttctctcagttctctccctgccacttcc 
^^^™J^^^'''^^''''^^^^^^'^^^C'^^cct^cccaccctgggagacacttggcctgtta^ 

TCTGCATGGCACTCCTGAAGAAGGGAGTGTGACCTGCAGTCACCAGGAGATGAGGGTTAGGTGTGrc^^ 
FIGURE 76

MLKKMGEAVARVARKVNETVESGSDTLDLAECKLVSFPIGIYKVLRNVSGQIHLITLANNELK 
SLTSKFMTTFSQLRELHLEGNFLHRLPSEVSALQHLKAIDLSRNQFQDFPEQLTALPALETIN 
LEENEIVDVPVEKLAAMPALRSINLRFNPLNAEVRVIAPPLIKFDMLMSPEGARAPLP 

Important features of the protein:
N-glycosylation sites.

amino acids 17-21, 47-51
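
The N-glycosylation positions listed for this sequence can be cross-checked with a short script. The following is a minimal sketch, not the method used to prepare the figure: it assumes the sites were called with the standard N-glycosylation consensus pattern N-{P}-[ST]-{P} (Asn, any residue but Pro, Ser or Thr, any residue but Pro), and the names `SEQ` and `find_n_glycosylation_sites` are hypothetical. It reports only the 1-based position of each matching Asn and does not attempt to reproduce the end coordinates printed in the figure.

```python
import re

# FIGURE 76 amino acid sequence, joined from the three printed lines.
SEQ = (
    "MLKKMGEAVARVARKVNETVESGSDTLDLAECKLVSFPIGIYKVLRNVSGQIHLITLANNELK"
    "SLTSKFMTTFSQLRELHLEGNFLHRLPSEVSALQHLKAIDLSRNQFQDFPEQLTALPALETIN"
    "LEENEIVDVPVEKLAAMPALRSINLRFNPLNAEVRVIAPPLIKFDMLMSPEGARAPLP"
)


def find_n_glycosylation_sites(seq):
    """Return 1-based positions of Asn residues matching N-{P}-[ST]-{P}.

    A lookahead is used so that overlapping candidate sites are all reported.
    The pattern is an assumption about how the sites were predicted.
    """
    pattern = re.compile(r"(?=N[^P][ST][^P])")
    return [match.start() + 1 for match in pattern.finditer(seq)]


if __name__ == "__main__":
    print(find_n_glycosylation_sites(SEQ))
```

Run on the sequence above, the function returns `[17, 47]`, which agrees with the start positions of the two sites listed for this protein.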
FIGURE 77

acgcaggcggctcgccccactgtccgacatcgccatcgccamtctctSgg^^ 



lAGGGAGA 



^^^o™^^^^^'^"^"'^^^^'^^'^TCCTTGGCTTCGGCGTTTGGGGTAAGCTCCGTTCTCGTTCTCAA 
rTO^^^^I™^'^^'^''^^^'^^^'^^^CG<^GCCGTCTCGAGAGCCGGCATCTCcSS^^^ 
SAAr^^n^'^^^^*^*^°'''''''''''''^^'^'''^^^^^^^'^'^'^^'^'^CCTGACGCCGGCGGACCCCG^ 



^^S^^^^'^'^'^^^'^^^^^'^'^'^'^^'^^^^^^^'^^CGGCTTCAGGAGGAGTGGAACTTGCAGGAGGCGGAGCGCGAT 
S»!^^^^^^^^^'^^^J=^^^^'^^'r^GTGGACGCGTCGTGGGAGTTGGTGGACCCCACAcSCTT^ 

tttgttcagtttaacgaccaattcttctggggccagctggaggccgtcgagSgStggagS 
ctgtgtgctgggatatgcagctatgaagggaagggtggaatgtgttccatccgtctcagSSScct^^ 
^I^^^^^^^^'^^^^'^^^'^^'^^g^g^ccctcctgcatgaaatgatacaS^ 



ATn^n™^^^^'^°^^^^°''^^^^^'^'^TT'^TAAACATATGCATCGCATCAACAGCCT^^^ 
rI^^r^^^Jn^^^^^''^^"^''^'^''^^^^^^<^'^TGAGTATCGGCGACACTGGTGGCGCTG^ 



^™^^*^^°°°^^'^^'r^TG^CATATATTTGCCAGTACTTCATCTTCAAGATTT^ 

A^n^nl^^^I^^^^^^^^^^^^^^'^-TT'^ATATGGAAAGACCGTATCTTCATTTTCGTGAGTAGAAGGAAA 
H^^^^^^^^^^^CAGCAGATTTTCCCTCCTGGAATTACACATAAAGGACACTAAGCAATTTT^ 



iTAAGATGGA 



rr^^^^If''^''^''^''^^^^^^°^C^'^T^C^TTAGTTTAA?^^^ 
^^^^S^^^^'^^'^'^^^T^'^^CTTTTTTGGCAGATAAACCCAACAGAGGTGAGGC^^ 
^^^^^^^^^'^''''''''''''''^''^T^^^^GAAACAAGCAATTTACCTTCACCTGGGAA^S 

^I^^^^^^^S^^^^^'tcaaaaccattcagcaaatgctgtaag^Stt^^ 



^nr^™~SS!?H^Ti^!I???^TCTGGTCTCCCCTGCTGTTAGTAACACT 

GATA 



^^^S»^™^I^??!^H^5!^T?^TTT?CCAACCAAAAGGCTTTCAGAGGTGTGAATGGATCT^ 



ACATTCCCAAATAAACGACCTAGGCTAGAAGATAAAAAAAAA 
FIGURE 78



MDDDLMLALRLQEEWNLQEAERDHAQESLSLVDASWELVDPTPDLQALFVQFNDQFFWGQLEA 
VEVKWSVRMTLCAGICSYEGKGGMCSIRLSEPLLKLRPRKDLVETLLHEMIHAYLFVTNNDKD 
REGHGPEFCKHMHRINSLTGANITVYHTFHDEVDEYRRHWWRCNGPCQHRPPYYGYVKRATNR 
EPSAHDYWWAEHQKTCGGTYIKIKEPENYSKKGKGKAKLGKEPVLAAENKGTFVYILLIFM 

Important features of the protein:
Signal peptide:

amino acids 1-41

N-glycosylation sites.
amino acids 148-151, 217-220

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 184-187 

Casein kinase II phosphorylation sites. 

amino acids 30-33, 121-124, 154-157, 187-190, 192-195 

Tyrosine kinase phosphorylation site. 

amino acids 211-218 

N-myristoylation sites.

amino acids 59-64, 85-90, 146-151 



Neutral zinc metallopeptidases, zinc-binding region signature.

amino acids 108-117
FIGURE 79

CGGACGCGTGGGTGGCAACCAGGAGAAGCCAAACTTGGTCCCCCGGCTCGCGGAGTGCCTGCG 

AGCGGTGCTCATGGCGCTCTATGAGGTCTTCTCTCACCCGGTCGAGCGCAGTTACCGCGCGGG 

GCTCTGCTCCAAAGCCGCGCTGTTCCTGCTGCTGGCCGCTGCGCTCACGTACATCCCGCCGCT 

GCTGGTGGCCTTCCGGAGCCACGGGTTTTGGCTGAAGCGGAGCAGCTACGAGGAGCAGCCGAC 

CGTGCGCTTCCAACACCAGGTGCTGCTCGTGGCCCTGCTCGGACCCGAAAGCGACGGGTTCCT 

CGCCTGGAGCACGTTCCCCGCCTTCAACCGGCTGCAAGGGGATCGCCTGCGCGTCCCGCTCGT 

TTCGACTAGAGAAGAAGACAGGAACCAGGATGGGAAGACGGACATGTTACATTTTAAGCTGGA 

GCTTCCCCTGCAGTCCACGGAGCACGTTCTCGGTGTGCAGCTCATCCTGACTTTCTCCTATCG 

ATTACACAGGATGGCGACCCTCGTGATGCAGAGCATGGCGTTTCTCCAGTCCTCCTTTCCTGT 

CCCGGGATCCCAGTTATACGTGAACGGAGACCTGAGGCTGCAGCAGAAGCAGCCGCTGAGCTG 

TGGTGGCCTAGATGCCCGATACAACATATCCGTGATCAACGGGACCAGCCCCTTTGCCTATGA 

CTACGACCTCACCCATATTGTTGCTGCCTACCAGGAGAGGAACGTTACCACCGTCCTGAATGA 

TCCCAACCCCATCTGGCTGGTGGGCAGGGCCGCAGATGCTCCATTTGTGATTAATGCTATCAT 

CCGATACCCTGTGGAAGTCATTTCTTATCAGCCAGGATTCTGGGAGATGGTAAAGTTCGCCTG 

GGTACAGTATGTCAGCATCCTGCTTATCTTCCTCTGGGTGTTTGAAAGAATCAAGATCTTCGT 

GTTTCAGAATCAGGTGGTGACCACCATTCCTGTGACAGTGACGCCCCGGGGAGACTTGTGTAA 

GGAGCACTTATCCTAGAAAGGCCATTTCTGAAGACTCAGCAGGACCGTGGCTGCCTCATTGTC 

ATCTTCTGGGAACATCTTAGGACCTTTTGAAAGAGCCCAGCGGACACCTGCGGGCTTGTGTGC 

TTTTCCCTCAGAGACAACGGTTCTTTCCGGTTTTGCTCTACACAGTTCCGTATCTTCAGAGCT 

CCTGCAGAATTGTCAGGGACTAGTTTGTGGAAAGGTCTGAGAGTTCCTGGAGGCTATAATTAG 

CTTTTTGGGTTTTCCTTCTTTGCCTTAGCGTTGAATTTCAGGAGAAAATTGCAGTCAGTTCAG 

ACATCTTGGAAAGAGTCCCATCTCTGGTCAAGCAGAGACTTTTCCTCTGTTGAACTGAGGAAC 

ACACTGTGCATTTCTTCCTTCTGTTGTGAGCCACTCTTACTCTTTTCAGGGCTCTCTTGTGAC 

AAACATGCCAATCACTAGCACTTTGCACCCCTGGGCTTCTCCATTTCCCATTCACAGCTTTGA 

TTTCCAGAGCTGAGGCCTTTAACTGGAGACCTGGAGGGGCAGGGCCCAAGGGCAAGGGCCGCA 

TTAGCACAGGCAATCAGGGAGGGCCGCTGAAGGACACTTGGACCGTCCACCTGCCCCAGCCCA 

ACAGTCAGTCATCTGTCATCAGCTCAGCTGAGCAGCCCTGGATCTTTGCCGTACTGTGACTGG 

GCTCTTTGCCCTATTTTTCCCTCTGTCTGTGCCCCTGGATGGCAGGCTGAAGTCAGAGGGGCT 

GTTTCATTCTCAGCCCCCTCAGCAGCACTGGGGGAAGAAAGCATTGTCACAACAGGTTCTTTC 

TGGCCCTCACCCAACAGCCTGGGCACTTGGCCCTCCTCCTCCTTGACAGCCCTCCCCCTTCCT 

GCAAAGGACAGGGGCGACAGGGGTTGGTGTTGGGATTGGCTCCCGCTGCCTGACAACCACAAG 

TTTATTTGGAAGGCTAGCGGGAAGCCCAGCGGCTGGCGTTTCCCTTGACTAAGGAACAGGGTG 

CCCATCAGAGTGGGGCGGGCAGCTTTGGGAAGGACACAAGAAGCAGTAAGAGTGTAAAGAGGA 

TGCTGGCCTGGGCAGGCCAGTCCAGCCTGGCCACTAGCAGAATACCAAGCAGTCCAGTGGATT 

ACCCTCGTGGCTAAGCAAGTGTCTGCAGGAGCAGAGATGGCTGGAAGGGGCCTCTGCACACGG 

AAGATGGCTTGTTCAGCCCATTCACCTCCTGAGGATGTGGGCAGTCTCCTCCAAGAACACATG 

GAGCTGCTTCCTGATCCCAAGCAGGTCATTGCCACTGGAAGGACATGGCCCCGGTGATCCATG 

CTTCATGCCCACCCAGAAACACACCCCTCAGTGTGTGCCTCAGTTTACTTTGGAGATCAGTTG 

TCGTTTTTAGTGCTCCTTTAGGCTTACTAAAACAGTTTTGGAAACAAAGCTATTTTGAAGTAT 

TCAAGCAGAGGAATTCCCTAACACTGACCCCCTTGTCTTTTTTTAATATTCAGGCTGTTTTAT 

ATGCCTAAATTTTTTTCTTAAGATCTAAACGAAAAATAGTTTCTTGTTTAAATTCACATAAGG 

CAATGAGATATGGAAAGATGACAAGATACGTATAAACATTGGTTTGCATCTTATTAAATTATT 

CTAATGCAAATCTTGTATAAAGAACCCATGATGTTTTGTAACTTTCTAATTAAAATGTTCAAA 
ATGAG 
FIGURE 80

MALYEVFSHPVERSYRAGLCSKAALFLLLAAALTYIPPLLVAFRSHGFWLKRSSYEEQPTVRF 
QHQVLLVALLGPESDGFLAWSTFPAFNRLQGDRLRVPLVSTREEDRNQDGKTDMLHFKLELPL 
QSTEHVLGVQLILTFSYRLHRMATLVMQSMAFLQSSFPVPGSQLYVNGDLRLQQKQPLSCGGL 
DARYNISVINGTSPFAYDYDLTHIVAAYQERNVTTVLNDPNPIWLVGRAADAPFVINAIIRYP 
VEVISYQPGFWEMVKFAWVQYVSILLIFLWVFERIKIFVFQNQVVTTIPVTVTPRGDLCKEHLS

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 268-284 

N-glycosylation sites. 

amino acids 194-198, 199-203, 221-225 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 51-55 

Tyrosine kinase phosphorylation site. 

amino acids 250-259 

N-myristoylation site. 

amino acids 187-193 



Cell attachment sequence.

amino acids 307-310 
FIGURE 81

r^^^n^^*^'''^''^''^^'^^^^'^^^^^CC^'t'ACTGTCTGGATATTGATGSTCTG^ 

gtgtgcccacatctgcatcaataccttgggcagctaccgctgcgagtgccgggaaggctaStS^ 

tgggaagacatgtaccaggggagacaaatatcccaatgacactggccatgagSgtS^^^ 

S^S^^^'^^^^'^^^^cacatgcaaggagttctaccagatgaagSSccgS^ 

^^^^^^'^'^^'''^^'^''''^^^c^^^'^'^tgggcaagtatatcactggtgacaagctgSScct^ 

tccaggacctcctggcctgcctgggggccagggccctcccggctcaccag^ccaSSgSSc?SS^^ 



^SI»^^^^*^^^^'^^^^^^^^<^^GAAAGACCACTGGAGACCTAGAAAACATACATTTTTCTrTTr 
^^!S?SSI?^???I????^CTCCTCTTCTTCCAAATACGATGCTATTTTCAG^ 



™nfS°''^*^''°^^^'^*'^'^'^'^C<='rGCTTCTCACTAAGAGTCCAT^ 

ttotacatcctatttttccaggaactttggatttaagtactctcacagtgtcttaaatcatS 
aaatttggcagagtatcaaaagggggaaaatgacaaagtgagctctaagSgtgIg^ 



J^S^^^'^'^'^^^^^^^'^^^^^T'^T'^TATTTTAAAGCTGTTTAATCAACAGTGGAG^^ 
T^JJ^ir°^^'^^'''^^^<^'^^^°^<^<=TTAGAGAGAGGTGCAGAATTcd^Sc^^^ 

J^S^^P^^'^^^'^^^'^^^c^^ggctgaaccaatatacagtggggctgaagtctgSag^^^ 
cagttcttttaggccttctctttgatttattttcccctgcatgtgagaagcagttcagaa^ 

^^S^S^?^^I?^1^I^?^????^T«^CAGA^^^^^^ 

'GTAACGTTGTTGAGGGCCCT 

ACCTTTGCGGGAGATTTT^ACT^^^^^^ 



o™^S^'^'^^^^^^^^^^<^<^^^"^GAGACCAGAAGGATGATCTGCTCCTTTGTAACGTTGTTGAGGGCC 



??^??S?cgggSSg?S^^^^ 
FIGURE 82



MVPPPPSRGGAARGQLGRSLGPLLLLLALGHTWTYREEPEDGDREICSESKIATTKYPCLKSS 
GELTTCYRKKCCKGYKFVLGQCIPEDYDVCAEAPCEQQCTDNFGRVLCTCYPGYRYDRERHRK 
REKPYCLDIDECASSNGTLCAHICINTLGSYRCECREGYIREDDGKTCTRGDKYPNDTGHEKS 
ENMVKAGTCCATCKEFYQMKQTVLQLKQKIALLPNNAADLGKYITGDKVLASNTYLPGPPGLP 
GGQGPPGSPGPKGSPGFPGMPGPPGQPGPRGSMGPMGPSPDLSHIKQGRRGPVGPPGAPGRDG 
SKGERGAPGPRGSPGPPGSFDFLLLMLADIRNDITELQEKVFGHRTHSSAEEFPLPQEFPSYP 
EAMDLGSGDDHPRRTETRDLRAPRDFYP 

Important features of the protein: 
Signal peptide: 

amino acids 1-34
N-glycosylation sites.
amino acids 142-148, 182-188
Tyrosine kinase phosphorylation site.
amino acids 125-132 
N-myristoylation sites. 

amino acids 10-16, 143-149, 155-161, 196-202, 250-256 
Amidation site. 

amino acids 299-303 

Aspartic acid and asparagine hydroxylation site. 

amino acids 150-162 

Cell attachment sequence. 

amino acids 176-179 

C1q domain proteins.

amino acids 247-280 

Calcium-binding EGF-like domain proteins pattern proteins. 

amino acids 144-165
FIGURE 83

ATCTGAGTGAGCTAACTGACACAATGAAACTGTCAGGCATGTTTCTGCTCCTCTCTCTGGCTC 

TTTTCTGCTTTTTAACAGGTGTCTTCAGTCAGGGAGGACAGGTTGACTGTGGTGAGTTCCAGG 

ACCCCAAGGTCTACTGCACTCGGGAATCTAACCCACACTGTGGCTCTGATGGCCAGACATATG 

GCAATAAATGTGCCTTCTGTAAGGCCATAGTGAAAAGTGGTGGAAAGATTAGCCTAAAGCATC 

CTGGAAAATGCISAGTTAAAGCCAATGTTTCTTGGTGACTTGCCAGCTTTTGCAGCCTTCTTT 

TCTCACTTCTGCTTATACTTTTGCTGGTGGATTCCTTTAATTCATAAAGACATACCTACTCTG 

CCTGGGTCTTGAGGAGTTCAATGTATGTCTATTTCTCTTGATTCACTTGTCAATAAAGTACATTC 
TGCAAAAGCAAAAA 
FIGURE 84



MKLSGMFLLLSLALFCFLTGVFSQGGQVDCGEFQDPKVYCTRESNPHCGSDGQTYGNKCAFCK 
AIVKSGGKISLKHPGKC

Important features of the protein:
Signal peptide:

amino acids 1-23

N-myristoylation sites.

amino acids 26-32, 52-58, 56-62, 69-75

Kazal serine protease inhibitors family signature.

amino acids 40-63
FIGURE 85



GGAGCAGACACACAGACCCGGGCCGGAGGCCCCTCTTCTAGCGCTGCGGGAACCGGACAGTTC 

CCCAACTGGGGACTCTGGAACCACAGCTCCTAAATCATCAAATTCTCAAGCTTTTTTTTTCCC 

TCTCTTCGTCCCAGCCATCCCAGTCTTCTTCTTCTTTTTTTTTTTTTTAACTTATTGTTTTTT 

TCGCTCCTGTCATTATGAAAGTGGTCACGCCATTCAATATTAAGACTTGGAGGGAATTGGGGA 

AAGAAAAGAAAGAATCTAAAAGAAGAGAAGCGACCGGTGCTTTTAAGGGTGTCTAATTTTCAA 

AAGAGACGTCTGGGAGTATTTTGCTCTGGGCGTTTGGAGCAACTTCGCGGACAGCGGAGCTCG 

CCCAGCATGGATGTTCCAGGTTCACAGGCGCCTTTCTTCTGAGAACGACCCTGGCCTTGAACG 

TCAGAGCCGGGGACGAAGGCCCCCGGAGGCTGCTGCGAGCTCCGCGCGTTCCTTCGCGCCCTT 

CCGCGCCGCTCGCGCCGGCGCCGGCCTCCACCCCCGCGCGCCGCCTCCCACCAGTCCCGATGC 

AGGCGCCCGGCCGGGGGCCACTCGGGCTGCGGCTGATGATGCCCGGGCGCCGGGGGGCGCTGC 

GCGAGCCTGGCGGCTGCGGATCCTGCCTGGGGGTGGCGCTGGCCCTGCTGTTGCTGCTACTGC 

CCGCCTGCTGCCCCGTGCGGGCGCAGAACGACACGGAGCCCATCGTGCTGGAGGGCAAGTGCC 

TGGTGGTGTGCGACTCCAGCCCGTCGGCGGACGGCGCCGTCACCTCCTCCCTAGGCATCTCCG 

TGCGCTCCGGCAGCGCCAAGGTGGCCTTCTCCGCCACGCGGAGCACCAACCACGAGCCGTCCG 

AGATGAGCAACCGCACCATGACCATCTATTTCGACCAGGTATTAGTAAATATTGGCAACCACT 

TTGATCTTGCTTCCAGTATATTTGTAGCACCGAGAAAAGGGATTTATAGCTTCAGCTTCCACG 

TGGTCAAAGTGTATAACAGACAAACCATCCAGGTCAGTTTAATGCAGAATGGCTACCCAGTGA 

TCTCGGCCTTTGCAGGAGACCAGGATGTCACCAGAGAAGCTGCTAGCAATGGCGTGCTGCTGC 

TCATGGAAAGGGAAGACAAAGTGCATCTCAAACTTGAGAGAGGCAACCTCATGGGGGGCTGGA 

AATACTCCACATTCTCGGGCTTCTTGGTGTTTCCTCTATAAACACAGAGCCCCCTAGATGGTG 

GGGGAATGGCAAACTGGACCCAGGACTCCGCCCTTTAAAACACCCTGAACTTACTGGAATTGG 

ACACCTTGTTTCCAACCTCCGTCAGACTGTTGCAGTAGAAGAATGATTTCCTTTGAAACCT.ee 

AGTACTTTTGTTTTTGTTTTTTGGAATACTGACAATTCCTCGGGAACCTGGCCTCTAATTAGT 

TTTAGATGACAAGGTCTTAAGGAGAAATGAAATTATCGATTTGAGCAATTTGTACCTGTGATT 

GTAAAGTCAATATCGGATTTTATTGTTGGGACCATGGACCTCTTTTGTTTGTATGTTGTATTG 

TCGTCCCAACGGAAGGAGAGCTCCTGACTCCAGGATGGGCTGCAGGTTGCAGTCAGGGCTTGA

AGTAGGAGCCCAGCAAAGAACCACCTGCTGGACAGTCCTTGACATGTGTTCTGTGTGTGTCTG

TATAGCCTTAAGAAAAAGAATGGCTTCACTTTCATTCTGTATTCTTCCCCCCACCATGTGGCT 

GGGAGGACTTGGGAGGGGGATGGGGACATTGGGAACCTGTCAAGAAGTGCTTTATCCAGAGAA 

GCAAATTTTGCACGATTGGACTGCAATTTTTGTTTTGTATTGTTTGTGTTTTTTCTTGAAAAG 

CTTTACTTTTCTTTCCACACTCAGCTCTCCCTCCTCAACCCCACTTTTATTTTTCTTGCTGGG 

GTTGAGGAGAGAAAATATAGAATTCCTGGATAAGACCAAACAAAACAAAACATTAAAATACCT 

GTATGTTTTGTTTTAGACGAGACCAAACTAAACAAAAAGTATCTGTTTATCAAAGTAAAAGTA 

ACACAATCGACAATTCTGCTTATTCTCTCAAAGAGATTCTAAGATGCACCTTTAGAACTATTA 

ATAGCAACCTGCATTTTTTTTTAATTTATACTTCAGAATCCTTTAAGAACCTGGTGTTCCTGA 

GTGGTCCTGAATCATATAAGTTGGTAATGGAAGCTGTAATGACCAAGTCCCCTAAACATACTA 

TGTCTTTGCCACGTGTGCTGTGACTTCTCTGTGGGTGATTTAATTTATTTGGATCCACCTCTG 

AGTGAGCGCACAGTGATCAGGTGCTTCAAAGCCAACAGACCAGCTCCTCTTCCTCCGGATCCT

CTTTTGATCTGCCCAGGAAAGGGATGCATTGACACTCTCCTGCATGCACCTGGCGAGAAGCCA 

CCTGAAAGTCACTGTGGTTAAAGATATTGGTGGAGGTACCCCAGGAGCACTGTTACAAATCCT 

TCTTGTTTTGGCATCTCGTACAACATTATTAAGACACAGCTGAGAGTTGATGGGTGTGTAATG 

CATATGCCAAGGAAATGTCACTAATCCCAAAGCAATCAAAAAGGAGACCTCAAACCAGATGTT 

AATTTGTTCTTTGTGTAACAATGTAACCAAAATATTGATGATAAAAGTCATAATTTAAGATTC 

AGAATAAATGGGTTTGATGTCTGGCAAAAAAAAAAAAAAAA 
FIGURE 86

MQAPGRGPLGLRLMMPGRRGALREPGGCGSCLGVALALLLLLLPACCPVRAQNDTEPIVLEGK 
CLVVCDSSPSADGAVTSSLGISVRSGSAKVAFSATRSTNHEPSEMSNRTMTIYFDQVLVNIGN
HFDLASSIFVAPRKGIYSFSFHVVKVYNRQTIQVSLMQNGYPVISAFAGDQDVTREAASNGVL
LLMEREDKVHLKLERGNLMGGWKYSTFSGFLVFPL 

Important features of the protein:
Signal peptide: 

amino acids 1-48 

N-glycosylation sites. 

amino acids 53-57, 110-114 

N-myristoylation sites. 

amino acids 26-32, 27-33, 29-35, 33-39, 76-82, 205-211 

Amidation site. 

amino acids 16-20 

C1q domain signature.

amino acids 117-148

C1q domain proteins.

amino acids 115-149
FIGURE 87

AGGGCCCGCGGGTGGAGAGAGCGACGCCCGAGGGGATGGCGGCAGCGTCCCGGAGCGCCTCTG
GCTGGGCGCTACTGCTGCTGGTGGCACTTTGGCAGCAGCGCGCGGCCGGCTCCGGCGTCTTCC 
AGCTGCAGCTGCAGGAGTTCATCAACGAGCGCGGCGTACTGGCCAGTGGGCGGCCTTGCGAGC 
CCGGCTGCCGGACTTTCTTCCGCGTCTGCCTTAAGCACTTCCAGGCGGTCGTCTCGCCCGGAC 
CCTGCACCTTCGGGACCGTCTCCACGCCGGTATTGGGCACCAACTCCTTCGCTGTCCGGGACG 
ACAGTAGCGGCGGGGGGCGCAACCCTCTCCAACTGCCCTTCAATTTCACCTGGCCGGGTACCT
TCTCGCTCATCATCGAAGCTTGGCACGCGCCAGGAGACGACCTGCGGCCAGAGGCCTTGCCAC 
CAGATGCACTCATCAGCAAGATCGCCATCCAGGGCTCCCTAGCTGTGGGTCAGAACTGGTTAT 
TGGATGAGCAAACCAGCACCCTCACAAGGCTGCGCTACTCTTACCGGGTCATCTGCAGTGACA 
ACTACTATGGAGACAACTGCTCCCGCCTGTGCAAGAAGCGCAATGACCACTTCGGCCACTATG 
TGTGCCAGCCAGATGGCAACTTGTCCTGCCTGCCCGGTTGGACTGGGGAATATTGCCAACAGC 
CTATCTGTCTTTCGGGCTGTCATGAACAGAATGGCTACTGCAGCAAGCCAGCAGAGTGCCTCT 
GCCGCCCAGGCTGGCAGGGCCGGCTGTGTAACGAATGCATCCCCCACAATGGCTGTCGCCACG 
GCACCTGCAGCACTCCCTGGCAATGTACTTGTGATGAGGGCTGGGGAGGCCTGTTTTGTGACC 
AAGATCTCAACTACTGCACCCACCACTCCCCATGCAAGAATGGGGCAACGTGCTCCAACAGTG 
GGCAGCGAAGCTACACCTGCACCTGTCGCCCAGGCTACACTGGTGTGGACTGTGAGCTGGAGC 
TCAGCGAGTGTGACAGCAACCCCTGTCGCAATGGAGGCAGCTGTAAGGACCAGGAGGATGGCT 
ACCACTGCCTGTGTCCTCCGGGCTACTATGGCCTGCACTGTGAACACAGCACCTTGAGCTGCG 
CCGACTCCCCCTGCTTCAATGGGGGCTCCTGCCGGGAGCGCAACCAGGGGGCCAACTATGCTT 
GTGAATGTCCCCCCAACTTCACCGGCTCCAACTGCGAGAAGAAAGTGGACAGGTGCACCAGCA 
ACCCCTGTGCCAACGGGGGACAGTGCCTGAACCGAGGTCCAAGCCGCATGTGCCGCTGCCGTC 
CTGGATTCACGGGCACCTACTGTGAACTCCACGTCAGCGACTGTGCCCGTAACCCTTGCGCCC 
ACGGTGGCACTTGCCATGACCTGGAGAATGGGCTCATGTGCACCTGCCCTGCCGGCTTCTCTG 
GCCGACGCTGTGAGGTGCGGACATCCATCGATGCCTGTGCCTCGAGTCCCTGCTTCAACAGGG 
CCACCTGCTACACCGACCTCTCCACAGACACCTTTGTGTGCAACTGCCCTTATGGCTTTGTGG 
GCAGCCGCTGCGAGTTCCCCGTGGGCTTGCCGCCCAGCTTCCCCTGGGTGGCCGTCTCGCTGG 
GTGTGGGGCTGGCAGTGCTGCTGGTACTGCTGGGCATGGTGGCAGTGGCTGTGCGGCAGCTGC 
GGCTTCGACGGCCGGACGACGGCAGCAGGGAAGCCATGAACAACTTGTCGGACTTCCAGAAGG 
ACAACCTGATTCCTGCCGCCCAGCTTAAAAACACAAACCAGAAGAAGGAGCTGGAAGTGGACT
GTGGCCTGGACAAGTCCAACTGTGGCAAACAGCAAAACCACACATTGGACTATAATCTGGCCC
CAGGGCCCCTGGGGCGGGGGACCATGCCAGGAAAGTTTCCCCACAGTGACAAGAGCTTAGGAG 
AGAAGGCGCCACTGCGGTTACACAGTGAAAAGCCAGAGTGTCGGATATCAGCGATATGCTCCC 
CCAGGGACTCCATGTACCAGTCTGTGTGTTTGATATCAGAGGAGAGGAATGAATGTGTCATTG 

CCACGGAGGTATAAGGCAGGAGCCTACCTGGACATCCCTGCTCAGCCCCGCGGCTGGACCTTC
CTTCTGCATTGTTTACA 
FIGURE 88

MAAASRSASGWALLLLVALWQQRAAGSGVFQLQLQEFINERGVLASGRPCEPGCRTFFRVCLK
HFQAVVSPGPCTFGTVSTPVLGTNSFAVRDDSSGGGRNPLQLPFNFTWPGTFSLIIEAWHAPG
DDLRPEALPPDALISKIAIQGSLAVGQNWLLDEQTSTLTRLRYSYRVICSDNYYGDNCSRLCK
KRNDHFGHYVCQPDGNLSCLPGWTGEYCQQPICLSGCHEQNGYCSKPAECLCRPGWQGRLCNE
CIPHNGCRHGTCSTPWQCTCDEGWGGLFCDQDLNYCTHHSPCKNGATCSNSGQRSYTCTCRPG
YTGVDCELELSECDSNPCRNGGSCKDQEDGYHCLCPPGYYGLHCEHSTLSCADSPCFNGGSCR
ERNQGANYACECPPNFTGSNCEKKVDRCTSNPCANGGQCLNRGPSRMCRCRPGFTGTYCELHV
SDCARNPCAHGGTCHDLENGLMCTCPAGFSGRRCEVRTSIDACASSPCFNRATCYTDLSTDTF
VCNCPYGFVGSRCEFPVGLPPSFPWVAVSLGVGLAVLLVLLGMVAVAVRQLRLRRPDDGSREA
MNNLSDFQKDNLIPAAQLKNTNQKKELEVDCGLDKSNCGKQQNHTLDYNLAPGPLGRGTMPGK
FPHSDKSLGEKAPLRLHSEKPECRISAICSPRDSMYQSVCLISEERNECVIATEV

Important features of the protein:
Signal peptide:
amino acids 1-26
Transmembrane domain:
amino acids 530-552
N-glycosylation sites.

amino acids 108-112, 183-187, 205-209, 393-397, 570-574, 610-614
Glycosaminoglycan attachment site.

amino acids 96-100

Tyrosine kinase phosphorylation site.

amino acids 340-347
N-myristoylation sites.

amino acids 42-48, 204-210, 258-264, 277-283, 297-303, 383-389,
415-421, 461-467, 522-528, 535-541, 563-569, 599-605, 625-631
Amidation site.
amino acids 471-475

Aspartic acid and asparagine hydroxylation site.

amino acids 339-351 

EGF-like domain cysteine pattern signature. 

amino acids 173-185, 206-218, 239-251, 270-282, 310-322, 348-360, 
388-400, 426-438, 464-476, 506-518 
Calcium-binding EGF-like: 

amino acids 224-245, 255-276, 295-316, 333-354, 373-394, 411-432, 
449-470 
FIGURE 89

GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGAC

TTGACTCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGC 

CGCCCAGTCCCGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCC 

TCCTTTTCATTCATAACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCC 

GAGCGTGGAAGAATGGGGTTCCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATT

CAAGCTTTCCCCAAACCTGGAGGAAGCCAAGACAAATCTCTACATAATAGAGAATTAAGTGCA 

GAAAGACCTTTGAATGAACAGATTGCTGAAGCAGAAGAAGACAAGATTAAAAAAACATATCCT 

CCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTTTGTTGATAACTTGAACCTGCTAAAGGCA 

ATAACAGAAAAGGAAAAAATTGAGAAAGAAAGACAATCTATAAGAAGCTCCCCACTTGATAAT 

AAGTTGAATGTGGAAGATGTTGATTCAACCAAGAATCGAAAACTGATCGATGATTATGACTCT 

ACTAAGAGTGGATTGGATCATAAATTTCAAGATGATCCAGATGGTCTTCATCAACTAGACGGG 

ACTCCTTTAACCGCTGAAGACATTGTCCATAAAATCGCTGCCAGGATTTATGAAGAAAATGAC 

AGAGCCGTGTTTGACAAGATTGTTTCTAAACTACTTAATCTCGGCCTTATCACAGAAAGCCAA 

GCACATACACTGGAAGATGAAGTAGCAGAGGTTTTACAAAAATTAATCTCAAAGGAAGCCAAC 

AATTATGAGGAGGATCCCAATAAGCCCACAAGCTGGACTGAGAATCAGGCTGGAAAAATACCA 

GAGAAAGTGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACA 

GTATCTAACACATTAACCTTGACAAATGGCTTGGAAAGGAGAACTAAAACCTACAGTGAAGAC 

AACTTTGAGGAACTCCAATATTTCCCAAATTTCTATGCGCTACTGAAAAGTATTGATTCAGAA 

AAAGAAGCAAAAGAGAAAGAAACACTGATTACTATCATGAAAACACTGATTGACTTTGTGAAG 

ATGATGGTGAAATATGGAACAATATCTCCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGAT 

GAAATGATTGCTCTTCAGACCAAAAACAAGCTAGAAAAAAATGCTACTGACAATATAAGCAAG 

CTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAACAGACAGTACCAAGGAAGAAGCAGCT 

AAGATGGAAAAGGAATATGGAAGCTTGAAGGATTCCACAAAAGATGATAACTCCAACCCAGGA 

GGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGCCATCAGAAAAAATATT 

GAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTTCAAAGATGAGA 

GACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAAGAAGCC 

GAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAAC 

TGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCC 

AAGGGTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGC 

TTTCTTCCCGTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAA 
AAA 
FIGURE 90

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENK 
PGQSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDYDSTKSG 
LDHKFQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTL 
EDEVAEVLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNT 
LTLTNGLERRTKTYSEDNFEELQYFPNFYALLKSIDSEKEAKEKETLITIMKTLIDFVKMMVK 
YGTISPEEGVSYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEK 
EYGSLKDSTKDDNSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFIN 
KQADAYVEKGILDKEEAEAIKRIYSSL 

Important features: 
N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217-220,
265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368-371,
369-372, 382-385, 386-389, 387-390

N-myristoylation sites:

amino acids 143-148, 239-244 
FIGURE 91

TGCATCAGTGCCCAGGCAAGCCCAGGAGTTGACATTTCTCTGCCCAGCCATGGGCCTCACCCT 

GCTCTTGCTGCTGCTCCTGGGACTAGAAGGTCAGGGCATAGTTGGCAGCCTCCCTGAGGTGCT 

GCAGGCACCCGTGGGAAGCTCCATTCTGGTGCAGTGCCACTACAGGCTCCAGGATGTCAAAGC 

TCAGAAGGTGTGGTGCCGGTTCTTGCCGGAGGGGTGCCAGCCCCTGGTGTCCTCAGCTGTGGA 

TCGCAGAGCTCCAGCGGGCAGGCGTACGTTTCTCACAGACCTGGGTGGGGGCCTGCTGCAGGT 

GGAAATGGTTACCCTGCAGGAAGAGGATGCTGGCGAGTATGGCTGCATGGTGGATGGGGCCAG 

GGGGCCCCAGATTTTGCACAGAGTCTCTCTGAACATACTGCCCCCAGAGGAAGAAGAAGAGAC 

CCATAAGATTGGCAGTCTGGCTGAGAACGCATTCTCAGACCCTGCAGGCAGTGCCAACCCTTT 

GGAACCCAGCCAGGATGAGAAGAGCATCCCCTTGATCTGGGGTGCTGTGCTCCTGGTAGGTCT 

GCTGGTGGCAGCGGTGGTGCTGTTTGCTGTGATGGCCAAGAGGAAACAAGAATCCCTCCTCAG 

TGGTCCACCACGTCAGS^CTCTGGACCGGCTGCTGAATTGCCTTTGGATGTACCACACATTA 

GGCTTGACTCACCACCTTCATTTGACAATACCACCTACACCAGCCTACCTCTTGATTCCCCAT 

CAGGAAAACCTTCACTCCCAGCTCCATCCTCATTGCCCCCTCTACCTCCTAAGGTCCTGGTCT 

GCTCCAAGCCTGTGACATATGCCACAGTAATCTTCCCGGGAGGGAACAAGGGfGGAGGGACCT 

CGTGTGGGCCAGCCCAGAATCCACCTAACAATCAGACTCCATCCAGCTAAGCTGCTCATCACA 

CTTTAAACTCATGAGGACCATCCCTAGGGGTTCTGTGCATCCATCCAGCCAGCTCATGCCCTA 

GGATCCTTAGGATATCTGAGCAACCAGGGACTTTAAGATCTAATCCAATGTCCTAACTTTACT 

AGGGAAAGTGACGCTCAGACATGACTGAGATGTCTTGGGGAAGACCTCCCTGCACCCAACTCC 

CCCACTGGTTCTTCTACCATTACACACTGGGCTAAATAAACCCTAATAATGATGTGCAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 92



MGLTLLLLLLLGLEGQGIVGSLPEVLQAPVGSSILVQCHYRLQDVKAQKVWCRFLPEGCQPLV
SSAVDRRAPAGRRTFLTDLGGGLLQVEMVTLQEEDAGEYGCMVDGARGPQILHRVSLNILPPE
EEEETHKIGSLAENAFSDPAGSANPLEPSQDEKSIPLIWGAVLLVGLLVAAVVLFAVMAKRKQ
ESLLSGPPRQ

Important features of the protein:
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 161-181 

N-myristoylation sites. 

amino acids 17-23, 172-178 

Amidation site. 

amino acids 73-79 
FIGURE 93

GGCGGCGTTGCCGGGCTCTCCGGAAGGAGACGTGGCGGCGGTTGGGCCGGTGATACCCGGGCG 

CTTTATAGTCCCGCCGCCTCCTCCTCCACCTCCTCCTCCTCCTCCTCTCCTCCTGGGGCAGAG 

GAGGTTGTGGCGGTGGCTGGAGAAAGCGGCGGCGGAGGATGGAGGAAGGAGGCGGCGGCGTAC 

GGAGTCTGGTCCCGGGCGGGCCGGTGTTACTGGTCCTCTGCGGCCTCCTGGAGGCGTCCGGCG 

GCGGCCGAGCCCTTCCTCAACTCAGCGATGACATCCCTTTCCGAGTCAACTGGCCCGGCACCG 

AGTTCTCTCTGCCCACAACTGGAGTTTTATATAAAGAAGATAATTATGTCATCATGACAACTG 

CACATAAAGAAAAATATAAATGCATACTTCCCCTTGTGACAAGTGGGGATGAGGAAGAAGAAA 

AGGATTATAAAGGCCCTAATCCAAGAGAGCTTTTGGAGCCACTATTTAAACAAAGCAGTTGTT 

CCTACAGAATTGAGTCTTATTGGACTTACGAAGTATGTCATGGAAAACACATTCGGCAGTACC 

ATGAAGAGAAAGAAACTGGTCAGAAAATAAATATTCACGAGTACTACCTTGGGAATATGTTGG 

CCAAGAACCTTCTATTTGAAAAAGAACGAGAAGCAGAAGAAAAGGAAAAATCAAATGAGATTC 

CCACTAAAAATATCGAAGGTCAGATGACACCATACTATCCTGTGGGAATGGGAAATGGTACAC 

CTTGTAGTTTGAAACAGAACCGGCCCAGATCAAGTACTGTGATGTACATATGTCATCCTGAAT 

CTAAGCATGAAATTCTTTCAGTAGCTGAAGTTACAACTTGTGAATATGAAGTTGTCATTTTGA 

CACCACTCTTGTGCAGTCATCCTAAATATAGGTTCAGAGCATCTCCTGTGAATGACATATTTT 

GTCAATCACTGCCAGGATCTCCATTTAAGCCCCTCACCCTGAGGCAGCTGGAGCAGCAGGAAG 

AAATACTAAGGGTGCCTTTTAGGAGAAATAAAGAGGGTGTCGGTTGGTGGAAATATGAATTCT 

GCTATGGCAAACATGTACATCAATACCATGAGGACAAGGATAGTGGGAAAACCTCTGTGGTTG 

TCGGGACATGGAACCAAGAAGAGCATATTGAATGGGCTAAGAAGAATACTGCTAGAGCTTATC 

ATCTTCAAGACGATGGTACCCAGACAGTCAGGATGGTGTCACATTTTTATGGAAATGGAGATA 

TTTGTGATATAACTGACAAACCAAGACAGGTGACTGTAAAACTAAAGTGCAAAGAATCAGATT 

CACCTCATGCTGTTACTGTATATATGCTAGAGCCTCACTCCTGTCAATATATTCTTGGGGTTG 

AATCTCCAGTGATCTGTAAAATCTTAGATACAGCAGATGAAAATGGACTTCTTTCTCTCCCCA 

ACTAAAGGATATTAAAGTTAGGGGAAAGAAAAGATCATTGAAAGTCATGATAATTTCTGTCCC 

ACTGTGTCTCATTATAGAGTTCTCAGCCATTGGACCTCTTCTAAAGGATGGTATAAAATGACT 

CTCAACCACTTTGTGAATACATATGTGTATATAAGAGGTTATTGATAAACTTCTGAGGCAGAC 

ATTTGTCTCGCTTTTTTTCATTTTTGTTGTGTCTTATAAACTGACTGTTTTTCTTTGCTTGGA 

TACTGTGATTCCAAAATAAATCTCATCCAAGCAAGTTAGAGTCCAGCCTAATCAAATGTCATA 

ATTGTTGTACCTATTGAAAGTTTTTAAATAATAGATTTATTATGTAAATTATAGTATATGTAA 

GTAGCTAATGAAGTAAAGATCATGAAGAAAGAAATTGATAGGTGTAAATGAGAGACCATGTAA 

AATATGTAAATTCTAGTACCTGAAATCCTTTCAACAGATTTTTATATAGCAACTGCTCTCTGC 

AAGTAGTTAAACTAGAAACTGGGCACATGGTAGAGGCTCACATGGGAGTTGTCCTCACCCTTG 

TTAATCTCAAGAAACTCTTATTTATAATAGGTTGCTTCTCTCTCAGAACTTTTATCTATTACT 

TTTTTCTTCTTATGAGTATGTTTACTCTCAGAGTATCTATCTGATGTAGACAGTTGGTGATGC 

TTCTGAGACTCAGAATGGTTTACTCTAACAAAACACTGTGCTGTCTATCCCTTGTACTTGCCT 

ACTGTAATATGGATTTCACTTCTGAACAGTTTACAGCACAATATTTATTTTAAAGTGAATAAA 
ATGTCCACAAGCAAAAA 
FIGURE 94

MEEGGGGVRSLVPGGPVLLVLCGLLEASGGGRALPQLSDDIPFRVNWPGTEFSLPTTGVLYKE 
DNYVIMTTAHKEKYKCILPLVTSGDEEEEKDYKGPNPRELLEPLFKQSSCSYRIESYWTYEVC 
HGKHIRQYHEEKETGQKINIHEYYLGNMLAKNLLFEKEREAEEKEKSNEIPTKNIEGQMTPYY 
PVGMGNGTPCSLKQNRPRSSTVMYICHPESKHEILSVAEVTTCEYEVVILTPLLCSHPKYRFR
ASPVNDIFCQSLPGSPFKPLTLRQLEQQEEILRVPFRRNKEGVGWWKYEFCYGKHVHQYHEDK
DSGKTSVVVGTWNQEEHIEWAKKNTARAYHLQDDGTQTVRMVSHFYGNGDICDITDKPRQVTV
KLKCKESDSPHAVTVYMLEPHSCQYILGVESPVICKILDTADENGLLSLPN

Important features of the protein: 
Signal peptide: 

amino acids 1-30 

Glycosaminoglycan attachment site. 

amino acids 28-32 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 337-341 

N-myristoylation sites.

amino acids 6-12, 23-29, 29-35, 49-55, 141-147, 152-158, 192-198, 
196-202 

Gram-positive cocci surface proteins 'anchoring' hexapeptide.

amino acids 54-60 
FIGURE 95



TTCCGTTTCTGGGAGGAGTGAGGGGCAACGGGTCGGAGAAAAAGGAAAAAAGAAGGGCTCAGC 

GCCTCCCCGCCGGGCCGTGGACAGAGGGGCACAGTTTCGGCAGGCGGGTGAGGTCGCTGAGGG 

CCCGCCGGAGATGTTTTCCTTGTCGAGCACGGTGCAACCCCAGGTTACAGTTCCTCTGAGTCA 

TCTCATCAATGCCTTCCATACACCAAAAAACACTTCTGTTTCTCTCAGTGGAGTGTCAGTTTC 

TCAAAACCAGCATCGAGATGTAGTTCCTGAGCATGAGGCTCCCAGCAGTGAGCCTTCACTTAA 

CTTAAGGGACCTTGGATTATCTGAACTAAAAATTGGACAGATTGATCAGCTGGTAGAAAATCT 

ACTTCCTGGATTTTGTAAAGGCAAAAACATTTCTTCCCATTGGCATACATCCCATGTCTCTGC 

ACAATCCTTCTTTGAAAATAAATATGGTAACTTAGATATATTTAGTACATTACGTTCCTCTTG 

CTTGTATCGACATCATTCAAGAGCTCTTCAAAGCATTTGTTCAGATCTTCAGTACTGGCCAGT 

TTTCATACAGTCTCGGGGTTTTAAAACTTTGAAATCAAGGACACGACGTCTCCAGTCTACCTC 

CGAGAGATTAGCTGAAACACAGAATATAGCGCCATCATTCGTGAAGGGGTTTCTTTTGCGGGA 

CAGAGGATCAGATGTTGAGAGTTTGGACAAACTCATGAAAACCAAAAATATACCTGAAGCTCA 

CCAAGATGCATTTAAAACTGGTTTTGCGGAAGGTTTTCTGAAAGCTCAAGCACTCACACAAAA 

AACCAATGATTCCCTAAGGCGAACCCGTCTGATTCTCTTCGTTCTGCTGCTATTCGGCATTTA 

TGGACTTCTAAAAAACCCATTTTTATCTGTCCGCTTCCGGACAACAACAGGGCTTGATTCTGC 

AGTAGATCCTGTCCAGATGAAAAATGTCACCTTTGAACATGTTAAAGGGGTGGAGGAAGCTAA 

ACAAGAATTACAGGAAGTTGTTGAATTCTTGAAAAATCCACAAAAATTTACTATTCTTGGAGG 

TAAACTTCCAAAAGGAATTCTTTTAGTTGGACCCCCAGGGACTGGAAAGACACTTCTTGCCCG 

AGCTGTGGCGGGAGAAGCTGATGTTCCTTTTTATTATGCTTCTGGATCCGAATTTGATGAGAT 

GTTTGTGGGTGTGGGAGCCAGCCGTATCAGAAATCTTTTTAGGGAAGCAAAGGCGAATGCTCC 

TTGTGTTATATTTATTGATGAATTAGATTCTGTTGGTGGGAAGAGAATTGAATCTCCAATGCA 

TCCATATTCAAGGCAGACCATAAATCAACTTCTTGCTGAAATGGATGGTTTTAAACCCAATGA 

AGGAGTTATCATAATAGGAGCCACAAACTTCCCAGAGGCATTAGATAATGCCTTAATACGTCC 

TGGTCGTTTTGACATGCAAGTTACAGTTCCAAGGCCAGATGTAAAAGGTCGAACAGAAATTTT 

GAAATGGTATCTCAATAAAATAAAGTTTGATCAATCCGTTGATCCAGAAATTATAGCTCGAGG 

TACTGTTGGCTTTTCCGGAGCAGAGTTGGAGAATCTTGTGAACCAGGCTGCATTAAAAGCAGC 

TGTTGATGGAAAAGAAATGGTTACCATGAAGGAGCTGGAGTTTTCCAAAGACAAAATTCTAAT 

GGGGCCTGAAAGAAGAAGTGTGGAAATTGATAACAAAAACAAAACCATCACAGCATATCATGA 

ATCTGGTCATGCCATTATTGCATATTACACAAAAGATGCAATGCCTATCAACAAAGCTACAAT 

CATGCCACGGGGGCCAACACTTGGACATGTGTCCCTGTTACCTGAGAATGACAGATGGAATGA 

AACTAGAGCCCAGCTGCTTGCACAAATGGATGTTAGTATGGGAGGAAGAGTGGCAGAGGAGCT 

TATATTTGGAACCGACCATATTACAACAGGTGCTTCCAGTGATTTTGATAATGCCACTAAAAT 

AGCAAAGCGGATGGTTACCAAATTTGGAATGAGTGAAAAGCTTGGAGTTATGACCTACAGTGA

TACAGGGAAACTAAGTCCAGAAACCCAATCTGCCATCGAACAAGAAATAAGAATCCTTCTAAG 

GGACTCATATGAACGAGCAAAACATATCTTGAAAACTCATGCAAAGGAGCATAAGAATCTCGC 

AGAAGCTTTATTGACCTATGAGACTTTGGATGCCAAAGAGATTCAAATTGTTCTTGAGGGGAA 

AAAGTTGGAAGTGAGATGATAACTCTCTTGATATGGATGCTTGCTGGTTTTATTGCAAGAATA 

TAAGTAGCATTGCAGTAGTCTACTTTTACAACGCTTTCCCCTCATTCTTGATGTGGTGTAATT 

GAAGGGTGTGAAATGCTTTGTCAATCATTTGTCACATTTATCCAGTTTGGGTTATTCTCATTA 

TGACACCTATTGCAAATTAGCATCCCATGGCAAATATATTTTGAAAAAATAAAGAACTATCAG 
GATTGAAAACAAAAAAAAAAAA 
FIGURE 96

MFSLSSTVQPQVTVPLSHLINAFHTPKNTSVSLSGVSVSQNQHRDVVPEHEAPSSEPSLNLRD 

LGLSELKIGQIDQLVENLLPGFCKGKNISSHWHTSHVSAQSFFENKYGNLDIFSTLRSSCLYR 

HHSRALQSICSDLQYWPVFIQSRGFKTLKSRTRRLQSTSERLAETQNIAPSFVKGFLLRDRGS 

DVESLDKLMKTKNIPEAHQDAFKTGFAEGFLKAQALTQKTNDSLRRTRLILFVLLLFGIYGLL 

KNPFLSVRFRTTTGLDSAVDPVQMKNVTFEHVKGVEEAKQELQEVVEFLKNPQKFTILGGKLP 

KGILLVGPPGTGKTLLARAVAGEADVPFYYASGSEFDEMFVGVGASRIRNLFREAKANAPCVI 

FIDELDSVGGKRIESPMHPYSRQTINQLLAEMDGFKPNEGVIIIGATNFPEALDNALIRPGRF 

DMQVTVPRPDVKGRTEILKWYLNKIKFDQSVDPEIIARGTVGFSGAELENLVNQAALKAAVDG 

KEMVTMKELEFSKDKILMGPERRSVEIDNKNKTITAYHESGHAIIAYYTKDAMPINKATIMPR 

GPTLGHVSLLPENDRWNETRAQLLAQMDVSMGGRVAEELIFGTDHITTGASSDFDNATKIAKR 

MVTKFGMSEKLGVMTYSDTGKLSPETQSAIEQEIRILLRDSYERAKHILKTHAKEHKNLAEAL 
LTYETLDAKEIQIVLEGKKLEVR 

Important features of the protein:
Transmembrane domain: 

amino acids 238-259 

N-glycosylation sites. 

amino acids 28-32, 90-94, 230-234, 278-282, 535-539, 584-588, 
623-627 

N-myristoylation sites. 

amino acids 35-41, 266-272, 286-292, 325-331, 357-363, 599-605 

Amidation site. 

amino acids 387-393, 709-713 

ATP/GTP-binding site motif A (P-loop).

amino acids 322-330 



AAA-protein family proteins 

amino acids 315-336, 343-386, 405-451 
FIGURE 97

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGG 

GACCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAA 

ACAAGTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTT 

CCAGGCCTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACG 

ACCTGTGCCACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAG 

GCTTGGCAGTTTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGT

CTTGCCTTCAGCCTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAG 

ACACTCAATTTGGGAAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCT 

GAGATACGGGGCAGTGTGCAAGCCAAAGATGGAAACATTGACATCAGAATCTTAAGGAGGACT

GAGTCTTTGCAAGACACAAAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTC 

TATCTGGACAGGGTATTTAAAAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGC 

AGCCTCGCCAATTCCTTTCTTACCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACA 

TGCCATTGTGGGGAGGAAGCAATGAAGAAATACAGCCAGATTCTGAGTCACTTTGAAAAGCTG 

GAACCTCAGGCAGCAGTTGTGAAGGCTTTGGGGGAACTAGACATTCTTCTGCAATGGATGGAG 

GAGACAGAATAGGAGGAAAGTGATGCTGCTGCTAAGAATATTCGAGGTCAAGAGCTCCAGTCT

TCAATACCTGCAGAGGAGGCATGACCCCAAACCACCATCTCTTTACTGTACTAGTCTTGTGCT 

GGTCACAGTGTATCTTATTTATGCATTACTTGCTTCCTTGCATGATTGTCTTTATGCATCCCC 

AATCTTAATTGAGACCATACTTGTATAAGATTTTTGTAATATCTTTCTGCTATTGGATATATT 

TATTAGTTAATATATTTATTTATTTTTTGCTATTTAATGTATTTATTTTTTTACTTGGACATG 

AAACTTTAAAAAAATTCACAGATTATATTTATAACCTGACTAGAGCAGGTGATGTATTTTTAT 

ACAGTAAAAAAAAAAAACCTTGTAAATTCTAGAAGAGTGGCTAGGGGGGTTATTCATTTGTAT 

TCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGATATTTGAAATTGAACCAATGAC 

TACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATTGCACATCTACCTTACAATTACTG 

ACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAGCCAGGAATCCTACACGG 

CCAGCATGTATTTCTACAAATAAAGTTTTCTTTGCATACCAAAAAAAAAAAAAAAAAAA 
FIGURE 98



MKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNGFSEIRGSVQAKDGNIDIR 
ILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKISSLANSFLTIKKDLRLC 
HAHMTCHCGEEAMKKYSQILSHFEKLEPQAAVVKALGELDILLQWMEETE 

Signal sequence: 

amino acids 1-24

cAMP- and cGMP-dependent protein kinase phosphorylation sites.

amino acids 107-110, 140-143 

N-myristoylation site. 

amino acids 51-56 

Interleukin 10: 

amino acids 9-176
FIGURE 99

GCGCCGGCTCCGCGCCTCGCGCCCAGTCCGCGGGCCGCGCCGCCGCTCCCGCCGCTCCCGCCG 

CTCCCGCAGCCGCCCCGCCGCCCGCCCGGAGCCCCGCGTCCCTAGGCCTGGCTCCCGCCTGCC 

CGAGACCCGCCCAGCCTGCCCCGCTCAGCCGCCAGAGAAGATGCGGCTGCTCCCGGAATGGTT

CCTCTTGCTCTTTGGCCCGTGGCTCCTTAGGAAGGCCGTCAGTGCCCAGATACCAGAGTCCGG 

AAGGCCGCAGTACCTGGGGCTGCGCCCCGCCGCGGCCGGAGCGGGTGCCCCCGGCCAGCAGCT 

CCCAGAGCCAAGGTCTTCGGACGGCCTAGGCGTGGGCCGCGCCTGGAGCTGGGCCTGGCCGAC 

CAACCACACGGGGGCGCTGGCCCGGGCAGGGGCAGCCGGGGCGTTGCCCGCGCAGCGCACCAA 

GAGGAAGCCGTCCATCAAGGCGGCGCGCGCCAAAAAGATCTTCGGCTGGGGGGACTTCTACTT 

TCGGGTGCATACCCTCAAGTTTTCGCTGCTGGTGACCGGCAAGATCGTGGACCATGTGAACGG 

TACCTTCAGTGTGTATTTCCGCCACAACTCGTCCAGCCTGGGCAACCTCAGTGTCAGCATCGT 

GCCGCCCTCCAAGCGTGTCGAGTTCGGAGGAGTCTGGCTGCCCGGGCCTGTCCCCCACCCTCT 

GCAGTCTACGCTCGCCCTGGAGGGGGTGCTTCCTGGGCTGGGGCCCCCGCTGGGGATGGCAGC 

AGCAGCGGCGGGGCCGGGGCTTGGGGGCTCCCTCGGGGGCGCACTGGCGGGGCCGCTTGGGGG 

CGCGTTGGGAGTGCCTGGGGCCAAAGAGTCACGCGCTTTCAATTGCCACGTGGAGTATGAGAA 

GACAAACCGCGCGCGCAAGCACCGACCGTGCCTGTACGACCCGTCGCAGGTGTGTTTCACCGA 

GCACACGCAGAGCCAGGCCGCCTGGCTCTGTGCCAAGCCCTTCAAAGTCATCTGTATCTTCGT 

CTCTTTCCTCAGCTTTGACTACAAACTGGTGCAGAAGGTGTGCCCAGACTATAACTTCCAGAG 

TGAGCACCCCTACTTCGGATAGCGCCCCTCCCCAGCCAGTCCTGAGCCTCCCGCCAAATCCCA

GCCTCACTAGGTGGGACCCCCTTCCCAGTGTTCTGCCGCTCCTGTGGCCATGTCGCCCACTCC 

TTCCACTCTGGGGGCGGAGGGGAATGGCTTCTCGGGACCCTCAGCTAGCGTGGGTGCCCTTTT 

CCTTATGCGGAGTGCCCGCAAGGCTGGGGTAGCCCCCTCCAGTACACCCCAAAGTGAAAGGGA 

TAAGAGTGCAGCCCCAGAATAGGCGGGGCTTGGAGGCGGTCCCAATGTCCCCTGGGTCCACAG 

TGGGTCCCCTTTTCACCCTTGGCGCTAGGCTGCGCACTCCCTTTCCCCGCAGCTTTAATAACT 

CCTGGCCTGGCACCCTCACCCCACCCTGACTTTCCCATCCCCCAGCGCTTGTCCTGCTTCACC 

ATACCCCGCCTAAGACTGTAAAGGCCTAAAAACCTCGGCCTGTCCTCCCACCATTCTGCCTGC 

CATATGCCTGTCCCCTTTTCCTCCAAACCCTATTAGGGTACCGGAAGCAGAACCCCTGGGCTG 

AGGCCCTGGCCCTGCCCCCGGCCCCTGCCCCTGCCCGCCCCCCTCCAGTCCAGGCAGTCGAGC 

TCCACCTGCCCTCTCCTGCTGCTTCCTCTCGGTGATATTTTTTCTACGCCAAAACAGACGGGA 

AAGGGAACAAAATAAAGTGAAATCCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAA 



WO 01/40466 PCT/US00/32678

100/550

FIGURE 100



MRLLPEWFLLLFGPWLLRKAVSAQIPESGRPQYLGLRPAAAGAGAPGQQLPEPRSSDGLGVGR 
AWSWAWPTNHTGALARAGAAGALPAQRTKRKPSIKAARAKKIFGWGDFYFRVHTLKFSLLVTG
KIVDHVNGTFSVYFRHNSSSLGNLSVSIVPPSKRVEFGGVWLPGPVPHPLQSTLALEGVLPGL 
GPPLGMAAAAAGPGLGGSLGGALAGPLGGALGVPGAKESRAFNCHVEYEKTNRARKHRPCLYD 
PSQVCFTEHTQSQAAWLCAKPFKVICIFVSFLSFDYKLVQKVCPDYNFQSEHPYFG 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 273-288 

N-glycosylation sites. 

amino acids 72-76, 133-137, 143-147, 149-153 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 93-97 

N-myristoylation sites. 

amino acids 35-41, 58-64, 60-66, 81-87, 84-90, 184-190, 194-200, 
203-209, 205-211, 206-212, 209-215, 217-223, 221-227, 224-230 

Cytochrome b/b6 Qo site signature. 

amino acids 5-11 
FIGURE 101

AATGCCCCATCCGCACCCCACAGCTCGCGCTCCTGCAAGTGTTCTTTCTGGTGTTCCCCGATG 

GCGTCCGGCCTCAGCCCTCTTCCTCCCCATCAGGGGCAGTGCCCACGTCTTTGGAGCTGCAGC 

GAGGGACGGATGGCGGAACCCTCCAGTCCCCTTCAGAGGCGACTGCAACTCGCCCGGCCGTGC 

CTGGACTCCCTACAGTGGTCCCTACTCTCGTGACTCCCTCGGCCCCTGGGAATAGGACTGTGG 

ACCTCTTCCCAGTCTTACCGATCTGTGTCTGTGACTTGACTCCTGGAGCCTGCGATATAAATT 

GCTGCTGCGACAGGGACTGCTATCTTCTCCATCCGAGGACAGTTTTCTCCTTCTGCCTTCCAG 

GCAGCGTAAGGTCTTCAAGCTGGGTTTGTGTAGACAACTCTGTTATCTTCAGGAGTAATTCCC 

CGTTTCCTTCAAGAGTTTTCATGGATTCTAATGGAATCAGGCAGTTTTGTGTCCATGTGAACA 

ACTCAAACTTAAACTATTTCCAGAAGCTTCAAAAGGTCAATGCAACCAACTTCCAGGCCCTGG 

CTGCAGAGTTTGGAGGCGAATCATTCACTTCAACATTCCAAACTCAATCACCACCATCTTTTT 

ACAGGGCTGGGGACCCCATTCTTACTTACTTCCCCAAGTGGTCTGTAATAAGCTTGCTGAGAC 

AACCTGCAGGAGTTGGAGCTGGGGGACTCTGTGCTGAAAGCAATCCTGCAGGTTTCCTAGAGA 

GTAAAAGTACAACTTGCACTCGTTTTTTCAAGAACCTGGCTAGTAGCTGTACCTTGGATTCAG 

CCCTCAATGCTGCCTCTTACTATAACTTCACAGTCTTAAAGGTTCCAAGAAGCATGACTGATC 

CACAGAATATGGAGTTCCAGGTTCCTGTAATACTTACCTCACAGGCTAATGCTCCTCTGTTGG 

CTGGAAACACTTGTCAGAATGTAGTTTCTCAGGTCACCTATGAGATAGAGACCAATGGGACTT 

TTGGAATCCAGAAAGTTTCTGTCAGTTTGGGACAAACCAACCTGACTGTTGAGCCAGGCGCTT 

CCTTACAGCAACACTTCATCCTTCGCTTCAGGGCTTTTCAACAGAGCACAGCTGCTTCTCTCA 

CCAGTCCTAGAAGTGGGAATCCTGGCTATATAGTTGGGAAGCCACTCTTGGCTCTGACTGATG 

ATATAAGTTACTCAATGACCCTCTTACAGAGCCAGGGTAATGGAAGTTGCTCTGTTAAAAGAC 

ATGAAGTGCAGTTTGGAGTGAATGCAATATCTGGATGCAAGCTCAGGTTGAAGAAGGCAGACT 

GCAGCCACTTGCAGCAGGAGATTTATCAGACTCTTCATGGAAGGCCCAGACCAGAGTATGTTG 

CCATCTTTGGTAATGCTGACCCAGCCCAGAAAGGAGGGTGGACCAGGATCCTCAACAGGCACT 

GCAGCATTTCAGCTATAAACTGTACTTCCTGCTGTCTCATACCAGTTTCCCTGGAGATCCAGG 

TATTGTGGGCATATGTAGGTCTCCTGTCCAACCCGCAAGCTCATGTATCAGGAGTTCGATTCC 

TATACCAGTGCCAGTCTATACAGGATTCTCAGCAAGTTACAGAAGTATCTTTGACAACTCTTG 

TGAACTTTGTGGACATTACCCAGAAGCCACAGCCTCCAAGGGGCCAACCCAAAATGGACTGGA 

AATGGCCATTCGACTTCTTTCCCTTCAAAGTGGCATTCAGCAGAGGAGTATTCTCTCAAAAAT 

GCTCAGTCTCTCCCATCCTTATCCTGTGCCTCTTACTACTTGGAGTTCTCAACCTAGAGACTA 

TGSSaAGAAAAGAAAATAATCAGATTTCAGTTTTCCCTATGAGAAACTCTGAGGCAGCCACTT 

ATCTTGGCTAAATAGAACCTCACCTGCTCATGACCAGAGAGCATTTAGGATAATAGATGACCT 

AACTGAAGGAATCCTTGTATATGAAAGGAGTTATTTTAGAAAAGCAATAAAAATATTTTATTC 
ATCNTAAAAAAAAAA 
FIGURE 102

MRTPQLALLQVFFLVFPDGVRPQPSSSPSGAVPTSLELQRGTDGGTLQSPSEATATRPAVPGL 
PTVVPTLVTPSAPGNRTVDLFPVLPICVCDLTPGACDINCCCDRDCYLLHPRTVFSFCLPGSV
RSSSWVCVDNSVIFRSNSPFPSRVFMDSNGIRQFCVHVNNSNLNYFQKLQKVNATNFQALAAE 
FGGESFTSTFQTQSPPSFYRAGDPILTYFPKWSVISLLRQPAGVGAGGLCAESNPAGFLESKS 
TTCTRFFKNLASSCTLDSALNAASYYNFTVLKVPRSMTDPQNMEFQVPVILTSQANAPLLAGN 
TCQNVVSQVTYEIETNGTFGIQKVSVSLGQTNLTVEPGASLQQHFILRFRAFQQSTAASLTSP 
RSGNPGYIVGKPLLALTDDISYSMTLLQSQGNGSCSVKRHEVQFGVNAISGCKLRLKKADCSH 
LQQEIYQTLHGRPRPEYVAIFGNADPAQKGGWTRILNRHCSISAINCTSCCLIPVSLEIQVLW 
AYVGLLSNPQAHVSGVRFLYQCQSIQDSQQVTEVSLTTLVNFVDITQKPQPPRGQPKMDWKWP 
FDFFPFKVAFSRGVFSQKCSVSPILILCLLLLGVLNLETM 

Important features of the protein: 
Signal peptide:

amino acids 1-22 

Transmembrane domains: 

amino acids 484-505, 581-600 

N-glycosylation sites. 

amino acids 78-82, 165-169, 179-185, 279-285, 331-337, 347-351,
410-414, 487-491 

N-myristoylation sites. 

amino acids 30-36, 41-47, 124-130, 232-238, 236-242, 409-415

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 420-431 
FIGURE 103

CCTAATTCTCAAGGTGATGCTATTTAGGAAGTCATAACTCATGTGAGTGGAGCCATGTGGGAT 

TAAGAAGTGATAGGAGAGCTTGCTGTCTGTCTCTGCTCTCCACTGTGTGAGGATACAACAGGA 

AGACAGCCATCTGGTGAGGAAGAGAGGGCCCTCGCCAGATACCGGACCTGCTGACACCTTGAT 

CTTGGACTTCCCATCTTCCAGGAAGGCCTGACCTCAGTTGTTCCAGGGTAAAGAATTTGGGCA 

GTGCCCACACCCACGCTGTTGGATAACATTTCTTCACCATACCAGTGAGGGTGAATGTGTACA 

CGCCCAGCTTCCTGCCTGTTACTCTCCACAGTATGCGAAGAATATCCCTGACTTCTAGCCCTG

TGCGCCTTCTTTTGTTTCTGCTGTTGCTACTAATAGCCTTGGAGATCATGGTTGGTGGTCACT 

CTCTTTGCTTCAACTTCACTATAAAATCATTGTCCAGACCTGGACAGCCCTGGTGTGAAGCGC

AGGTCTTCTTGAATAAAAATCTTTTCCTTCAGTACAACAGTGACAACAACATGGTCAAACCTC 

TGGGCCTCCTGGGGAAGAAGGTATATGCCACCAGCACTTGGGGAGAATTGACCCAAACGCTGG 

GAGAAGTGGGGCGAGACCTCAGGATGCTCCTTTGTGACATCAAACCCCAGATAAAGACCAGTG

ATCCTTCCACTCTGCAAGTCGAGATGTTTTGTCAACGTGAAGCAGAACGGTGCACTGGTGCAT 

CCTGGCAGTTCGCCACCAATGGAGAGAAATCCCTCCTCTTTGACGCAATGAACATGACCTGGA 

CAGTAATTAATCATGAAGCCAGTAAGATCAAGGAGACATGGAAGAAAGACAGAGGGCTGGAAA 

AGTATTTCAGGAAGCTCTCAAAGGGAGACTGCGATCACTGGCTCAGGGAATTCTTAGGGCACT 

GGGAGGCAATGCCAGAACCGACAGGCAGAAGATCCACCT^AGGTGATACCACGGCGGCGCAG 

AGTTGTTCACCTGTGGTCCTCGATCGCTGACAGCCTTGGCTCCCACTGCTGTGTGTTCCCTGA 

GTCAAGTGGAGGCGGAGCCTGCAATGAGCGGAGATCGCGCCTCTGCATTCCAGTCTTGGCAAC 

AGAGCAAGACTCCGTCTCAAAAAAAAAAAATTTTTTTTCAGTACATATTTTTTAAAAGATAGG 

GCTGGGCACAGCAGCTCACATCTATAATCCCAACACTTTGGGAGGCCTAGGCAGGAGGATCAC 

TTGAGCCCAGGAATCTGAAGCTGCAGTGAGCCTTTGCTCGTGAGATTGTGGACCTATGATCCT 

ACCACCAGCCCACCTGGTTCTAACACCCCCTCCTCTATGTGTGAGAGGGAGAGAAGAAAAGTG 

AGGGAGAAAAGAGAGATAAGCAAAGAACAGAGAGGAAAAATGGAAAATAAGAGGAAATTGGGG 

GAATTAAACAGAGGGGAGGGCATGGATCCCCGGGAGTTAGAAGAGTAGCAGCTTGTGGATTAC 

TACGCAGTGGAGGAAGAAGAGTTGTTGGAAATTATTTGAGAGGTAGTATAATCATTTGTGAGG 

CAGTTTTCTGCATTCACCATTTCTCACAGACTAAGTTACTCATAAGCAAACGTGCAATTCACA 

TTACACTGAAATTCTTCCCTAATACATCATTTGCATTGGAATAAAGTACGGTTTTCAAACAAC 

CTGATATAGCAGAACTGACTGTATAAATTATGTGAGCACAGTGCAAGTAATTCTTTGTTTGTT 

TGTTTGTTTTTTTGAGACAGAGTCTCACTCTATCTCCCAGGCTGGAGTGTAGTGGTGCGATCC 

CGGCTCACTGCAACCTCGATCTCCCAGGCTCAAGCGATTCCCCTGCCTCAGCCTCCTGAGTAG 

CTGGGATTACAGGCATGAGCCACCACGCCCGGCTAATTTTTGTATTTTTAGTAGAGACGGGGT 

TTCACCCTGTTGGCCAGGCTGGTCTCGAACTACGGACCTCAGGTGATCTGCCCCCCTCAGCCT 

CTCAAAGTGCTGGGATTATAGCATGAGCCACTGAGCCCAGACACAAGTAGTTCTTTCTGATAA 

ACACTTTAACACTGAATGCA 
FIGURE 104



MRRISLTSSPVRLLLFLLLLLIALEIMVGGHSLCFNFTIKSLSRPGQPWCEAQVFLNKNLFLQ 
YNSDNNMVKPLGLLGKKVYATSTWGELTQTLGEVGRDLRMLLCDIKPQIKTSDPSTLQVEMFC 
QREAERCTGASWQFATNGEKSLLFDAMNMTWTVINHEASKIKETWKKDRGLEKYFRKLSKGDC 
DHWLREFLGHWEAMPEPTGRRST 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

Transmembrane domain: 

amino acids 11-30 (possible type II protein) 

N-glycosylation site. 

amino acids 36-39, 154-157 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 2-5, 182-185, 209-212 

Casein kinase II phosphorylation site. 

amino acids 86-89, 93-96, 142-145, 185-188 

N-myristoylation site.

amino acids 46-51 

Amidation site. 

amino acids 77-80, 207-210
FIGURE 105

TTTTCCGAGTGACCTTCTTGATGCTGGCTGTTTCTCTCACCGTTCCCCTGCTTGGAGCCATGA

TGCTGCTGGAATCTCCTATAGATCCACAGCCTCTCAGCTTCAAAGAACCCCCGCTCTTGCTTG 

GTGTTCTGCATCCAAATACGAAGCTGCGACAGGCAGAAAGGCTGTTTGAAAATCAACTTGTTG 

GACCGGAGTCCATAGCACATATTGGGGATGTGATGTTTACTGGGACAGCAGATGGCCGGGTCG 

TAAAACTTGAAAATGGTGAAATAGAGACCATTGCCCGGTTTGGTTCGGGCCCTTGCAAAACCC 

GAGATGATGAGCCTGTGTGTGGGAGACCCCTGGGTATCCGTGCAGGGCCCAATGGGACTCTCT 

TTGTGGCCGATGCATACAAGGGACTATTTGAAGTAAATCCCTGGAAACGTGAAGTGAAACTGC 

TGCTGTCCTCCGAGACACCCATTGAGGGGAAGAACATGTCCTTTGTGAATGATCTTACAGTCA 

CTCAGGATGGGAGGAAGATTTATTTCACCGATTCTAGCAGCAAATGGCAAAGACGAGACTACC 

TGCTTCTGGTGATGGAGGGCACAGATGACGGGCGCCTGCTGGAGTATGATACTGTGACCAGGG 

AAGTAAAAGTTTTATTGGACCAGCTGCGGTTCCCGAATGGAGTCCAGCTGTCTCCTGCAGAAG 

ACTTTGTCCTGGTGGCAGAAACAACCATGGCCAGGATACGAAGAGTCTACGTTTCTGGCCTGA 

TGAAGGGCGGGGCTGATCTGTTTGTGGAGAACATGCCTGGATTTCCAGACAACATCCGGCCCA 

GCAGCTCTGGGGGGTACTGGGTGGGCATGTCGACCATCCGCCCTAACCCTGGGTTTTCCATGC 

TGGATTTCTTATCTGAGAGACCCTGGATTAAAAGGATGATTTTTAAGCTCTTTAGTCAAGAGA 

CGGTGATGAAGTTTGTGCCGCGGTACAGCCTCGTCCTAGAACTCAGCGACAGCGGTGCCTTCC 

GGAGAAGCCTGCATGATCCCGATGGGCTGGTGGCCACCTACATCAGCGAGGTGCACGAACACG 

ATGGGCACCTGTACCTGGGCTCTTTCAGGTCCCCCTTCCTCTGCAGACTCAGCCTCCAGGCTG 

TTTAGCCCTCCCAGATAGCTGCCCCTGCCACGCAGGCCAGGAGTCTTCACACTCAGGCACCAG

GCCTGGTCCAGGAGGAGCTGTGGACACAGTCGTGGTTCAAGTGTCCACATGCACCTGTTAGTC 

CCTGAGAGGTGGTGGGAATGGCTGCTTCATTCCTCGAGGATGCCCGGGCCCCACCTGGGCTTG 

TCTTTCTGTTTAGAGGGAAGTGTAACATATCTGCCATGAGGAACATAAATTCATGTAAAGCCA 

TTTTCTCTTAAACAAAACAAAACTTTCTAAGTACAATCATTCTCTAGGATTTGGGAAGCTCCT 

TGCACTTGGAACAGGGCTCAGGTGGGTGGAGCAGTAAGGCACTACCCAGAGAGCTTGCTGCTG 

CGGCCCTGTCCTGCGGCCTCAAAGTTCTTCTTTACTATATATAACGTGCGGTCATACCTTTCT 

TCGTTGTGGTGGGGATGGAAGAGCAGAGGGAGCATGGCCCAGGGGTGTTGAGGCCAGCGGTGA 

GAGCCGTGTTAGCCAAGACATGGAACTGTGTTCTCAAGGGTTATGTGGGGCGTGGGCTCTCCA 

TAGTGTGTATGAAAAGCTTGTTGACTCTAGCGGCTCAGAGAGGACTTTGCTGGGTTTCTTTCT 

GTGAATATCTCCGTGCTGACCATGCTGGAATTGGATGATTCTGCAATTCGGGACCTACTGCAG 

GGGTCCGTTTAGTAACGTCTTGTCTGTGATCTTTGTTCTTGACCTCTAGACCCCAAGATGTGA 

ACAGTGCACGTGTTAATGTCATCTTTGCTCATGTGTTATAAGCCCCAAGTTGCTGTATATTTT 
CACAAGTATGTCTACACACTGG 
FIGURE 106



MLAVSLTVPLLGAMMLLESPIDPQPLSFKEPPLLLGVLHPNTKLRQAERLFENQLVGPESIAH 
IGDVMFTGTADGRVVKLENGEIETIARFGSGPCKTRDDEPVCGRPLGIRAGPNGTLFVADAYK 
GLFEVNPWKREVKLLLSSETPIEGKNMSFVNDLTVTQDGRKIYFTDSSSKWQRRDYLLLVMEG 
TDDGRLLEYDTVTREVKVLLDQLRFPNGVQLSPAEDFVLVAETTMARIRRVYVSGLMKGGADL 
FVENMPGFPDNIRPSSSGGYWVGMSTIRPNPGFSMLDFLSERPWIKRMIFKLFSQETVMKFVP 
RYSLVLELSDSGAFRRSLHDPDGLVATYISEVHEHDGHLYLGSFRSPFLCRLSLQAV 

Important features of the protein:
Signal peptide: 

amino acids 1-13 

Transmembrane domain: 

amino acids 1-21 (possible type II)

N-glycosylation sites. 

amino acids 116-119, 152-155 

Casein kinase II phosphorylation sites. 

amino acids 19-22, 27-30, 98-101, 146-149, 221-224, 286-289,
332-335

N-myristoylation sites.

amino acids 71-76, 92-97, 189-194, 244-249, 338-343 

Amidation site.

amino acids 164-167 
FIGURE 107

AACGAAGCGTGCGCGCTTTGGTAACCGGCTAGAAATCCCGCACGCGCGCCTGCCTCCTCTCCC 
CAGGCCTGAGCTGCCCCTCCCACTGCCTTTCCTTCTTCCCGCGAGTCAGAAGCTTCGCGAGGG 
CCCAGAGAGGCGGTGGGGTGGGCGACCCTACGCCAGCTCCGGGCGGGAGAAAGCCGACCCTCT 
CCCGCGCCCCAGGAAACCGCCGGCGTTCGGCGCTGCGCAGAGCCATGGAATTCTCCTGGCTGG
AGACGCGCTGGGCGCGGCCCTTTTACCTGGCGTTCGTGTTCTGCCTGGCCCTGGGGCTGCTGC 
AGGCCATTAAGCTGTACCTGCGGAGGCAGCGGCTGCTGCGGGACCTGCGCCCCTTCCCAGCGC 
CCCCCACCCACTGGTTCCTTGGGCACCAGAAGTTTATTCAGGATGATAACATGGAGAAGCTTG 
AGGAAATTATTGAAAAATACCCTCGTGCCTTCCCTTTCTGGATTGGGCCCTTTCAGGCATTTT 
TCTGTATCTATGACCCAGACTATGCAAAGACACTTCTGAGCAGAACAGATCCCAAGTCCCAGT 
ACCTGCAGAAATTCTCACCTCCACTTCTTGGAAAAGGACTAGCGGCTCTAGACGGACCCAAGT 
GGTTCCAGCATCGTCGCCTACTAACTCCTGGATTCCATTTTAACATCCTGAAAGCATACATTG 
AGGTGATGGCTCATTCTGTGAAAATGATGCTGGATAAGTGGGAGAAGATTTGCAGCACTCAGG 
ACACAAGCGTGGAGGTCTATGAGCACATCAACTCGATGTCTCTGGATATAATCATGAAATGCG 
CTTTCAGCAAGGAGACCAACTGCCAGACAAACAGCACCCATGATCCTTATGCAAAAGCCATAT 
TTGAACTCAGCAAAATCATATTTCACCGCTTGTACAGTTTGTTGTATCACAGTGACATAATTT 
TCAAACTCAGCCCTCAGGGCTACCGCTTCCAGAAGTTAAGCCGAGTGTTGAATCAGTACACAG 
ATACAATAATCCAGGAAAGAAAGAAATCCCTCCAGGCTGGGGTAAAGCAGGATAACACTCCGA 
AGAGGAAGTACCAGGATTTTCTGGATATTGTCCTTTCTGCCAAGGATGAAAGTGGTAGCAGCT 
TCTCAGATATTGATGTACACTCTGAAGTGAGCACATTCCTGTTGGCAGGACATGACACCTTGG 
CAGCAAGCATCTCCTGGATCCTTTACTGCCTGGCTCTGAACCCTGAGCATCAAGAGAGATGCC

GGGAGGAGGTCAGGGGCATCCTGGGGGATGGGTCTTCTATCACTTGGGACCAGCTGGGTGAGA

TGTCGTACACCACAATGTGCATCAAGGAGACGTGCCGATTGATTCCTGCAGTCCCGTCCATTT 

CCAGAGATCTCAGCAAGCCACTTACCTTCCCAGATGGATGCACATTGCCTGCAGGGATCACCG

TGGTTCTTAGTATTTGGGGTCTTCACCACAACCCTGCTGTCTGGAAAAACCCAAAGGTCTTTG 

ACCCCTTGAGGTTCTCTCAGGAGAATTCTGATCAGAGACACCCCTATGCCTACTTACCATTCT
CAGCTGGATCAAGGAACTGCATTGGGCAGGAGTTTGCCATGATTGAGTTAAAGGTAACCATTG
CCTTGATTCTGCTCCACTTCAGAGTGACTCCAGACCCCACCAGGCCTCTTACTTTCCCCAACC

ATTTTATCCTCAAGCCCAAGAATGGGATGTATTTGCACCTGAAGAAACTCTCTGAATGTTAGA 

TCTCAGGGTACAATGATTAAACGTACTTTGTTTTTCGAAGTTAAATTTACAGCTAATGATCCA
AGCAGATAGAAAGGGATCAATGTATGGTGGGAGGATTGGAGGTTGGTGGGATAGGGGTCTCTG

TGAAGAGATCCAAAATCATTTCTAGGTACACAGTGTGTCAGCTAGATCTGTTTCTATATAACT 

TTGGGAGATTTTCAGATCTTTTCTGTTAAACTTTCACTACTATTAATGCTGTATACACCAATA 

GACTTTCATATATTTTCTGTTGTTTTTAAAATAGTTTTCAGAATTATGCAAGTAATAAGTGCA 

TGTATGCTCACTGTCAAAAATTCCCAACACTAGAAAATCATGTAGAATAAAAATTTTAAATCT 

CACTTCACTTAGCCGACATTCCATGCCCTGACCAATCCTACTGCTTTTCCTAAAAACAGAATA 

ATTTGGTGTGCATTCTTTCAGACTTTTTCCTATACATTTTATATGTAGAAATGTAGCAATGTA 

TTTGTATAGATGTGATCATTCCTATATTGTTATTGATTTTTTTCACTTAATAAAAATTCACCT 
TATTCCTTAAAA 
FIGURE 108

MEFSWLETRWARPFYLAFVFCLALGLLQAIKLYLRRQRLLRDLRPFPAPPTHWFLGHQKFIQD 

DNMEKLEEIIEKYPRAFPFWIGPFQAFFCIYDPDYAKTLLSRTDPKSQYLQKFSPPLLGKGLA

ALDGPKWFQHRRLLTPGFHFNILKAYIEVMAHSVKMMLDKWEKICSTQDTSVEVYEHINSMSL 

DIIMKCAFSKETNCQTNSTHDPYAKAIFELSKIIFHRLYSLLYHSDIIFKLSPQGYRFQKLSR

VLNQYTDTIIQERKKSLQAGVKQDNTPKRKYQDFLDIVLSAKDESGSSFSDIDVHSEVSTFLL 

AGHDTLAASISWILYCLALNPEHQERCREEVRGILGDGSSITWDQLGEMSYTTMCIKETCRLI 

PAVPSISRDLSKPLTFPDGCTLPAGITVVLSIWGLHHNPAVWKNPKVFDPLRFSQENSDQRHP

YAYLPFSAGSRNCIGQEFAMIELKVTIALILLHFRVTPDPTRPLTFPNHFILKPKNGMYLHLK 

KLSEC 

Important features of the protein: 
Signal peptide: 

amino acids 1-29 

Transmembrane domains: 

amino acids 310-330, 397-413, 459-473 

N-glycosylation site. 

amino acids 206-210 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 265-269, 504-520 

N-myristoylation sites.

amino acids 25-31, 298-304, 353-359, 450-456, 456-462 

Cytochrome P450 cysteine heme-iron ligand signature.

amino acids 447-457 

Cytochrome P450 cysteine heme-iron ligand proteins.

amino acids 444-475 
FIGURE 109

GGCGTTCCGGGCCTCAACTTTGGCGTCGTGAGATTCTTGTGAGGCGTCTGCCTGGAAGCCGGC 

AGCAATTTTGCTTCTTTAAAGAGAAAAAGAAGGCTAGGGACTCAGATTCCTGGATTCTGAGAT 

CCAGACCAGCTCCTCCCAGACCTCTCCAGAAGAAGCCATGGGAACCCCTCGTATCCAGCATTT 

GCTGATCCTCCTGGTCCTAGGAGCCTCCCTCCTGACCTCGGGCCTAGAGCTGTATTGTCAAAA 

GGGTCTGTCCATGACTGTGGAAGCAGATCCAGCCAATATGTTTAACTGGACCACAGAGGAAGT 

GGAGACTTGTGACAAAGGGGCACTTTGCCAGGAAACCATACTAATAATTAAAGCAGGGACTGA 

GACAGCCATTTTGGCCACGAAGGGCTGCATCCCGGAAGGGGAGGAGGCCATAACAATTGTCCA 

GCACTCTTCACCTCCCGGCCTGATCGTGACCTCCTACAGTAACTACTGTGAGGATTCCTTCTG 

TAATGACAAAGACAGCCTGTCTCAGTTTTGGGAGTTCAGTGAGACCACAGCTTCCACTGTGTC 

AACTACCCTCCATTGTCCAACCTGTGTGGCTTTGGGGACCTGTTTCAGTGCTCCTTCTCTTCC

CTGTCCCAATGGTACAACTCGATGCTATCAAGGAAAACTTGAGATCACTGGAGGTGGCATTGA 

GTCGTCTGTGGAGGTCAAAGGCTGTACAGCCATGATTGGCTGCAGGCTGATGTCTGGAATCTT 

AGCAGTAGGACCCATGTTTGTGAGGGAAGCGTGCCCACATCAGCTGCTCACTCAACCTCGAAA 

GACTGAAAATGGGGCCACCTGTCTTCCCATTCCTGTTTGGGGGTTACAGCTACTGCTGCCATT 

GCTGCTGCCATCATTTATTCACTTTTCCTAAGAAGGCACTTCTGGGCCTGGGTCTGAGGACAT

CTTTTTTGACTGGGAGCCTTCTTACTGTTGAGGTTCAACAAGCTGAGGAGTAGATGGGAATTT 

GAGGGAGAATACAGAGATACTATGAACGTATTTGACATTTTTAATACAATTTCTGCTATAATT 

TTTGTATGCAGTAGGCGTTACTAATAAACATTTCTGCTGTGA 



FIGURE 110



MGTPRIQHLLILLVLGASLLTSGLELYCQKGLSMTVEADPANMFNWTTEEVETCDKGALCQET 
ILIIKAGTETAILATKGCIPEGEEAITIVQHSSPPGLIVTSYSNYCEDSFCNDKDSLSQFWEF 
SETTASTVSTTLHCPTCVALGTCFSAPSLPCPNGTTRCYQGKLEITGGGIESSVEVKGCTAMI
GCRLMSGILAVGPMFVREACPHQLLTQPRKTENGATCLPIPVWGLQLLLPLLLPSFIHFS

Important features of the protein:
Signal peptide:

amino acids 1-23

Transmembrane domain:

amino acids 184-201

N-glycosylation sites.

amino acids 45-49, 159-163

N-myristoylation sites.

amino acids 31-37, 70-76, 99-105, 147-153, 160-166, 174-180,
175-181
FIGURE 111

CGAGAAGAGGACAGAGGAGACTGAGCAAAGGGGGGTGGGCTCCAGGCGACCCCTAGCCCAATTCTGCCCCTCCAT 
CCCAAGGGGCAGAGAAATTGTCTTTCTTTGCTGACTCCTACGAGGAAAAAAAAAAAAAAAAAAAAAACCATTTAA 
AGGGAAAGATAAACGGAGACGGAGGAAAGGTGGCAGCCAGATTACTTAGAGAGGCACAGAGGAGAGAGATCGGGG 
TGAGTCGCCATGGGGACTCCCAGGGCCCAGCACCCGCCGCCTCCCCAGCTGCTGTTCCTAATTCTGCTGAGCTGT 
CCCTGGATCCAGGGTCTGCCCCTGAAGGAGGAGGAGATATTGCCAGAGCCTGGAAGTGAGACCCCCACGGTGGCC 
TCTGAGGCCCTGGCTGAACTGCTTCATGGGGCCCTGCTGAGGAGGGGCCCAGAGATGGGCTACCTGCCAGGM^^ 

^J»^^^^^™H'^°'^^^^^^^°^^^*=^^^C^C^CCC^<5CCCTGCCTCCCCAGGGCCTCCCCTTGGGC 

GGAGGAGAGGAGGAGACGACGACCACCATCATCACCACGACAACTGTTACCACTACGGTGACCAGCCCAGTTCTG 

TGTAATAACAACATCTCCGAGGGCGAAGGGTATGTGGAGTCTCCAGATCTGGGGAGCCCCGTCAGCCGCACCCTG 

CTGTCACAGGAAGAGGAGCTCCTGGTGCTGGCTGGTGGGGGATCCCCAGGCCTGGCCCCCCGACTCCTGGCCAAC 

^«S^™S^SS^^^^°'^^'''^^°*^'^^^^^<=TAT^^<^'^CCTACCTCCTGAGCTGTGGCTTCCCTCCCCGGCCGGCC 
^«^^S^^^^^°'^^^*^'^^^^°^'^^<^^'^<^^^^^TGGGGGCACTGCCACCTTTCACTGTGATTCGGGCTACCAGCTG 
CAGGGAGAGGAGACCCTCATCTGCCTCAATGGCACCCGGCCATCCTGGAACGGTGAAACCCCCAGCTGCATGGCA 

^S^I™^°^^*^^^°^°'^^^^''^*^^<=^'^C^G^<5<^<5<5CGCCGGCTGCACCTGaVCTTTGAAAGGGTCTCGCTG 
GATGAGGACAATGACCGGCTGATGGTGCGCTCAGGGGGCAGCCCCCTATCCCCCGTGATCTATGATTCGGACATG 

^^^S^^J2^*^^'^^*^'^^"^^^^""^<^CCTTTGAGGAGGATCGCTGCTTCGCCCCCTTCCTGGCACATGGA 
AATGTCACTACCACGGACCCTGAGTATCGCCCAGGGGCACTGGCAACCTTCTCGTGCCTCCCAGGATATGCCCTG

»oo^H^S^^'^^'*^^^°^'^^^^<^T^^^^CCAGCTGGCGTGGTCCTCTCTCCCGACTGGCCCCAGAGCTAT 

AGCCCGGGCCAAGACTGCGTGTGGGGCGTGCACGTCCAGGAAGAGAAGCGCATCTTGCTCCAAGTTGAGATATTG 

^^^JS^»^^^^^*^^'^'^^^^^^^^^TTC'^A^'^G*^<^ACGGTCCCAGCGCCCGAGTCTTGGCC^^^ 

™^^?SSS^^^^^^*^^^^'^<=^^'^^<=T*=''<^'^GCCCGACCTCACACTGCAGTTTCAGGCACCGCCCGGGCCC 

^^^^^^^SnS*^'^*^^'^^^^°^°^'^^^^^'^'r°<=^^^'^CAAAGAGGTCCCGAGGAACGACACGTGCCCCGAGCTG 

^^^^S^'^^^'^^^^^'^^'^^^^^^^^^^^^^TCCCACGGGGACCTGATCCGGGGCACGGTGCTCACCTACCAGTGC 

^ro™^^I^S^^^'^'^°^''^*^°^^'=^^^*='^'^'^CTCACTTGCCAGTGGGACCTGTCTTGGAGCGCCGCGCCGCCC 

^$^^^^^S^?^^^^'r<5ACTTGTGCTGACCCTGGCGAGATTGCCAACGGGCACCGCACCGCCTCGGACGCCGGC 

^^^^^^^^^^^'^^^'^^^^■^^^^'^'^'^^^^'^^'^CCTGCCAGGGTACAGCCTCGAGGGGGCAGCCATGCTCACCTG^ 

T^^^^^^^^H^^^^'^GCACACCCAAGTGGAGCGATAGGGTCCCCAAATGCGCCTTGAAGTACGAGCCGTGCCTG 

™™SSS55'^°^'^<^C<='^^'5^TGGCTACCAGACGCTGTACAAGCACCACTACCAGGCGGGCGAGTCTCTGCGCTTC 

!J^^^^o^JS^'^^*^^^^^^^^^^^^'^<='^°^^^^<^T<^ACCATCACCTGTGTGCCCGGCCACCCCTCCCAG^ 

^^^»^^SS^S^'^^^^'^'^'^^^^^*=^*<^^C^ACAGATCCATCACGGCAGCTGGAAGGGGGGAAC^ 

^^^^J^5JSS35^^CTAGGCTTGGTCATTGTCCTCGGCAGTGGCGTTTACATCTACTACACCAAGCTTCAGGGA 

™^^o^^'^^^^°^^^^^^'^°°^'^^CC^C"rCCTACAGCCCCATCACCGTGGAGTCGGACTTCAGCAACCCGCTG 

CCCTCCCCTCCTCATTCGGGCAGAGGGAAATACGGGACCCGGTCTCTGCCTCCTGGCTGCCCTCCTCCCTGGCTG

cScccaaaaS^^^"^^'^^^'^^^^^^^^^''^^^'^^^^^^^ 
FIGURE 112

MGTPRAQHPPPPQLLFLILLSCPWIQGLPLKEEEILPEPGSETPTVASEALAELLHGALLRRG 
PEMGYLPGSDPDPTLATPPAGQTLAVPSLPRATEPGTGPLTTAVTPNGVRGAGPTAPELLTPP 
PGTTAPPPPSPASPGPPLGPEGGEEETTTTIITTTTVTTTVTSPVLCNNNISEGEGYVESPDL 
GSPVSRTLGLLDCTYSIHVYPGYGIEIQVQTLNLSQEEELLVLAGGGSPGLAPRLLANSSMLG 
EGQVLRSPTNRLLLHFQSPRVPRGGGFRIHYQAYLLSCGFPPRPAHGDVSVTDLHPGGTATFH 
CDSGYQLQGEETLICLNGTRPSWNGETPSCMASCGGTIHNATLGRIVSPEPGGAVGPNLTCRW 
VIEAAEGRRLHLHFERVSLDEDNDRLMVRSGGSPLSPVIYDSDMDDVPERGLISDAQSLYVEL 
LSETPANPLLLSLRFEAFEEDRCFAPFLAHGNVTTTDPEYRPGALATFSCLPGYALEPPGPPN 
AIECVDPTEPHWNDTEPACKAMCGGELSEPAGVVLSPDWPQSYSPGQDCVWGVHVQEEKRILL
QVEILNVREGDMLTLFDGDGPSARVLAQLRGPQPRRRLLSSGPDLTLQFQAPPGPPNPGLGQG 
FVLHFKEVPRNDTCPELPPPEWGWRTASHGDLIRGTVLTYQCEPGYELLGSDILTCQWDLSWS
AAPPACQKIMTCADPGEIANGHRTASDAGFPVGSHVQYRCLPGYSLEGAAMLTCYSRDTGTPK 
WSDRVPKCALKYEPCLNPGVPENGYQTLYKHHYQAGESLRFFCYEGFELIGEVTITCVPGHPS 
QWTSQPPLCKVTQTTDPSRQLEGGNLALAILLPLGLVIVLGSGVYIYYTKLQGKSLFGFSGSH 
SYSPITVESDFSNPLYEAGDTREYEVSI

Important features of the protein:
Signal peptide:

amino acids 1-27

Transmembrane domain:

amino acids 842-864

N-glycosylation sites.

amino acids 176-180, 222-226, 247-251, 332-336, 355-359, 373-377,
473-477, 517-521, 641-645

Tyrosine kinase phosphorylation site.

amino acids 61-69

N-myristoylation sites.

amino acids 2-8, 84-90, 111-117, 114-120, 190-196, 198-204,
235-241, 309-315, 333-339, 351-357, 472-478, 484-490, 528-534,
626-632, 665-671, 775-781, 842-848

Amidation site.

amino acids 384-388

Prokaryotic membrane lipoprotein lipid attachment site.

amino acids 12-23

CUB domain proteins profile.

amino acids 202-218, 376-392, 553-569
FIGURE 113



GCCGCGGGCGGAGCTGCCTGCCGGTCCCGCGCCGCGCGTCCGCACTCCTCGGCCCTCGGGCGGTCGATGGGACGG 
GGCGCCGCGGAGCAGGAGGCGGCGCCCGTCGGGGTGCTCGGGCCGCGCGGGAGCCCACTGTGGGGCTCGGGCATG 
GCGGGCCGCAGGACCTGAGCTCTCCTCAGGGGAGCGGGGAGGCAGCTGCTGGCCGGCGATGGGGACGGAGTGGGG
CCGTCGCCGCCGCGCCGAGCCGTGAGCGCCGAGCCACCGCCGCCGCTACCTCAGCCCTTCGCGAAGCGCCGGGCA 
GCTCGGGAACATGGCCCTGGAGCGGCTCTGCTCGGTCCTCAAAGTGTTGTTAATAACAGTACTGGTAGTGGAAGG 
GATTGCCGTGGCCCAAAAAACCCAAGATGGACAAAATATTGGAATCAAGCATATTCCTGCAACCCAGTGTGGCAT 
TTGGGTTCGAACCAGCAATGGAGGTCATTTTGCTTCGCCAAATTATCCTGACTCATATCCACCAAACAAGGAGTG 
TATCTACATTTTGGAAGCTGCTCCACGTCAAAGAATAGAGTTGACCTTTGATGAACATTATTATATAGAACCATC 
ATTTGAGTGTCGGTTTGATCACTTGGAAGTTCGAGATGGGCCATTTGGTTTCTCTCCTCTTATAGATCGTTACTG 
TGGCGTGAAAAGCCCTCCATTAATTAGATCAACAGGGAGATTCATGTGGATTAAGTTTAGTTCTGATGAAGAGCT 
TGAAGGACTGGGATTTCGAGCAAAATATTCATTTATTCCAGATCCAGACTTTACTTACCTAGGAGGTATTTTAAA 
TCCCATTCCAGATTGTCAGTTCGAGCTCTCGGGAGCTGATGGAATAGTGCGCTCTAGTCAGGTAGAACAAGAGGA 
GAAAACAAAACCAGGCCAAGCCGTTGATTGCATCTGGACCATTAAAGCCACTCCAAAAGCTAAGATTTATTTGAG 
GTTCCTAGATTATCAAATGGAGCACTCAAATGAATGCAAGAGAAACTTCGTTGCAGTCTATGATGGAAGCAGTTC
TATTGAAAACCTGAAGGCCAAGTTTTGCAGCACTGTGGCCAATGATGTAATGCTTAAAACAGGAATTGGAGTGAT 
TCGAATGTGGGCAGATGAAGGTAGTCGGCTTAGCAGGTTTCGAATGCTCTTTACTTCCTTTGTGGAGCCTCCCTG 
CACAAGCAGCACTTTCTTTTGCCATAGCAACATGTGCATCAATAATTCTTTAGTCTGTAATGGTGTCCAAAATTG
TGCATACCCTTGGGATGAAAATCATTGTAAAGAAAAGAAAAAAGCAGGAGTATTTGAACAAATCACTAAGACTCA 
TGGAACAATTATTGGCATTACTTCAGGGATTGTCTTGGTCCTTCTCATTATTTCTATTTTAGTACAAGTGAAACA 
GCCTCGAAAAAAGGTCATGGCTTGCAAAACCGCTTTTAATAAAACCGGGTTCCAAGAAGTGTTTGATCCTCCTCA 
TTATGAACTGTTTTCACTAAGGGACAAAGAGATTTCTGCAGACCTGGCAGACTTGTCGGAAGAATTGGACAACTA 
CCAGAAGATGCGGCGCTCCTCOACCGCCTCCCGCTGCATCCACGACCACCACTGTGGGTCGCAGGCCTCCAGCGT 
CAAACAAAGCAGGACCAACCTCAGTTCCATGGAACTTCCTTTCCGAAATGACTTTGCACAACCACAGCCAATGAA 
AACATTTAATAGCACCTTCAAGAAAAGTAGTTACACTTTCAAACAGGGACATGAGTGCCCTGAGCAGGCCCTGGA 
AGACCGAGTAATGGAGGAGATTCCCTGTGAAATTTATGTCAGGGGGCGAGAAGATTCTGCACAAGCATCCATATC 
CATTGACTTCSA^TCTTCTGCTAATGGTGATGTGAATTCTTAGGGTGTGTACGTACGCAGCCTCCAGGGCACCAT 
ACTGTTTCCAGCAGCCAACCCTTTTCTCCCATCACAACTACGAAGACCTTGATTTACCGTTAACCTATTGTATGG 
TGATGTTTTTATTCTCTCAGGCAGTCTATATATGTTAAACCAATCAAGGAACTTACTCTATTCAGTGGAAACAAT 
AATCATCTCTATTGCTTGGTGTCATTTATAGGAAGCACTGCCAGTTAAAGAGCATTAGAAGAGGTGGTTGGATGG 
AGCCAGGCTCAGGCTGCCTCTTCGTTTTAGCAACAAGAAGACTGCTCTTGACTGATAACAGCTCTGTCAATATTT 
TGATGCCACAATAAACTTGATTTTTTTTTACATTCCTTTTATTTTTCCTTTCTCTAAATTTAATTTGTTTTATAA 
GCCTATCGTTTTACCATTTCATTTTCTTACATAAGTACAAGTGGTTAATGTACCACATACTTCAGTATAGGCATT 
TGTTCTTGAGTGTGTCAAAATACAGCTAGTTACTGTGCCAATTAAGACCCAGTTGTATTTCACCCATCTGTTTCT 
TCTTGGCTAATCTCTGTACTTCTGCCTTTTAATTACTGGGCCCTTATTCCTTATTTTCTGTGAGAAATAATAGAT 
GATATGATTTATTACCTTTCAATTATATTTTTCTCAGTTATACTAGAAAATTTCATAATCCTGGGATATATGTAC 
CATTGTCAGCTATGACTAAAAATTTGAAAAAGATAAAAATTTCTAGCAAGCCTTTGAAGTTTACCAAGTATAGTC 
ACATTCAGTGACAGCCCATTCATTCCAGTAAAGAATCATTTCATTCACTTTGGGAGAGGCCTATAATTACATTTA
TTTGCAATGTTTCTCTTCGCTAGATTGTTACATAGCTCCCATTCTGTTGGTTTTGCTTACAGCATATGGTAACCA 
AGGTTAGATGCCAGTTAAAATTCCTTAGAAATTGGATGAGCCTTGAGATTGCTTCTTAACTGGGACATGACATTT 
TTCTAGCTCTTATCAAGAATAACAACTTCCACTTTTTTTTAAACTGCACTTTTGACTTTTTTTATGGTATAAAAA 
CAATAATTTATAAACATAAAAGCTCATTGTGTTTTTTAGACTTTTGATATTATTTGATACTGTACAAACTTTATT
AAATCAAGATGAAAGACCTACAGGACAGATTCCTTTCAGTGTTCACATCAGTGGCTTTGTATGCAAATATGCTGT
GTTGGACCTGGACGCTATAACTTATTGTAAAGACCTTGGAAATGTGGACATAAGCTCTTTCTTTCCTTTTGTTAC 
TGTATTTAGTTTGTGATAAATTTTTCACTGTGTGATATTTATGCTCTAAATCACTACACAAATCCCATATTAAAA 
TATACATTGTACCTGAAAAAAAA 
FIGURE 114

MALERLCSVLKVLLITVLWEGIAVAQKTQDGQNIGIKHIPATQCGIWVRTSNGGHFASPNYP 
DSYPPNKECIYILEAAPRQRIELTFDEHYYIEPSFECRFDHLEVRDGPFGFSPLIDRYCGVKS 
PPLIRSTGRFMWIKFSSDEELEGLGFRAKYSFIPDPDFTYLGGILNPIPDCQFELSGADGIVR 
SSQVEQEEKTKPGQAVDCIWTIKATPKAKIYLRFLDYQMEHSNECKRNFVAVYDGSSSIENLK 
AKFCSTVANDVMLKTGIGVIRMWADEGSRLSRFRMLFTSFVEPPCTSSTFFCHSNMCINNSLV 
CNGVQNCAYPWDENHCKEKKKAGVFEQITKTHGTIIGITSGIVLVLLIISILVQVKQPRKKVM 
ACKTAFNKTGFQEVFDPPHYELFSLRDKEISADLADLSEELDNYQKMRRSSTASRCIHDHHCG 
SQASSVKQSRTNLSSMELPFRNDFAQPQPMKTFNSTFKKSSYTFKQGHECPEQALEDRVMEEI 
PCEIYVRGREDSAQASISIDF

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 348-369 

N-glycosylation sites. 

amino acids 311-315, 385-389, 453-457, 475-479 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 426-430, 479-483 

N-myristoylation sites. 

amino acids 22-28, 32-38, 54-60, 186-192, 279-285, 318-324, 
348-354, 352-358, 441-447 
FIGURE 115

GGTCTCTGTCCTTGGCTGTGGCTCCTGCGCTCTGGCTGAGCCATGTTCCTTCTCCTCGCCCTC 

CTCACTGAGCTTGGAAGACTGCAAGCCCACGAAGGTTCTGAAGGAATATTTCTGCATGTCACA 

GTTCCACGGAAGATTAAGTCAAATGACAGTGAAGTTTCAGAGAGGAAGATGATTTACATCATT 

ACAATTGATGGACAACCTTACACTCTACATCTCGGAAAACAATCATTCTTACCCCAGAACTTT

TTGGTTTATACATATAATGAAACTGGATCTTTGCATTCTGTGTCTCCATATTTTATGATGCAT 

TGCCATTACCAAGGATATGCTGCCGAATTTCCAAATTCATTTGTGACACTCAGTATATGTTCT 

GGTCTCAGGGGATTTCTCCAGTTTGAAAATATCAGTTATGGAATTGAACCAGTAGAATCTTCA 

GCAAGATTTGAGCATATAATTTATCAAATGAAAAATAATGATCCAAATGTATCCATTTTAGCA 

GTAAATTACAGTCATATTTGGCAGAAAGACCAGCCCTACAAAGTTCCTTTAAACTCACAGATA 

AAAAATCTTTCAAAACTATTACCCCAATATCTGGAAATATACATTATAGTGGAAAAAGCTTTG 

ATGTTTACCCAGTTCAAATTGACTGTTATACTGTCTTCCTTGGAATTGTGGTCAAATGAAAAC 

CAGATTTCCACCAGTGGGGATGCTGATGATATATTACAAAGATTTTTGGCATGGAAACGGGAC 

TATCTCATCCTACGGCCCCATGACATAGCATACTTACTTGTTTACAGGAAACATCCTAAATAT 

GTGGGAGCAACATTTCCTGGCACCGTATGCAATAAAAGCTATGATGCAGGTATTGCTATGTAT 

CCAGATGCAATAGGTTTGGAGGGATTTTCGGTTATTATAGCTCAACTGCTTGGCCTTAATGTA 

GGATTAACATATGATGACATCACTCAGTGTTTCTGTCTGAGAGCTACATGCATCATGAATCAT 

GAAGCAGTGAGTGCCAGTGGTAGAAAGATTTTTAGCAACTGCAGCATGCACGACTATAGATAT 

TTTGTTTCAAAATTTGAGACTAAATGCCTTCAGAAGCTTTCAAATTTGCAACCATTACATCAA 

AATCAACCAGTGTGTGGTAATGGGATTTTGGAATCCAATGAAGAATGTGACTGTGGTAATAAA 

AATGAATGTCAATTTAAGAAGTGCTGTGATTATAACACATGTAAACTGAAGGGCTCAGTAAAA 

TGTGGTTCTGGACCATGTTGTACATCAAAGTGTGAGTTGTCAATAGCAGGCACTCCATGTAGA 

AAGAGTATTGATCCAGAGTGTGATTTTACAGAGTACTGCAATGGAACCTCTAGTAATTGTGTT 

CCTGACACTTATGCACTGAATGGCCGTTTGTGCAAGTTGGGAACTGCCTATTGCTATAACGGA 

CAATGTCAAACTACTGATAACCAGTGTGCCAAGATATTTGGAAAAGGTGCTCAAGGTGCTCCA 

TTTGCCTGTTTTAAAGAAGTTAATTCTCTGCATGAAAGATCTGAAAACTGTGGTTTTAAAAAT 

TCACAACCATTACCTTGTGAACGGAAGGATGTTCTCTGTGGAAAATTAGCTTGTGTTCAGCCA 

CATAAAAATGCTAATAAAAGTGACGCTCAATCTACAGTTTATTCATATATTCAAGACCATGTA 

TGTGTATCTATAGCCACTGGTTCCTCCATGAGATCAGATGGAACAGACAATGCCTATGTGGCT 

GATGGCACCATGTGTGGTCCAGAAATGTACTGTGTAAATAAAACCTGCAGAAAAGTTCATTTA 

ATGGGATATAACTGTAATGCCACCACAAAATGCAAAGGGAAAGGGATATGTAATAATTTTGGT 

AATTGTCAATGCTTCCCTGGACATAGACCTCCAGATTGTAAATTCCAGTTTGGTTCCCCAGGG 

GGTAGTATTGATGATGGAAATTTTCAGAAATCTGGTGACTTTTATACTGAAAAAGGCTACAAT 

ACACACTGGAACAACTGGTTTATTCTGAGTTTCTGCATTTTTCTGCCGTTTTTCATAGTTTTC 

ACCACTGTGATCTTTAAAAGAAATGAAATAAGTAAATCATGTAACAGAGAGAATGCAGAGTAT 

AATCGTAATTCATCCGTTGTATCAGAAAGCGATGACGTGGGACATTSATATTGCACAGAACTT 

CCATAGCAAATAACCTAAAGGAACGAATGTGCTTTATTTATAACCTTACGTTATCCCCAATGC 

ATTGTAAATGTCAAACTTTTGGAAAATAAAGCCTGCGTGCCCTCCC 
FIGURE 116

MFLLLALLTELGRLQAHEGSEGIFLHVTVPRKIKSNDSEVSERKMIYIITIDGQPYTLHLGKQ 
SFLPQNFLVYTYNETGSLHSVSPYFMMHCHYQGYAAEFPNSFVTLSICSGLRGFLQFENISYG 
IEPVESSARFEHIIYQMKNNDPNVSILAVNYSHIWQKDQPYKVPLNSQIKNLSKLLPQYLEIY
IIVEKALMFTQFKLTVILSSLELWSNENQISTSGDADDILQRFLAWKRDYLILRPHDIAYLLV 
YRKHPKYVGATFPGTVCNKSYDAGIAMYPDAIGLEGFSVIIAQLLGLNVGLTYDDITQCFCLR 
ATCIMNHEAVSASGRKIFSNCSMHDYRYFVSKFETKCLQKLSNLQPLHQNQPVCGNGILESNE 
ECDCGNKNECQFKKCCDYNTCKLKGSVKCGSGPCCTSKCELSIAGTPCRKSIDPECDFTEYCN 
GTSSNCVPDTYALNGRLCKLGTAYCYNGQCQTTDNQCAKIFGKGAQGAPFACFKEVNSLHERS 
ENCGFKNSQPLPCERKDVLCGKLACVQPHKNANKSDAQSTVYSYIQDHVCVSIATGSSMRSDG 
TDNAYVADGTMCGPEMYCVNKTCRKVHLMGYNCNATTKCKGKGICNNFGNCQCFPGHRPPDCK 
FQFGSPGGSIDDGNFQKSGDFYTEKGYNTHWNNWFILSFCIFLPFFIVFTTVIFKRNEISKSC 
NRENAEYNRNSSVVSESDDVGH

Important features of the protein:
Signal peptide: 

amino acids 1-16 
Transmembrane domain: 
amino acids 665-684 
N-glycosylation sites.

amino acids 36-39, 76-79, 122-125, 149-152, 156-159, 177-180, 
270-273, 335-338, 441-444, 537-540, 587-590, 601-604, 703-706 
Casein kinase II phosphorylation sites. 

amino acids 74-77, 208-211, 221-224, 304-307, 337-340, 346-349, 
376-380, 415-418, 499-502, 639-642, 708-711 
Tyrosine kinase phosphorylation site.
amino acids 243-249 
N-myristoylation sites. 

amino acids 53-58, 79-84, 266-271, 298-303, 372-377, 403-408, 
408-413, 442-447, 462-467, 469-474, 488-493, 567-572, 610-615, 
616-621, 634-639 
Amidation site.
amino acids 328-331 
FIGURE 117

CCCACGCGTCCGCGGACGCGTGGGGCTCAGTGGGCGTCGCGCGAAGGCTAAGGGAGTGTGGCG 

GGCGGCTCCGGGAGCCAACATGCCTCGGTATGCGCAGCTGGTCATGGGCCCCGCGGGCAGCGG

GAAGAGCACCTACTGTGCCACCATGGTCCAGCACTGTGAAGCCCTCAACCGGTCTGTCCAAGT 

TGTAAACCTGGATCCAGCAGCAGAACACTTCAACTACTCCGTGATGGCTGACATCCGGGAACT 

GATCGAGGTGGATGATGTAATGGAGGATGATTCTCTGCGATTCGGTCCCAACGGAGGATTGGT 

ATTTTGCATGGAGTACTTTGCCAATAATTTTGACTGGCTGGAGAACTGTCTTGGCCATGTAGA 

GGACGACTATATCCTTTTTGATTGTCCAGGTCAGATTGAGTTGTACACTCACCTGCCTGTGAT 

GAAACATCTGGTCCAGCAGCTCGAGCAGTGGGAGTTCCGAGTCTGTGGAGTTTTTCTTGTTGA 

TTCTCAGTTCATGGTGGAGTCATTCAAGTTTATTTCTGGCATCTTGGCAGCCCTGAGTGCCAT 

GATCTCTCTAGAAATTCCGCAAGTCAACATCATGACAAAAATGGATCTGCTGAGTAAAAAAGC 

AAAAAAGGAAATTGAGAAATTTTTAGATCCAGACATGTATTCTTTATTAGAAGATTCTACAAG 

TGACTTAAGAAGCAAAAAATTCAAGAAACTGACTAAAGCTATATGTGGACTGATTGATGACTA 

CAGCATGGTTCGATTTTTACCTTACGATCAGTCAGATGAAGAAAGCATGAACATTGTATTGCA 

GCATATTGATTTTGCCATTCAATATGGAGAAGACCTAGAATTTAAAGAACCAAAGGAACGTGA 

AGATGAGTCTTCCTCTATGTTTGACGAATATTTTCAAGAATGCCAGGATGAAES^GAGTTTA 

CTAAAAGTAACCATCTAAAGAGCTTGTGGCCAAACCAGCAGAACATTCTTCTCTTCAAAGGAT 

GCAATAGTAGAAAGCTACTTATTTTAATGAAAAAAAGTAAAACTTCGTTCTTTATCAGCCTCA 

TGCCTGAATCAAATTTTTAATTATTCTGAAACTGCTGCTGTTTAAAGTGGAATCTTTTAGTAT 

TATAACAGCATCACTTTAGATTTTGTAAGTCAAAATTGAAATGAATGCACATAGATTTATATA 

TAAATTAGCACCTGAGCTAAGGTTAAGGCCGGTCTAAACTTATTTTCACTTTTTGTATTATTT 

TTGAGATGCAGGAATTACTGTAACAAAATATGTATGTCCGAAGGGAAAAAGCTGCAAGGATAT 

ATATAAGACCACTGCTTATCTGTATCTTCCCATTTTCCTATATTGAAAATGTATATTATTTAT 

ATAACTTAAAAAGTAAAAATAACTATGTTTTGAGAT 
FIGURE 118

MPRYAQLVMGPAGSGKSTYCATMVQHCEALNRSVQVVNLDPAAEHFNYSVMADIRELIEVDDV 
MEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGHVEDDYILFDCPGQIELYTHLPVMKHLVQQ 
LEQWEFRVCGVFLVDSQFMVESFKFISGILAALSAMISLEIPQVNIMTKMDLLSKKAKKEIEK 
FLDPDMYSLLEDSTSDLRSKKFKKLTKAICGLIDDYSMVRFLPYDQSDEESMNIVLQHIDFAI
QYGEDLEFKEPKEREDESSSMFDEYFQECQDE 

Important features of the protein:
Signal peptide: 

amino acids 1-29 

Transmembrane domain:

amino acids 151-170 

N-glycosylation sites.

amino acids 31-35, 47-51 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 212-216 

Tyrosine kinase phosphorylation site. 

amino acids 189-197 

N-myristoylation sites.

amino acids 13-19, 76-82, 154-160 

ATP/GTP-binding site motif A (P-loop).

amino acids 10-18
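
The motif-based features listed above can be cross-checked against the sequence itself. The following is a minimal sketch, not the method used to prepare the figure: it assumes the N-glycosylation and P-loop entries correspond to the common PROSITE-style patterns N-{P}-[ST]-{P} and [AG]-x(4)-G-K-[ST], written here as regular expressions, and it scans only the first 63 residues of the Figure 118 polypeptide. The names `SEQ`, `PATTERNS`, `find_sites`, and `SITES` are illustrative.

```python
import re

# First 63 residues of the Figure 118 polypeptide, copied from the listing.
SEQ = "MPRYAQLVMGPAGSGKSTYCATMVQHCEALNRSVQVVNLDPAAEHFNYSVMADIRELIEVDDV"

# Assumed PROSITE-style motif definitions, expressed as regular expressions.
PATTERNS = {
    "N-glycosylation": r"N[^P][ST][^P]",
    "P-loop": r"[AG].{4}GK[ST]",
}


def find_sites(seq, pattern):
    """Return (start, end, text) per match; 1-based, inclusive, overlaps allowed."""
    sites = []
    # The lookahead lets overlapping occurrences be reported as well.
    for m in re.finditer(f"(?=({pattern}))", seq):
        start = m.start(1) + 1
        text = m.group(1)
        sites.append((start, start + len(text) - 1, text))
    return sites


SITES = {name: find_sites(SEQ, pat) for name, pat in PATTERNS.items()}
for name, hits in SITES.items():
    print(name, hits)
```

Under these assumptions the scan reports N-glycosylation motifs starting at residues 31 and 47 and a P-loop motif starting at residue 10, which agrees with the start positions listed for this figure. The listed ranges (31-35, 47-51, 10-18) end one position beyond the matched residues, which may indicate that the end coordinate is exclusive.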
FIGURE 119

GGGCGCTGGGAGACACCGGACGCCCGCTCGGCTGCGCTGCGGCTCAGGCCCCCGCTCGGGCCC 

GACCCGCTCGGTCACCGCCGGCTCGGGCGCGCACCTGCCGGCTGCGGCCCCAGGGCCATGCGG 

AGGCCCACGAGGAGGCCGGCGGCCACGCGCATCCCGTAGCCCAGGTGGCCCAGGTCTGCACCG 

CGGCGGCCTCGGCGCCATGGAGCCCCCGTATTCGCTGACGGCGCACTACGATGAGTTCCAAGA 

GGTCAAGTACGTGAGCCGCTGCGGCGCGGGGGGCGCGCGCGGGGCCTCCCTGCCCCCGGGCTT 

CCCGTTGGGCGCTGCGCGCAGCGTCACCGGGGCCCGGTCCGGGCTGCCGCGCTGGAACCGGCG 

CGAGGTGTGCCTGCTGTCGGGGCTGGTGTTCGCCGCCGGCCTCTGCGCCATTCTGGCGGCTAT 

GCTGGCCCTCAAGTACCTGGGCCCGGTCGCGGCCGGCGGCGGCGCCTGTCCCGAGGGCTGCCC 

TGAGCGCAAGGCCTTCGCGCGCGCCGCTCGCTTCCTGGCCGCCAACCTGGACGCCAGCATCGA 

CCCATGCCAGGACTTCTACTCGTTCGCCTGCGGCGGTTGGCTGCGGCGCCACGCCATCCCCGA 

CGACAAGCTCACCTATGGCACCATCGCGGCCATCGGCGAGCAAAACGAGGAGCGCCTACGGCG 

CCTGCTGGCGCGGCCCGGGGGTGGGCCTGGCGGCGCGGCCCAGCGCAAGGTGCGCGCCTTCTT 

CCGCTCGTGCCTCGACATGCGCGAGATCGAGCGACTGGGCCCGCGACCCATGCTAGAGGTCAT 

CGAGGACTGCGGGGGCTGGGACCTGGGCGGCGCGGAGGAGCGTCCGGGGGTCGCGGCGCGATG 

GGACCTCAACCGGCTGCTGTACAAGGCGCAGGGCGTGTACAGCGCCGCCGCGCTCTTCTCGCT 

CACGGTCAGCCTGGACGACAGGAACTCCTCGCGCTACGTCATCCGCATTGACCAGGATGGGCT 

CACCCTGCCAGAGAGGACCCTGTACCTCGCTCAGGATGAGGACAGTGAGAAGATCCTGGCAGC 

ATACAGGGTGTTCATGGAGCGAGTGCTCAGCCTCCTGGGTGCAGACGCTGTGGAACAGAAGGC 

CCAAGAGATCCTGCAAGTGGAGCAGCAGCTGGCCAACATCACTGTGTCAGAGTATGACGACCT 

ACGGCGAGATGTCAGCTCCATGTACAACAAGGTGACGCTGGGGCAGCTGCAGAAGATCACCCC 

CCACTTGCGGTGGAAGTGGCTGCTAGACCAGATCTTCCAGGAGGACTTCTCAGAGGAAGAGGA 

GGTGGTGCTGCTGGCGACAGACTACATGCAGCAGGTGTCGCAGCTCATCCGCTCCACACCCCA 

CCGGGTCCTGCACAACTACCTGGTGTGGCGCGTGGTGGTGGTCCTGAGTGAACACCTGTCCCC 

GCCATTCCGTGAGGCACTGCACGAGCTGGCACAGGAGATGGAGGGCAGCGACAAGCCACAGGA 

GCTGGCCCGGGTCTGCTTGGGCCAGGCCAATCGCCACTTTGGCATGGCGCTTGGCGCCCTCTT 

TGTACATGAGCACTTCTCAGCCGCCAGCAAAGCCAAGGTGCAGCAGCTAGTGGAAGACATCAA 

GTACATCCTGGGCCAGCGCCTGGAGGAGCTGGACTGGATGGACGCCGAGACCAGGGCTGCTGC 

TCGGGCCAAGCTCCAGTACATGATGGTGATGGTCGGCTACCCGGACTTCCTGCTGAAACCCGA 

TGCTGTGGACAAGGAGTATGAGTTTGAGGTCCATGAGAAGACCTACTTCAAGAACATCTTGAA 

CAGCATCCCCTTCAGCATCCAGCTCTCAGTTAAGAAGATTCGGCAGGAGGTGGACAAGTCCAC 

GTGGCTGCTCCCCCCACAGGCGCTCAATGCCTACTATCTACCCAACAAGAACCAGATGGTGTT 

CCCCGCGGGCATCCTGCAGCCCACCCTGTACGACCCTGACTTCCCACAGTCTCTCAACTACGG 

GGGCATCGGCACCATCATTGGACATGAGCTGACCCACGGCTACGACGACTGGGGGGGCCAGTA 

TGACCGCTCAGGGAACCTGCTGCACTGGTGGACGGAGGCCTCCTACAGCCGCTTCCTGCGAAA 

GGCTGAGTGCATCGTCCGTCTCTATGACAACTTCACTGTCTACAACCAGCGGGTGAACGGGAA 

ACACACGCTTGGGGAGAACATCGCAGATATGGGCGTCCTCAAGCTGGCCTACCACGCCTATCA 

GAAGTGGGTGCGGGAGCACGGCCCAGAGCACCCACTTCCCCGGCTCAAGTACACACATGACCA 

GCTCTTCTTCATTGCCTTTGCCCAGAACTGGTGCATCAAGCGGCGGTCGCAGTCCATCTACCT 

GCAGGTGCTGACTGACAAGCATGCCCCTGAGCACTACAGGGTGCTGGGCAGTGTGTCCCAGTT 

TGAGGAGTTTGGCCGGGCTTTCCACTGTCCCAAGGACTCACCCATGAACCCTGCCCACAAGTG 

TTCCGTGTGGTGAGCCTGGCTGCCCGCCTGCACGCCCCCACTGCCCCCGCACGAATCACCTCC

TGCTGGCTACCGGGGCAGGCATGCACCCGGTGCCAGCCCCGCTCTGGGCACCACCTGCCTTCC 

AGCCCCTCCAGGACCCGGTCCCCCTGCTGCCCCTCACTTCAGGAGGGGCCTGGAGCAGGGTGA 

GGCTGGACTTTGGGGGGCTGTGAGGGAAATATACTGGGGTCCCCAGATTCTGCTCTAAGGGGG 

CCAGACCCTCTGCCAGGCTGGATTGTACGGGCCCCACCTTCGCTGTGTTCTTGCTGCAAAGTC 

TGGTCAATAAATCACTGCACTGTTAAAAAAAAA 
FIGURE 120

MEPPYSLTAHYDEFQEVKYVSRCGAGGARGASLPPGFPLGAARSVTGARSGLPRWNRREVCLL 
SGLVFAAGLCAILAAMLALKYLGPVAAGGGACPEGCPERKAFARAARFLAANLDASIDPCQDF 
YSFACGGWLRRHAIPDDKLTYGTIAAIGEQNEERLRRLLARPGGGPGGAAQRKVRAFFRSCLD 
MREIERLGPRPMLEVIEDCGGWDLGGAEERPGVAARWDLNRLLYKAQGVYSAAALFSLTVSLD 
DRNSSRYVIRIDQDGLTLPERTLYLAQDEDSEKILAAYRVFMERVLSLLGADAVEQKAQEILQ 
VEQQLANITVSEYDDLRRDVSSMYNKVTLGQLQKITPHLRWKWLLDQIFQEDFSEEEEVVLLA
TDYMQQVSQLIRSTPHRVLHNYLVWRVVVVLSEHLSPPFREALHELAQEMEGSDKPQELARVC
LGQANRHFGMALGALFVHEHFSAASKAKVQQLVEDIKYILGQRLEELDWMDAETRAAARAKLQ 
YMMVMVGYPDFLLKPDAVDKEYEFEVHEKTYFKNILNSIPFSIQLSVKKIRQEVDKSTWLLPP 
QALNAYYLPNKNQMVFPAGILQPTLYDPDFPQSLNYGGIGTIIGHELTHGYDDWGGQYDRSGN 
LLHWWTEASYSRFLRKAECIVRLYDNFTVYNQRVNGKHTLGENIADMGVLKLAYHAYQKWVRE
HGPEHPLPRLKYTHDQLFFIAFAQNWCIKRRSQSIYLQVLTDKHAPEHYRVLGSVSQFEEFGR 
AFHCPKDSPMNPAHKCSVW 

Important features of the protein: 
Transmembrane domain: 

amino acids 64-88 

N-glycosylation sites. 

amino acids 255-259, 322-326, 656-660 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 722-726 
N-myristoylation site. 

amino acids 24-30, 26-32, 27-33, 40-46, 47-53, 65-71, 148-154,
169-175, 170-176, 237-243, 450-456, 604-610, 607-613

Prokaryotic membrane lipoprotein lipid attachment site.

amino acids 85-96 

Prenyl group binding site.

amino acids 772-777 

Neutral zinc metallopeptidases, zinc-binding region signature.

amino acids 609-619 

Neutral zinc metallopeptidases, zinc-binding region proteins. 

amino acids 609-619 
FIGURE 121

CGGACTGCCCGGACCGCGCGATGGAGTCGACCGGCAGCGTCGGGGAGGCCCCGGGCGGACCCC

GGGTGCTGGTGGTGGGCGGCGGCATCGCGGGGCTGGGCGCGGCGCAGAGGCTCTGCGGCCACT 

CCGCCTTCCCGCACCTGCGGGTCCTGGAGGCCACGGCCCGCGCCGGGGGCCGCATCCGCTCGG 

AGCGCTGCTTCGGTGGCGTGGTGGAGGTGGGCGCGCACTGGATCCATGGGCCCTCCCGGGGTA 

ACCCCGTCTTCCAGCTGGCTGCTGAGTACGGGCTGCTGGGGGAGAAGGAGCTGTCCCAGGAGA 

ACCAGCTGGTGGAGACCGGGGGTCACGTGGGCCTGCCCTCCGTGAGCTACGCCAGCTCCGGGG 

CCAGCGTGAGCCTCCAGCTGGTGGCGGAGATGGCGACTCTGTTCTACGGCCTGATAGACCAGA 

CCCGGGAGTTCCTGCACGCTGCAGAGACCCCGGTGCCCAGCGTCGGGGAGTACCTCAAGAAGG 

AGATTGGCCAGCACGTGGCCGGCTGGACAGAGGATGAGGAGACCAGGAAGCTGAAGCTGGCCG 

TCCTGAACTCCTTCTTCAACCTGGAATGCTGTGTGAGCGGCACCCACAGCATGGACCTGGTGG 

CCCTGGCACCCTTTGGGGAGTATACCGTGCTGCCGGGGCTGGACTGCACCTTTTCTAAGGGCT 

ATCAAGGACTCACAAACTGCATGATGGCCGCCCTGCCGGAGGACACTGTAGTTTTTGAGAAGC 

CTGTGAAGACCATCCACTGGAACGGGTCCTTCCAGGAGGCAGCCTTTCCCGGGGAGACCTTTC 

CAGTGTCGGTAGAGTGTGAGGATGGAGACCGGTTCCCGGCGCACCATGTCATCGTCACCGTGC 

CCTTAGGTTTTCTTAGGGAACATTTGGACACCTTCTTTGACCCTCCCCTGCCGGCTGAGAAGG 

CAGAAGCAATCAGGAAGATAGGCTTTGGGACCAACAACAAAATCTTCCTGGAGTTTGAGGAGC 

CCTTCTGGGAGCCAGACTGCCAGCTGATCCAGCTGGTGTGGGAGGACACGTCGCCCCTGGAGG 

ATGCTGCCCCTGAGCTACAGGACGCCTGGTTCCGGAAGCTCATTGGCTTTGTGGTCCTGCCTG 

CCTTTGCGTCTGTCCACGTTCTCTGTGGGTTCATTGCCGGACTTGAGTCTGAGTTCATGGAGA 

CTCTGTCGGATGAAGAAGTACTTCTGTGTCTCACCCAAGTGCTCCGGAGAGTGACAGGAAACC 

CACGGCTCCCCGCGCCCAAGAGCGTCCTGCGGTCTCGCTGGCACAGCGCCCCGTACACTAGGG 

GGTCCTACAGCTACGTGGCCGTGGGCAGTACTGGGGGCGACCTGGACCTGCTGGCTCAGCCCC 

TCCCTGCAGACGGCGCCGGCGCCCAGCTCCAGATCCTGTTTGCGGGGGAAGCCACACATCGCA 

CGTTTTACTCCACGACGCACGGGGCTCTGCTGTCGGGATGGAGGGAGGCCGACCGCCTCCTCA 

GTCTGTGGGCCCCGGAGGTGCAGCAGCCCAGGCCGAGGCTC2ASCTGGGCCCAGCCTACTCTG 

TTCCACCCGTGTCGGGGGTAGGCTGGGACCGTCATTTCTTCTGACAGATTTCAGTGTGGCTTG 

AAATTTGGGGATGTTAATGAGGGTCCTCTGGTTTTTGGTAACCAGGGCCACCTTCTCAGTTCT 

TGTGTCTGTTATTGGAGTCTGGCCAGGGTTGACTTGAGCTGAGACACCAGATGCTCACGGAGA 

TGCTGGACACATAAAGCAAGTTACAGCCACAAAAAAAAAAAA 
FIGURE 122



MESTGSVGEAPGGPRVLVVGGGIAGLGAAQRLCGHSAFPHLRVLEATARAGGRIRSERCFGGV 
VEVGAHWIHGPSRGNPVFQLAAEYGLLGEKELSQENQLVETGGHVGLPSVSYASSGASVSLQL 
VAEMATLFYGLIDQTREFLHAAETPVPSVGEYLKKEIGQHVAGWTEDEETRKLKLAVLNSFFN 
LECCVSGTHSMDLVALAPFGEYTVLPGLDCTFSKGYQGLTNCMMAALPEDTVVFEKPVKTIHW
NGSFQEAAFPGETFPVSVECEDGDRFPAHHVIVTVPLGFLREHLDTFFDPPLPAEKAEAIRKI 
GFGTNNKIFLEFEEPFWEPDCQLIQLVWEDTSPLEDAAPELQDAWFRKLIGFVVLPAFASVHV
LCGFIAGLESEFMETLSDEEVLLCLTQVLRRVTGNPRLPAPKSVLRSRWHSAPYTRGSYSYVA 
VGSTGGDLDLLAQPLPADGAGAQLQILFAGEATHRTFYSTTHGALLSGWREADRLLSLWAPQV 
QQPRPRL 

Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 364-385 

N-glycosylation site. 

amino acids 253-257 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 408-412 

N-myristoylation sites.

amino acids 20-26, 21-27, 25-31, 105-111, 119-125, 164-170, 
216-222, 227-233, 443-449, 484-490 

Aminooxidase Flavin containing amine oxidase: 

amino acids 23-497 
FIGURE 123

CGGACGCGTGGGGGAAGATGGATAAATAATTCTGTCACACGTGCCCTGGCCTCTGGAGCTCAGCTGCCAGTCCAC 
GTCTAGGGAATCTTAGCATCTGGGACCAAGACACTTTACAGCAATCATCACCCTTTGCAGAGGAGGTGAGCTCAC 
CAGGACTCATCTGCCATTTCAGACCTTTTGCTGCTACCTGCCAGGTGGCCCCCACTGCTGACGAGAGATGGTGGA 
TCTCTCAGTCTCCCCGGACTCCTTGAAGCCAGTATCGCTGACCAGCAGTCTTGTCTTCCTCATGCACCTCCTCCT 
CCTTCAGCCTGGGGAGCCGAGCTCAGAGGTCAAGGTGCTAGGCCCTGAGTATCCCATCCTGGCCCTCGTCGGGGA 
GGAGGTGGAGTTCCCGTGCCACCTATGGCCACAGCTGGATGCCCAGCAAATGGAGATCCGCTGGTTCCGGAGTCA 

^^^S^^^^^^^^<^^^<=^CATCGCCTATGGCAGCGTGGTCCTGCAGCTTCACAGCATCATCCCCTCTGACAA 
^^^^^^™^^^^'^^^^^^'^^^^^^^^^°^C^CT'rCTCTGGCGAAGCTCTCTGGGAACTGGAGGTAGCAGGGCT 

^J^S^ S^^^'^^^*^^°^®^*^*^AC^C^«5<5ACAGTGCCTGCCTCCAGAGTTTGAAGCCATCGTCTG 
S^^IS? '^'^^'^^^'''^^^^''^''''^^'^CATCTGTGGTTGTCCGAGCGGGAGCCCTCAGCAATGTGTCCGTCT^ 
~^^5S^°^'^^^^'^^'^'^°^^^^*'^^^^^<5^<5TTGGTGGTCCAGATAGCAGACGTGTTCGTACCCGGAGCCTC 
^^^^^'"■^^■^^^^<^^^^^^CC<5CTGCTGTTGGTCCTCGCGGCGCTGGCGCTGGGCGTCCTCCG 
^^™S?^'^'^^^*^^'^'^^'^^'^<^^^<^^<^<^AGGCGGAGAAGAGACAAGAGAAACTCACTGCAGAGCTGGA 
^S^P °^^^°^*^^^^^'^^'2^°^C<^<^<^CTGAAGGCCAGGCTGAGTGGAGAGCAGCCCAAAAATATGCAGT 
GGATGTGACGCTGGACCCGGCCTCGGCGCACCCCAGCCTGGAGGTGTCGGAGGATGGCAAGAGCGTGTCTTCCCG 
^™S5£™^^'^^^'^'^'^*^^<^'^C°^<^'r<^SCCACCCGCAGCGGTTCTCGGAGCAGACGTGCGCGCTGAGCCTGGAGCG 
GTTCTCCGCCGGCCGCCACTACTGGGAGGTGCACGTGGGCCGCCGCAGCCGCTGGTTCCTGGGCGCCTGCCTGGC 
CGCGGTGCCGCGCGCGGGGCCTGCGCGCCTGAGCCCTGCGGCCGGCTACTGGGTGCTGGGGCTGTGGAACGGCTG 
CGAGTACTTCGTCCTGGCCCCGCACCGCGTCGCGCTCACCCTGCGCGTGCCCCCGCGGCGCCTGGGCGTCTTCCT 
^S^^]^^9<^^^^^C^^*G*GCTGTCCTTCTTCAACGTGTCCGACGGCTCCCACATCTTCACCTTCCACGACACCTT 
CTCGGGCGCGCTCTGTGCGTACTTCAGGCCCAGGGCCCACGACGGCGGCGAACATCCGGATCCCCTGACCATCTG 
CCCGCTGCCGGTTAGAGGGACGGGCGTCCCCGAAGAGAACGACAGTGACACCTGGCTACAGCCCTATGA6CCCGC 
GGACCCCGCCCTGGACTGGTGGTGAGGCGCCCTCGTGGCCGCGGGACTGGCCCCGGGGGGCCCCCTGGATCCCAG 
^^^^^»^^^^!^^^^^^^'^°^^^^°^^^<^^^^^C*GSTGCACCAGCCAAAATGTCAGCG^^ 
SSS^S^^^r^^^^^^'^'^^^^^^'^^^^^T^^T^^TGCGATTTTCTTCAAGGAAAGGAGACAAGTCCAA^^ 
^^I^JS°*^^<^T<5GGACTGAGCGAAGGAGTACAAATATATCCACGTCGCTCAGAGCTGGGGTGCTCACGGTGGGC 
GGTGGGCAAGAAGCCAGCATGGAAGAAAGAAGGGAGAAAACTTTGGTGACTGCCTTAGAGGGATCAGTTAATTTG 

S^I^o^^^*^^*^°®^^°ATCCTAGTTTTAAAAACCATTTCCCATTAAAATGAAGTTGGAGGAACAGCTGCT 

TCTGAGCCGGGGCAAAAATTTCAAGGTGAGCCTGGAGCATTGTGTGTGGTGAAGTAAAATAAAGGCTCAAAACGT 

^»^^S^^^'^*^*^^^^^*^°®^'^^®^°^^^C^°<^CCGAAGGGGCCTCACTGACCAATTGTGGGACAATTTG 

,™™^»^^'^'''^^^°°^'^^*^^'^'^^'^^^A<^ACTGAATAAAAACATAATCCATGAGTTCATGCTGATACTCAA 

»^^™'^'^'^^^^'^'^^<^^^A^<^^GGTTTCTTTTGGAGGTGAAATCTAATTATTGGTGAGAGTCTTGGAGA 

^S™™P*^^^'^^^^^'^*^'^'^^^^^*^''^CCTTATACACTACTTATAAGTTTGAAAGGGGAAAGGTTACCTTTAC 

AATGGAGACATCTACCAGATCATCCAAGTGATTAAATTTAACATCATCAATGATGGGACCAAGGACATTATTAGT 

TTGACAACTGGGGAAAGAAGTGTTCTTCACCCCCTACCCCCAAGACATTCTCTCTGTCGGCCAGGCTGGAGTGCA 

gcctcaacctcctgggcccaagtgatcctcccacctcagcacacaacaccatgcccaaStSScctS?m 

AGACGGGGGTCTCACTTTGTTACCCAGGCTGGTCTCAAACTCCTGCGCTCAAGCAATCCTCCCACCTGGGCCTCC 

S^^^^'^'^^^'^^^^^^^^^'^'^^^^'^^^^^'f'^TGCCTGGCTTCATTTTCAGAGTGAGACATTTGTACTGTGGCTA 

TGTAGGAGAACATTCTTGTTCTTAGCAAACATACTGAAGTTTTTAGATATTAATTACCACAGTGTCTGCCACTGA 

ATTTCCAGTGACTAAGTGGAAAAATATAAAACATATGAATATAAAGAAAGAAAGAGACAAGTCAAATGTAGTAAA 

ATGACAACACTTGGTGACTCTAGGTGACTGGTCGACAGATGTTCATTGTACTATCAATGTGGCTTTGCTGTGGGT 

TTGAAATTTTGCAAACTAAGAGTTGGGTGGCGGGGAGAAGGATACACCAAAAAACTAAGTGATTATCTTTGGATG 

GGAAAATGTTTGGTAATTGCATTCTTAAAATGTCTTCTTTGTATTTTTTAATGTTCAATAATGTATATGrATCAG 
TTCTGTAATAAAGGGGAAAACACTTTTCA 
FIGURE 124

MVDLSVSPDSLKPVSLTSSLVFLMHLLLLQPGEPSSEVKVLGPEYPILALVGEEVEFPCHLWP 
QLDAQQMEIRWFRSQTFNVVHLYQEQQELPGRQMPAFRNRTKLVKDDIAYGSVVLQLHSIIPS
DKGTYGCRFHSDNFSGEALWELEVAGLGSDPHLSLEGFKEGGIQLRLRSSGWYPKPKVQWRDH 
QGQCLPPEFEAIVWDAQDLFSLETSVVVRAGALSNVSVSIQNLLLSQKKELVVQIADVFVPGA
SAWKSAFVATLPLLLVLAALALGVLRKQRRSREKLRKQAEKRQEKLTAELEKLQTELDWRRAE 
GQAEWRAAQKYAVDVTLDPASAHPSLEVSEDGKSVSSRGAPPGPAPGHPQRFSEQTCALSLER 
FSAGRHYWEVHVGRRSRWFLGACLAAVPRAGPARLSPAAGYWVLGLWNGCEYFVLAPHRVALT 
LRVPPRRLGVFLDYEAGELSFFNVSDGSHIFTFHDTFSGALCAYFRPRAHDGGEHPDPLTICP 
LPVRGTGVPEENDSDTWLQPYEPADPALDWW 

Important features of the protein:
Signal peptide:

amino acids 1-34 

Transmembrane domain: 

amino acids 247-272 

N-glycosylation sites. 

amino acids 102-106, 139-143, 224-228, 464-468, 516-520 

Tyrosine kinase phosphorylation site. 

amino acids 105-114 

N-myristoylation sites.

amino acids 129-135, 220-226, 399-405, 423-429, 480-486



Amidation site. 

amino acids 390-394 
FIGURE 125

TGAGCTAGGATTCTGCCTCTGCACTCCTAGCTGC^TGAOVGAGCAAGACCCTGTCTCAAfl^GAAAM^ 
^S^^^^^^^^^^^AC^TG^CAGTTCCTGGCCTCAAAGATCTTCCAAAGGAA^^ 

accaatgctgcaggaaaaagcaacatatttaagttatccaataacacctatccaataattgtStcS 
gacmggtagagttgtttatatttcttttccttttaggtgaaacaccattcaaagtcgtagtcSat^^^ 

n$Ji^f^'^''''^^°^"'^^''''^^CC"^^CCTTTGGACAGGAATGATGGAAcS^^ 

^^^^^^^^^^'^'''^'''^^^^^^^^^^^^^tagaggtcctttatggtgatgaacatgtggctca^^^ 

^™^'^'^^^^^^^'^^'=*=^^^^^^^ctgtgagtgtccggaagatcctcaggcctggcaSgact^^ 

tgtccaaccaaggaaccacagattgcaaaagattttgcttcctttcccagcatcaatctccagcaaa 

gaagtccccaaaaggtttggggatgagagaggtgccattgttcattacacgattctcSt^^ 

agatctttagggaaatacacagacttcaagatgttctctgatgagattttg™^ 

cccatacctatcatttcatggtgtggctctctggattcaagagatgttgtccttccaacgtatSStSS^^ 

I^^^I^^J»^^^''''''^''^^^''^^^^^^^^^^atctcctctctattcagggaKa^^^ 

^»^^^'^^*^^°^°^^'^^^^^^^^^<^gtagagacagccgagaggagaggctccagttggtac^^ 

agatatccatatctcatgctgggcgacagtctggttttaaagcaggactcgccatattatcaacatct 
gcactagaaccttggaagcattatgttccaattaaaagaaatctgSStttattaS^ 
^^^^^^'''''^'^^^^^^'T'^caaaagaaggacagttgLggctaSg^^ 
aggctttactgctactattacovagtactgcagaaatatgccgagcgccagtccagcaaacccg^^ 

^^^^^^^^^^^^^^^^^^^^'^^ctgtgtatcccggctacactttaaggaaagattgaatctaagctgt 

St^S^o^^'^'^^'^^'^'^^^^^^^^^^'^TGGACTAGTTCTCCCGGTGGCTTTATATATGTAGATGG^^^ 

tactggttgagtatccctcatctgaaatgcttaggaccaggagtgtttcaggcttcagatStta^^^ 
aatatttgcatgtacataatgaggtatcttggggatgagatccaagtctaaaScSttSS^ 
atataccttgttcacataccctgaaggtaattttatataatatttttaataatttgtgcatSScaa^^ 
^^^^^!3^^^^^^'^^<^^gcaaaggtgtcactatcttagcgacccaagtgSgct^^ 

ttggattttggggtatttcagattttagatttttgtatgaggaatgttcaacctgtatttcaacSSattac^^ 

aatatcattgaatattaatatcttttgcgtaaaaactgctattatcagcatStactSct^ 

tcgggatcatagccgatagagagacttgctaaaatataaatcagcctctgcaaaaSctSSatatt 

™f^J^T^TTATTGGTTTATTTCTATCCCCTGTTCACTTTTTCTCTTCCACTTCcSTS^^ 
^I»J»™''r'''"*'^^^^^''^^^^^CGTCACTGCATAATTTCTCCTCTTACAAGCTSSTT^ 

taacagcttccttttagaaggtctgataaggatatttaaggaagaagagaatgactctgttattSS^ 
^^^^S^'^^^^^'^^^^^^^'^tagtaattccaaatctatctgagttgtatacccatca^ 

TCAGGTGAAAGGCATTCATCATTTTTAAGCTGAAAAGGGGATCCTTGAAAACACTGAAAACCTCTACAACAATCT 
I^^™2^^'''''^*^^^''®=^T^^^^^'r^TAGGCCAAGAACAAAGGCAAGC^ 

].^^^,TIS^^^^^^^°^^^^'^'^^C^'''r<=^CAAATTTTAAAATATGTACTCTGGCCGGTTGCAGTGGCTCACG 

^ST^^^^'^^^^^^^^TTTGGCAGGCCGAGATGGGCGGATCTTTTGAGGTCAGGAGTTTGAGAC^^^ 

n^J^^^»^^'^^''''^^^<=^^<=^'™T^C'^TTAGCCAGGCATGGTGGCAT^^^^ 

?SS??5GGGT?SSr^^^^ 

^S^;^^™ ^°*^°^^^^^°^°^^^'^^^T^C^TCTCAAAAACTGAAAATAAAAATAAAAATATGT^^ 

TlA^n^»^3f''*''^^^^°^^^^T<^^^CTATTTTTAAAGTGGTTACATCT^ 

^™^o^^^"^''^''^'^^^^™CCAAATTTCAGAACAGTCTAATCAAT6Ca^GTATCCAA^ 

taatacccatccattgtgcaaaaccacaagcacgcaagtattaaataagagcaagctgtcctgS^^ 
atgaatttgtgtcttaaatattgtacattgtgtttgaggcttgtcaaaactgggSmScaa 

™^^»?^^^^*^"=^^^^^^^^^^^'=<^'f^^TCCCTCAAAAAATTCT^^^ 

^^^^rSr^'^^^^^^^'^'r'^CAGCCGCAGCCTTTAAAGAGTTAGTGTTAATGGCTTTCTGG^ 

»^S™™°^"'*^'^^"^^^'^^°^'^^T'^C^CCAATATCTGAGGAGAAGATGGACT^ 

JSIII??^^^r^^^^''''''^''^'^^^''"^^^^''^T'^T'^TTACTGTAAGTATGGATCAAAT^^ 

^^^^^S^^CTTAGAAAAGACAATAAATGCACAAAAGATATAAACAGGAACAGCAAATATTTATATTT^^ 
FIGURE 126

MVELFIFLFLLGETPFKVWKSLSPKELVRIHVPKPLDRNDGTFLMRYRMYETVDEGLklEVL 

YGDEHVAQSPYILKGPVYHEYCECPEDPQAWQKTLSCPTKEPQIAKDFASFPSINLQQMLKEV 

PKRFGDERGAIVHYTILNNHVYRRSLGKYTDFKMFSDEILLSLTRKVLLPDLEFYVNLGDWPL 

EHRKVNGTPSPIPIISWCGSLDSRDVVLPTYDITHSMLEAMRGVTNDLLSIQGNTGPSWINKT

ERAFFRGRDSREERLQLVQLSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQV 

NVDGTVT^YRYPYLMLGDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSDLLEKVKWAKEND 

EEAKKIAKEGQLMARDLLQPHRLYCYYYQVLQKYAERQSSKPEVRDGMELVPQPEDSTAICQC
HRKKPSREEL 

Important features of the protein:
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 250-254, 363-367 

cAMP- and cGMP-dependent protein kinase phosphorylation site.
amino acids 444-448 

N-myristoylation site. 

amino acids 208-214, 319-325, 388-394 

Endoplasmic reticulum targeting sequence. 

amino acids 448-453 

Mitochondrial energy transfer proteins signature. 

amino acids 25-34 
FIGURE 127

AGCCGTCGGAGGGAGCCGGAGCGCTTCTCCCGAGTTGGTGATAGATTGGTGGTCATCCAACAT 

GCAGAAATGAATGAGCAGTGAAAAGCAGCAGAGCCGATGGGTCATGAGGATGTAAGTGCGTTT 

GAAGGCTTCCACACCCTCTACTCCAGGAATCATGAATAAACTGGAGGATAAGCAGGACCAGAT 

GATACCATGAAGAGAAGTTTACAGGCCCTCTATTGCCAACTGTTAAGTTTCCTGCTGATCTTG

GCACTGACCGAAGCGCTGGCATTTGCCATCCAGGAACCATCTCCCAGGGAATCTCTTCAGGTC 

CTCCCTTCAGGCACTCCCCCGGGAACCATGGTGACAGCACCCCACAGCTCTACCAGACATACT 

TCTGTGGTGATGCTGACCCCCAATCCCGATGGACCCCCCTCACAGGCTGCAGCTCCCATGGCA 

ACACTGACACCCCGTGCAGAGGGGCACCCTCCTACGCACACCATCTCCACCATCGCTGCGACA 

GTAACCGCCCCCTATTCTGAAAGCTCCCTGTCCACAGGGCCCGCTCCAGCAGCCATGGCAACC 

ACATCCTCCAAGCCAGAGGGCCGCCCTCGAGGGCAGGCTGCCCCCACCATCCTGCTGACAAAG 

CCACCGGGGGCCACCAGCCGCCCCACCACAGCGCCCCCCCGCACTACCACACGCAGGCCCCCC 

AGGCCCCCAGGCTCTTCCCGAAAAGGGGCTGGTAATTCATCACGCCCTGTCCCGCCTGCACCT 

GGTGGCCACTCCAGGAGTAAAGAAGGACAGCGAGGACGAAATCCAAGCTCCACACCTCTGGGG 

CAGAAGCGGCCCCTGGGGAAAATCTTTCAGATCTACAAGGGCAACTTCACAGGGTCTGTGGAA 

CCAGAGCCCTCTACCCTCACCCCCAGGACCCCACTCTGGGGCTACTCCTCTTCACCACAGCCC 

CAGACAGTGGCTGCGACCACAGTGCCCAGCAATACCTCATGGGCACCCACCACCACCTCCCTG 

GGGCCTGCAAAGGACAAGCCAGGCCTTCGCAGAGCAGCCCAGGGGGGTGGTTCTACCTTCACC 

AGCCAAGGAGGGACACCAGATGCCACAGCAGCCTCAGGTGCCCCTGTCAGTCCACAAGCTGCC 

CCAGTGCCTTCTCAGCGCCCCCACCACGGTGACCCACAGGATGGCCCCAGCCATAGTGACTCT 

TGGCTTACTGTTACCCCTGGCACCAGCAGACCTCTGTCTACCAGCTCTGGGGTCTTCACGGCT 

GCCACGGGGCCCACCCCAGCTGCCTTCGATACCAGTGTCTCAGCCCCTTCCCAGGGGATTCCT 

CAGGGAGCATCCACAACCCCACAAGCTCCAACCCATCCCTCCAGGGTCTCAGAAAGCACTATT 

TCTGGAGCCAAGGAGGAGACTGTGGCCACCCTCACCATGACCGACCGGGTGCCCAGTCCTCTC 

TCCACAGTGGTATCCACAGCCACAGGCAATTTCCTCAACCGCCTGGTCCCCGCCGGGACCTGG 

AAGCCTGGGACAGCAGGGAACATCTCCCATGTGGCCGAGGGGGACAAACCGCAGCACAGAGCC 

ACCATCTGCCTGAGCAAGATGGATATCGCCTGGGTGATCCTGGCCATCAGCGTGCCCATCTCC 

TCCTGCTCTGTCCTGCTGACGGTGTGCTGCATGAAGAGGAAGAAGAAGACCGCCAACCCGGAG 

AACAACCTGAGCTACTGGAACAACACCATCACCATGGACTACTTCAACAGGCATGCTGTGGAG 

CTGCCCAGGGAGATCCAGTCCCTTGAAACCTCTGAGGACCAGCTCTCAGAGCCCCGCTCCCCA 

GCCAATGGCGACTATAGAGACACTGGGATGGTCCTTGTTAACCCCTTCTGTCAAGAAACACTG 

TTTGTGGGAAACGATCAAGTATCTGAGATCTAACTACAGCAGGCATCACTTTGCCATTCCGTA 

TTTTTCGTCTCTAAATTATAAATATACAAATATATATATTATAAATATAACCTTGTGTAACCC 

TGACTTAATGAGAAACATTTTCAGCTTTTTTTCCTATGAATTGTCAACATCTTTTTTACAAGT 

GTGGTTTAAAAAAAAAAAAACTTTACAGAATGATCTGTGGCTTTATAAAATAAAGGTATTTCT 
AAGCAAAAjUU^AA7\AAAAA7^ 
FIGURE 128

MKRSLQALYCQLLSFLLILALTEALAFAIQEPSPRESLQVLPSGTPPGTMVTAPHSSTRHTSV 

VMLTPNPDGPPSQAAAPMATLTPRAEGHPPTHTISTIAATVTAPYSESSLSTGPAPAAMATTS 

SKPEGRPRGQAAPTILLTKPPGATSRPTTAPPRTTTRRPPRPPGSSRKGAGNSSRPVPPAPGG 

HSRSKEGQRGRNPSSTPLGQKRPLGKIFQIYKGNFTGSVEPEPSTLTPRTPLWGYSSSPQPQT 

VAATTVPSNTSWAPTTTSLGPAKDKPGLRRAAQGGGSTFTSQGGTPDATAASGAPVSPQAAPV 

PSQRPHHGDPQDGPSHSDSWLTVTPGTSRPLSTSSGVFTAATGPTPAAFDTSVSAPSQGIPQG 

ASTTPQAPTHPSRVSESTISGAKEETVATLTMTDRVPSPLSTVVSTATGNFLNRLVPAGTWKP 

GTAGNISHVAEGDKPQHRATICLSKMDIAWVILAISVPISSCSVLLTVCCMKRKKKTANPENN 

LSYWNNTITMDYFNRHAVELPREIQSLETSEDQLSEPRSPANGDYRDTGMVLVNPFCQETLFV 
GNDQVSEI

Important features of the protein:
Signal peptide:

amino acids 1-28 

Transmembrane domain:

amino acids 469-487

N-glycosylation sites. 

amino acids 178-182, 223-227, 261-265, 446-450, 504-508, 509-513 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 495-499 

N-myristoylation sites. 

amino acids 44-50, 48-54, 175-181, 222-228, 279-285, 286-292, 
288-294, 296-302, 351-357, 374-380, 427-433, 442-448

TonB-dependent receptor proteins signature 1. 

amino acids 1-44 
FIGURE 129

AGGCGAGGCGCGGCGCCGCTGCACACACGCACACGGAGCTATGGGGTGCCATGTTGCCACCAG

CTGCCACGTGGCCTGGCTTTTGGTGCTGATCTCTGGATGCTGGGGCCAGGTGAACCGGCTGCC 

CTTCTTCACCAACCACTTCTTTGATACATACCTGCTGATCAGCGAGGACACGCCTGTGGGTTC 

TTCTGTGACCCAGTTGCTGGCCCAAGACATGGACAATGACCCCCTGGTGTTTGGCGTGTCTGG 

GGAGGAGGCCTCTCGCTTCTTTGCAGTGGAGCCTGACACTGGCGTGGTGTGGCTCCGGCAGCC 

ACTGGACAGAGAGACCAAGTCAGAGTTCACCGTGGAGTTCTCTGTCAGCGACCACCAGGGGGT 

GATCACACGGAAGGTGAACATCCAGGTCGGGGATGTGAATGACAACGCGCCCACATTTCACAA 

TCAGCCCTACAGCGTCCGCATCCCTGAGAATACACCAGTGGGGACGCCCATCTTCATCGTGAA 

TGCCACAGACCCCGACTTGGGGGCAGGGGGCAGCGTCCTCTACTCCTTCCAGCCCCCCTCCCA 

ATTCTTCGCCATTGACAGCGCCCGCGGTATCGTCACAGTGATCCGGGAGCTGGACTACGAGAC 

CACACAGGCCTACCAGCTCACGGTCAACGCCACAGATCAAGACAAGACCAGGCCTCTGTCCAC 

CCTGGCCAACTTGGCCATCATCATCACAGATGTCCAGGACATGGACCCCATCTTCATCAACCT 

GCCTTACAGCACCAACATCTACGAGCATTCTCCTCCGGGCACGACGGTGCGCATCATCACCGC 

CATAGACCAGGATAAAGGACGTCCCCGGGGCATTGGCTACACCATCGTTTCAGGGAATACCAA 

CAGCATCTTTGCCCTGGACTACATCAGCGGAGTGCTGACCTTGAATGGCCTGCTGGACCGGGA 

GAACCCCCTGTACAGCCATGGCTTCATCCTGACTGTGAAGGGCACGGAGCTGAACGATGACCG 

CACCCCATCTGACGCTACAGTCACCACGACCTTCAATATCCTGGTTATTGACATCAATGACAA 

TGCCCCGGAGTTCAACAGCTCCGAGTACAGCGTGGCCATCACTGAGCTGGCACAGGTCGGCTT 

TGCCCTTCCACTCTTCATCCAGGTGGTGGACAAGGATGAGAATTTGGGCCTGAACAGCATGTT 

TGAGGTGTACTTGGTGGGGAACAACTCCCACCACTTCATCATCTCCCCGACCTCCGTCCAGGG 

GAAGGCGGACATTCGTATTCGGGTGGCCATCCCACTGGACTACGAGACCGTGGACCGCTACGA 

CTTTGATCTCTTTGCCAATGAGAGTGTGCCTGACCATGTGGGCTATGCCAAGGTGAAGATCAC 

TCTCATCAATGAAAATGACAACCGGCCCATCTTCAGCCAGCCACTGTACAACATCAGCCTGTA 

CGAGAACGTCACCGTGGGGACCTCTGTGCTGACAGTCCTGGTGAGTCCCCGCTTCACTGCAGG 

GCCACTGAGCTCTCCAGGGCCGACTGTGGTGAGGCACCCAGAGGGATTTTGTCCAAGGGACCT 

CAGCAATCAGGGAAGGAGGCACCCCCAAATCCCTGAGCTGTGTTTGTTGGTGTATTAAATAAA 

GTTTTTGGACTCTTCAGGAAGGGGCTCCCTTGACCTAGGTTGCAATATGGAAAAGGAGCCAAC 

CTGAGGGGTGACGAGACTGAGCTGAGGACACTGGTTTTCTGCCTTTCCCTGAGAGAGACTCAG 

TGAGGGTGGGCTGGGAGCCCTGGAAGCCCCCTCAAATGGGTGGGAAGGTGCCAGCCATCCTTG 

AGAAGGGCTy^CCCTCTCCATGTGAGCACAGGCACCAGAGAGGGGCAGGCGCCTGGAGGGTACC 

GGGGCACCCCCAGCTGCCCATGGCTGGACTTGCCCTTTGACAAGGGGCCCTCCCAGTGTCATT 

TGTATCTGTCAGTACTCTTGGTTGCAAGGGACAGAAACCCTTAAGTAGTTCAAGCAAAAAAGG 

ATTGGCTCATGTAACTCAAAAGTATAAGTGATTTCAGGCCGGGCTCGGTGGCTCACGCCTGTC 

ATCCAACACCTTGAGAAAGCCGAGGTGGGCGGATCACTTGAGGTCGGGAGTTTGAGACCAGCC 

TGGCCAACATGGCAAAACCCCGTCTCTACTAAAAATACAAAAATTAGCCGGGTGTGGTGGCAC 

ACGCCTGTAGTCCCAGCTACTAGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAGGCGG 

AGGTTGCAGTGAGCCGAGATTGTGTCACTGCCCTCCAGCCTGGGCGACAGAGCCAGATTCTGT 
CTC 
FIGURE 130

MGCHVATSCHVAWLLVLISGCWGQVNRLPFFTNHFFDTYLLISEDTPVGSSVTQLLAQDMDND 
PLVFGVSGEEASRFFAVEPDTGVVWLRQPLDRETKSEFTVEFSVSDHQGVITRKVNIQVGDVN
DNAPTFHNQPYSVRIPENTPVGTPIFIVNATDPDLGAGGSVLYSFQPPSQFFAIDSARGIVTV 
IRELDYETTQAYQLTVNATDQDKTRPLSTLANLAIIITDVQDMDPIFINLPYSTNIYEHSPPG 
TTVRIITAIDQDKGRPRGIGYTIVSGNTNSIFALDYISGVLTLNGLLDRENPLYSHGFILTVK 
GTELNDDRTPSDATVTTTFNILVIDINDNAPEFNSSEYSVAITELAQVGFALPLFIQVVDKDE 
NLGLNSMFEVYLVGNNSHHFIISPTSVQGKADIRIRVAIPLDYETVDRYDFDLFANESVPDHV 

GYAKVKITLINENDNRPIFSQPLYNISLYENVTVGTSVLTVLVSPRFTAGPLSSPGPTVVRHP
EGFCPRDLSNQGRRHPQIPELCLLVY 

Important features of the protein:
Signal peptide: 
amino acids 1-23 
Transmembrane domain: 
amino acids 355-374 
N-glycosylation sites. 

amino acids 155-159, 206-210, 349-353, 393-397, 434-438, 466-470, 
472-476 

N-myristoylation sites. 

amino acids 2-8, 49-55, 162-168, 270-276, 278-284, 316-322 
Amidation site.
amino acids 515-519 

Prokaryotic membrane lipoprotein lipid attachment site.

amino acids 11-22 
Leucine zipper pattern. 

amino acids 298-320 

PTS HPR component serine phosphorylation site signature. 

amino acids 377-393 

Cadherins extracellular repeated domain signature. 

amino acids 120-131, 336-347 
Cadherins extracellular 

amino acids 120-144, 336-360 
FIGURE 131

GTGGGCCGCCCCTGCTGCTGCCGTCCATGCTGATGTTTGCGGTGATCGTGGCCTCCAGCGGGC 

TGCTGCTCATGATCGAGCGGGGCATCCTGGCCGAGATGAAGCCCCTGCCCCTGCACCCGCCCG 

GCCGCGAGGGCACAGCCTGGCGCGGGAAAGCCCCCAAGCCTGGGGGCCTGTCCCTCAGGGCTG 

GGGACGCGGACTTGCAAGTGCGGCAGGACGTCCGGAACAGGACCCTGCGGGCGGTGTGCGGAC 

AGCCAGGCATGCCCCGGGACCCCTGGGACTTGCCGGTGGGGCAGCGGCGCACCCTGCTGCGCC 

ACATCCTCGTAAGTGACCGTTACCGCTTCCTCTACTGCTACGTCCCCAAGGTGGCCTGCTCTA 

ACTGGAAGCGGGTGATGAAGGTGCTGGCAGGCGTCCTGGACAGCGTGGACGTCCGCCTCAAGA 

TGGACCACCGCAGTGACCTGGTGTTCCTGGCCGACCTGCGGCCTGAGGAGATTCGCTACCGCC 

TGCAGCACTACTTTAAGTTCCTGTTTGTGCGGGAGCCCTTGGAACGCCTCCTCTCTGCCTACC 

GCAACAAGTTTGGCGAGATCCGAGAGTACCAGCAACGCTATGGGGCTGAGATAGTGAGGCGGT 

ACAGGGCTGGAGCGGGGCCCAGCCCTGCAGGCGACGATGTCACATTCCCCGAGTTCCTGAGAT 

ACCTGGTGGATGAGGACCCTGAGCGCATGAATGAGCATTGGATGCCCGTGTACCACCTGTGCC 

AGCCTTGTGCCGTGCACTATGACTTTGTGGGCTCCTATGAGAGGCTGGAGGCTGATGCAAATC 

AGGTGCTGGAGTGGGTACGGGCACCACCTCACGTCCGATTTCCAGCTCGCCAGGCCTGGTACC 

GGCCAGCCAGCCCCGAAAGCCTGCATTACCACTTGTGCAGTGCCCCCCGGGCCCTGCTGCAGG 

ATGTGCTGCCTAAGTATATCCTGGACTTCTCCCTCTTTGCCTACCCACTGCCTAATGTCACCA 

AGGAGGCGTGTCAGCAGSSaCCATGGGTGTGGGGCCAGCAGCTGGTGGGGACTGGTTTCAACG 

CCAGCTTTCTGTGCTTCTGCCTGTCATTCGGAGAAACTCTGGCTCTGGGGCTTGGGGCTTCTC 

AGGATCCTGGATGGCAGAGACTGCCCTCAGAAGTTCCTTGTCCAGGGTGGGCACCCACAGTGA 

CTCAGAGGACAGGGCTAGGCAGGAGACCTGCTGCTCCTCATTGGGGGGATCTCTTGGGGGGCA 

GACACCAGTTTGCCAATGAAGCAACACATCTGATCTAAAGACTGGCTCCAGACCCCGGGCTGC 

CAGGATTATGCAGTCCACTTGGTCTACCTTAATTTAACCTGTGGCCAAACTCAGAGATGGTAC 

CAGCCAGGGGCAAGCATGACCAGAGCCAGGGACCCTGTGGCTCTGATCCCCCATTTATCCACC 

CCATGTGCCTCAGGACTAGAGTGAGCAATCATACCTTATAAATGACTTTTGTGCCTTTCTGCT

CCAGTCTCAAAATTTCCTACACCTGCCAGTTCTTTACATTTTTCCAAGGAAAGGAAAACGGAA 

GCAGGGTTCTTGCCTGGTAGCTCCAGGACCCAGCTCTGCAGGCACCCAAAGACCCTCTGTGCC 

CAGCCTCTTCCTTGAGTTCTCGGAACCTCCTCCCTAATTCTCCCTTCCTTCCCCACAAGGCCT 

TTGAGGTTGTGACTGTGGCTGGTATATCTGGCTGCCATTTTTCTGATGCATTTATTTAAAATT 

TGTACTTTTTGATAGAACCCTTGTAAGGGCTTTGTTTTCCTAATAGCTGACTTTTTAATAAAG 
CAGTTTTATATAT 
FIGURE 132

MLMFAVIVASSGLLLMIERGILAEMKPLPLHPPGREGTAWRGKAPKPGGLSLRAGDADLQVRQ 
DVRNRTLRAVCGQPGMPRDPWDLPVGQRRTLLRHILVSDRYRFLYCYVPKVACSNWKRVMKVL 
AGVLDSVDVRLKMDHRSDLVFLADLRPEEIRYRLQHYFKFLFVREPLERLLSAYRNKFGEIRE 
YQQRYGAEIVRRYRAGAGPSPAGDDVTFPEFLRYLVDEDPERMNEHWMPVYHLCQPCAVHYDF 
VGSYERLEADANQVLEWVRAPPHVRFPARQAWYRPASPESLHYHLCSAPRALLQDVLPKYILD 
FSLFAYPLPNVTKEACQQ 

Important features of the protein:
Signal peptide: 

amino acids 1-23 

N-glycosylation sites.

amino acids 67-71, 325-329 

Tyrosine kinase phosphorylation sites. 

amino acids 152-159, 183-183 

N-myristoylation sites. 

amino acids 89-95, 128-134 
FIGURE 133

CGGCAGTTCTGGCCCCTGCAGCTGGAGGTACCCTGAGTTCTGAGGGTCGTAGTGCTGTTTCTG 

GTATTCTCATCGCGGTCACCTCTACCGGTGTGGACAAGTAAAGTTTGAATCAGCTTCTCCATG 

GCCTGGGCACCAGTTCCCGGCTGAGCCATTTTCCTTTTGGCTAAAAGTCCCCGCCCAGAGGCC 

AATTCGTCGCGGCGGCGGTGGAGATCGCAGGTCGCTCAGGCTTGCAGATGGGTCAAGGGTTGT 

GGAGAGTGGTCAGAAACCAGCAGCTGCAACAAGAAGGCTACAGTGAGCAAGGCTACCTCACCA 

GAGAGCAGAGCAGGAGAATGGATGCGAGCAACATTTCTAACACCAATCATCGTAAACAAGTCC 

AAGGAGGCATTGACATATATCATCTTTTGAAGGCAAGGAAATCGAAAGAACAGGAAGGATTCA 

TTAATTTGGAAATGTTGCCTCCTGAGCTAAGCTTTACCATCTTGTCCTACCTGAATGCAACTG 

ACCTTTGCTTGGCTTCATGTGTTTGGCAGGACCTTGCGAATGATGAACTTCTCTGGCAAGGGT 

TGTGCAAATCCACTTGGGGTCACTGTTCCATATACAATAAGAACCCACCTTTAGGATTTTCTT 

TTAGAAAATTGTATATGCAGCTGGATGAAGGCAGCCTCACCTTTAATGCCAACCCAGATGAGG 

GAGTGAACTACTTTATGTCCAAGGGTATCCTGGATGATTCGCCAAAGGAAATAGCAAAGTTTA 

TCTTCTGTACAAGAACACTAAATTGGAAAAAACTGAGAATCTATCTTGATGAAAGGAGAGATG 

TCTTGGATGACCTTGTAACATTGCATAATTTTAGAAATCAGTTCTTGCCAAATGCACTGAGAG 

AATTTTTTCGTCATATCCATGCCCCTGAAGAGCGTGGAGAGTATCTTGAAACTCTTATAACAA 

AGTTCTCACATAGATTCTGTGCTTGCAACCCTGATTTAATGCGAGAACTTGGCCTTAGTCCTG 

ATGCTGTCTATGTACTGTGCTACTCTTTGATTCTACTTTCCATTGACCTCACTAGCCCTCATG 

TGAAGAATAAAATGTCAAAAAGGGAATTTATTCGAAATACCCGTCGCGCTGCTCAAAATATTA 

GTGAAGATTTTGTAGGGCATCTTTATGACAATATCTACCTTATTGGCCATGTGGCTGCATAAA

AAGCACAATTGCTAGGACTTCAGTTTTTACTTCAGACTAAAGCTACCCAAGGACTTAGCAGAT 

ATGGGGGTTACATCAGTGCTGGTCATTGTAGCCTGAGTATACAATCAAGCTTCAGTGTGCAAC 

CTTTTTTTCTTTTGCCATTTTCTATTTTAGTAATTTCCTTGGGGAACTAAATAATTTTGCAGA 

ATTTTTCCTAATTTTGTTTATCACGTTTTGCACAAAGCAGAGCCACTGTCTAACACAGCTGTT 

AACGAATGATAAACTGACATTATACTCTAAAAGATGGTGTATTTGTGCATTAGATTTGCCTGA 

AAAACTTTATCCATTTCCATTCTTTATACAAATACCATGTAATGTGTACATATTTAACTAAAG 

AGATTTATAGTCATAATTATTTTATTGTAAAGATTTTAACTAAAGTTTTTCCTTTTCTCTC 
FIGURE 134

MGQGLWRVVRNQQLQQEGYSEQGYLTREQSRRMDASNISNTNHRKQVQGGIDIYHLLKARKSK

EQEGFINLEMLPPELSFTILSYLNATDLCLASCVWQDLANDELLWQGLCKSTWGHCSIYNKNP 

PLGFSFRKLYMQLDEGSLTFNANPDEGVNYFMSKGILDDSPKEIAKFIFCTRTLNWKKLRIYL

DERRDVLDDLVTLHNFRNQFLPNALREFFRHIHAPEERGEYLETLITKFSHRFCACNPDLMRE 

LGLSPDAVYVLCYSLILLSIDLTSPHVKNKMSKREFIRNTRRAAQNISEDFVGHLYDNIYLIG 
HVAA 

Important features of the protein:
Transmembrane domain:

amino acids 253-272

N-glycosylation sites. 

amino acids 37-41, 87-91, 298-302 



N-myristoylation site. 

amino acids 110-116 
FIGURE 135

GGCACGAGGGAGCCTCCGTTAGGGGGTGGGAAAGGACTTTGCCATAGGTCGCTGAGGCCACCA 

TCTGCTCTCTTACTGGCCAAGGGCGTAAAAAGATAGTCTTCCCATTAGCTAGAGAGCAAACCC 

CAGAAAGCCTATTGGCTGCGCCGTCCGCGGGCCTTGGTCCGCTTTGAAGGCGGGCTGCGGCTG 

CGAGAGGAGGGCGGGCGGGAGGCTAGCTGTTGTCGTGGTTGCTCGGAGGCACGTGTGCAGTCC 

CGGAAGCGGCGAGGGGAAACTGCTCCGCGCGCGCCGCGGGAGGAGGAACCGCCCGGTCCTTTA 

GGGTCCGGGCCCGGCCGGGCCATGGATTCAATGCCTGAGCCCGCGTCCCGCTGTCTTCTGCTT 

CTTCCCTTGCTGCTGCTGCTGCTGCTGCTGCTGCCGGCCCCGGAGCTGGGCCCGAGCCAGGCC 

GGAGCTGAGGAGAACGACTGGGTTCGCCTGCCCAGCAAATGCGAAGTGTGTAAATATGTTGCT 

GTGGAGCTGAAGTCAGCCTTTGAGGAAACCGGCAAGACCAAGGAGGTGATTGGCACGGGCTAT 

GGCATCCTGGACCAGAAGGCCTCTGGAGTCAAATACACCAAGTCGGACTTGCGGTTAATCGAA 

GTCACTGAGACCATTTGCAAGAGGCTCCTGGATTATAGCCTGCACAAGGAGAGGACCGGCAGC

AATCGATTTGCCAAGGGCATGTCAGAGACCTTTGAGACATTACACAACCTGGTACACAAAGGG 

GTCAAGGTGGTGATGGACATCCCCTATGAGCTGTGGAACGAGACTTCTGCAGAGGTGGCTGAC 

CTCAAGAAGCAGTGTGATGTGCTGGTGGAAGAGTTTGAGGAGGTGATCGAGGACTGGTACAGG 

AACCACCAGGAGGAAGACCTGACTGAATTCCTCTGCGCCAACCACGTGCTGAAGGGAAAAGAC 

ACCAGTTGCCTGGCAGAGCAGTGGTCCGGCAAGAAGGGAGACACAGCTGCCCTGGGAGGGAAG 

AAGTCCAAGAAGAAGAGCAGCAGGGCCAAGGCAGCAGGCGGCAGGAGTAGCAGCAGCAAACAA 

AGGAAGGAGCTGGGTGGCCTTGAGGGAGACCCCAGCCCCGAGGAGGATGAGGGCATCCAGAAG 

GCATCCCCTCTCACACACAGCCCCCCTGATGAGCTCTCAGCCCACCCAGCATCCTCTGTCCTG 

AGACCCCTGATTTTGAAGCTGAGGAGTCAGGGGCATGGCTCTGGCAGGCCGGGATGGCCCCGC 

AGCCTTCAGCCCCTCCTTGCCTTGGCTGTGCCCTCTTCTGCCAAGGAAAGACACAAGCCCCAG

GAAGAACTCAGAGCCGTCATGGGTAGCCCACGCCGTCCTTTCCCCTCCCCAAGTGTTTCTCTC 

CTGACCCAGGGTTCAGGCAGGCCTTGTGGTTTCAGGACTGCAAGGACTCCAGTGTGAACTCAG 

GAGGGGCAGGTGTCAGAACTGGGCACCAGGACTGGAGCCCCCTCCGGAGACCAAACTCACCAT 

CCCTCAGTCCTCCCCAACAGGGTACTAGGACTGCAGCCCCCTGTAGCTCCTCTCTGCTTACCC 

CTCCTGTGGACACCTTGCACTCTGCCTGGCCCTTCCCAGAGCCCAAAGAGTAAAAATGTTCTG 
GTTCTGATTTCTGAAAAAAAAAAAAAAAAAAATTCCT 



FIGURE 136



MDSMPEPASRCLLLLPLLLLLLLLLPAPELGPSQAGAEENDWVRLPSKCEVCKYVAVELKSAF 
EETGKTKEVIGTGYGILDQKASGVKYTKSDLRLIEVTETICKRLLDYSLHKERTGSNRFAKGM 
SETFETLHNLVHKGVKVVMDIPYELWNETSAEVADLKKQCDVLVEEFEEVIEDWYRNHQEEDL 
TEFLCANHVLKGKDTSCLAEQWSGKKGDTAALGGKKSKKKSSRAKAAGGRSSSSKQRKELGGL 
EGDPSPEEDEGIQKASPLTHSPPDEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

N-glycosylation site. 

amino acids 153-157 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 227-231, 228-232 

Tyrosine kinase phosphorylation site. 

amino acids 142-150 

N-myristoylation sites. 

amino acids 36-42, 74-80, 86-92, 125-131, 222-228, 237-243, 
250-256, 263-269 

Amidation sites. 

amino acids 212-216, 222-226 

ATP/GTP-binding site motif A (P-loop).

amino acids 62-70 
FIGURE 137



CACGCCTCCCGCTGCCAGCCCGGCACCGGGATCTTAATCAGTCACTATGAAAACTCATTAGCT 

CCACAGCAATGAGTCCTCCACTGCTGAAGCTTGGCGCTGTGCTTAGTACCATGGCAATGATCT 

CAAACTGGATGTCCCAAACTCTCCCATCCTTGGTGGGACTGAACACCACGAGGCTGTCGACTC 

CGGATACCTTAACTCAGATTAGTCCTAAAGAAGGGTGGCAGGTGTACAGCTCAGCTCAGGATC 

CTGATGGGCGGTGCATTTGCACAGTTGTTGCTCCAGAACAAAACCTGTGTTCCCGGGATGCCA 

AAAGCAGGCAACTTCGCCAACTACTGGAAAAGGTTCAGAACATGTCCCAGTCTATTGAAGTCT 

TAAACTTGAGAACTCAGAGAGATTTCCAATATGTTTTAAAAATGGAAACCCAAATGAAAGGGC 

TGAAGGCAAAATTTCGGCAGATTGAAGATGATCGAAAGACACTTATGACCAAGCATTTTCAGG 

AGTTGAAAGAGAAAATGGACGAGCTCCTGCCTTTGATCCCCGTGCTGGAACAGTACAAAACAG 

ATGCTAAGTTAATCACCCAGTTCAAGGAGGAAATAAGGAATCTGTCTGCTGTCCTCACTGGTA 

TTCAGGAGGAAATTGGTGCCTATGACTACGAGGAACTACACCAAAGAGTGCTGAGCTTGGAAA

CAAGACTTCGTGACTGCATGAAAAAGCTAACATGTGGCAAACTGATGAAAATCACAGGCCCAG 

TTACAGTCAAGACATCTGGAACCCGATTTGGTGCTTGGATGACAGACCCTTTAGCATCTGAGA 

AAAACAACAGAGTCTGGTACATGGACAGTTATACTAACAATAAAATTGTTCGTGAATACAAAT 

CAATTGCAGACTTTGTCAGTGGGGCTGAATCAAGGACATACAACCTTCCTTTCAAGTGGGCAG 

GAACTAACCATGTTGTCTACAATGGCTCACTCTATTTTAACAAGTATCAGAGTAATATCATCA 

TCAAATACAGCTTTGATATGGGGAGAGTGCTTGCCCAACGAAGCCTGGAGTATGCTGGTTTTC 

ATAATGTTTACCCCTACACATGGGGTGGATTCTCTGACATCGACCTAATGGCTGATGAAATCG 

GGCTGTGGGCTGTGTATGCAACTAACCAGAATGCAGGCAATATTGTCATCAGCCAACTTAACC 

AAGATACCTTGGAGGTGATGAAGAGCTGGAGCACTGGCTACCCCAAGAGAAGTGCAGGGGAAT 

CTTTCATGATCTGTGGGACACTGTATGTCACCAACTCCCACTTAACTGGAGCCAAGGTGTATT 

ATTCCTATTCCACCAAAACCTCCACATATGAGTACACAGACATTCCCTTCCATAACCAATACT 

TTCACATATCCATGCTTGACTACAATGCAAGAGATCGAGCTCTCTATGCCTGGAACAATGGCC 

ACCAGGTGCTGTTCAATGTCACCCTTTTCCATATCATCAAGACAGAGGATGACACATAGGCAA 

ATGTGACATGTTTTCATTGATTTAAACAGTGTGATTTGTGATAAACTCTATAAGACCCCTTCC 

GTTTTTTTCTTCACTATTATTTTTCATCATTTCTCCAAAGCTAAGCATTTTTATTGTAAAGTT

GGTGTTTCAAAAACATAGCTGAGCTTGTCTAACTTACCATGTTGGAAACACATCTTAACTTCT 

AAATTTACAAGGCCTATCATGTCCTTGTCATGAAAAGCACTAAAAAAAAAAAAGAGTTTAAGT 

GGCTAAAGTCATAGTTTTGCAAGAGATTAATGATCTGCCTTATATTAGAGTCAGAGACTAATG 

GTGGCTTAAATGCACGAATGTCTTTTTTTTTAAAACTGTCATTTTTTACTGTCTTTTGCTCCA 

TCTCAGGAAATATTTTGGTAGGAATTAGGAGAACAAAAAGCACTTTTATCCCATTTATTTCTT

TAAAAAATGTAAGGATTTCATTTATATTGAAAAATAATATTAATCATTTTGCTGTTAACACAA 

TTCTCTGATGCGGTGCTGTACAGTCATTTTTAAATCTCTTGCTAACATTTTATTGGCAGTATG 

TATTTCTACCATTGTAACCACCATTGTGCTATTGTATCTCTTCACTTCTGTGAAAGTAATATT 

TTTTATAAAANACACTGNAATTTTAAA7U\AAAAAAAAAAAAACAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 138

MSPPLLKLGAVLSTMAMISNWMSQTLPSLVGLNTTRLSTPDTLTQISPKEGWQVYSSAQDPDG 
RCICTVVAPEQNLCSRDAKSRQLRQLLEKVQNMSQSIEVLNLRTQRDFQYVLKMETQMKGLKA 
KFRQIEDDRKTLMTKHFQELKEKMDELLPLIPVLEQYKTDAKLITQFKEEIRNLSAVLTGIQE
EIGAYDYEELHQRVLSLETRLRDCMKKLTCGKLMKITGPVTVKTSGTRFGAWMTDPLASEKNN 
RVWYMDSYTNNKIVREYKSIADFVSGAESRTYNLPFKWAGTNHVVYNGSLYFNKYQSNIIIKY 
SFDMGRVLAQRSLEYAGFHNVYPYTWGGFSDIDLMADEIGLWAVYATNQNAGNIVISQLNQDT
LEVMKSWSTGYPKRSAGESFMICGTLYVTNSHLTGAKVYYSYSTKTSTYEYTDIPFHNQYFHI 
SMLDYNARDRALYAWNNGHQVLFNVTLFHIIKTEDDT

Important features of the protein: 
Signal peptide:

amino acids 1-16 

N-glycosylation sites. 

amino acids 33-37, 95-99, 179-183, 299-303, 465-469 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 215-219 

Tyrosine kinase phosphorylation site. 

amino acids 106-114 

N-myristoylation sites.

amino acids 9-15, 31-37, 235-241, 239-245 
FIGURE 139

GAAGCAGTGCAGAGAGGAGAGCGGAGCGGAGCTGCCGCTGAGCAAAGGCCTTCACCATGGCCG 

AGTCCCCCGGCTGCTGCTCCGTCTGGGCCCGCTGCCTCCACTGCCTGTATAGCTGCCACTGGA 

GGAAATGCCCCAGAGAGAGGATGCAAACCAGCAAGTGCGACTGTATCTGGTTTGGCCTGCTCT 

TCCTCACCTTCCTCCTTTCCCTGAGCTGGCTGTACATCGGGCTCGTCCTTCTCAATGACCTGC 

ACAACTTCAATGAATTCCTCTTCCGCCGCTGGGGACACTGGATGGACTGGTCCCTGGCATTCC 

TGCTGGTCATCTCTCTACTGGTCACATATGCATCCTTGCTATTGGTCCTGGCCCTGCTCCTGC 

GGCTTTGTAGACAGCCCCTGCATCTGCACAGCCTCCACAAGGTGCTGCTGCTCCTCATTATGC 

TGCTTGTGGCGGCTGGCCTTGTGGGACTGGACATCCAATGGCAGCAGGAGTGGCATAGCTTGC 

GTGTGTCACTGCAGGCCACAGCCCCATTCCTTCATATTGGAGCAGCCGCTGGAATTGCCCTCC 

TGGCCTGGCCTGTGGCTGATACCTTCTACCGTATCCACCGAAGAGGTCCCAAGATTCTGCTAC 

TGCTCCTATTTTTTGGAGTTGTCCTGGTCATCTACTTGGCCCCCCTATGCATCTCCTCACCCT 

GCATCATGGAACCCAGAGACTTACCACCCAAGCCTGGGCTGGTGGGACACCGAGGGGCCCCCA 

TGCTGGCTCCCGAGAACACCCTGATGTCCTTGCGGAAGACAGCTGAATGCGGAGCTACTGTGT 

TTGAGACTGATGTGATGGTCAGCTCCGATGGGGTCCCCTTCCTCATGCATGATGAGCACCTCA

GCAGGACCACGAATGTAGCCTCTGTATTCCCAACCCGAATCACAGCCCACAGCAGTGACTTCT 

CCTGGACTGAACTGAAGAGACTCAATGCTGGATCCTGGTTCCTAGAGAGGCGACCCTTCTGGG 

GGGCCAAACCGCTGGCAGGCCCTGATCAGAAAGAGGCTGAGAGTCAGACGGTACCAGCATTAG 

AAGAGCTATTGGAGGAAGCTGCAGCCCTCAACCTTTCCATCATGTTCGACTTGCGCCGACCCC 

CACAGAACCACACATACTATGACACTTTTGTGATCCAGACATTGGAGACTGTGCTGAATGCAA 

GGGTGCCCCAAGCCATGGTCTTTTGGCTACCAGATGAAGATCGGGCTAATGTCCAACGACGGG 

CACCTGGAATGCGCCAGATATATGGACGTCAGGGAGGCAACAGAACGGAGAGGCCCCAGTTTC 

TTAACCTCCCCTATCAAGATCTGCCACTATTGGATATCAAGGCATTGCATAAGGATAATGTCT 

CGGTGAACCTATTTGTAGTGAACAAGCCCTGGCTCTTCTCTCTGCTTTGGTGTGCAGGGGTGG 

ATTCGGTCACCACCAACGACTGCCAGCTGCTGCAGCAGATGCGTTACCCTATCTGGCTTATTA 

CCCCTCAAACCTACCTAATCATATGGGTCATTACCAATTGTGTTTCCACCATGCTGCTTTTGT

GGACCTTCCTCCTCCAAAGGAGATTTGTTAAGAAGAGAGGGAAAACTGGCTTAGAAACAGCAG 

TGCTGCTGACAAGGATCAACAATTTCATGATGGAGTGAATGCCCTGCCCTGCTTCCCCACCCA 

AGCCAGTCTACATTGCCCAAACAGCAAGGGTTGGAGAGTGGCTTAAGTGGAATGCTTCAGGGG 

TGGTGGGTTGCAAGTGGGGGGAGCTTTGCCAACAGGAGGTTTTGAACCATGAGGGCCCTCTGC 

CCAGGTGATGGGCATTCCCTTU^GCTGCTATGGAATCTGCTCCCTTTGGGGTTTTGACCTGAGA 

TGTTTGGGAAGAGAGTGAGTAATGAGAAGTTTCTCCTCAAATGAAACTAGAACAGAGGAAGTA 
AAAGGGAGATTGCTCGGA 
FIGURE 140

MAESPGCCSVWARCLHCLYSCHWRKCPRERMQTSKCDCIWFGLLFLTFLLSLSWLYIGLVLLN 
DLHNFNEFLFRRWGHWMDWSLAFLLVISLLVTYASLLLVLALLLRLCRQPLHLHSLHKVLLLL
IMLLVAAGLVGLDIQWQQEWHSLRVSLQATAPFLHIGAAAGIALLAWPVADTFYRIHRRGPKI 
LLLLLFFGVVLVIYLAPLCISSPCIMEPRDLPPKPGLVGHRGAPMLAPENTLMSLRKTAECGA 
TVFETDVMVSSDGVPFLMHDEHLSRTTNVASVFPTRITAHSSDFSWTELKRLNAGSWFLERRP 
FWGAKPLAGPDQKEAESQTVPALEELLEEAAALNLSIMFDLRRPPQNHTYYDTFVIQTLETVL 
NARVPQAMVFWLPDEDRANVQRRAPGMRQIYGRQGGNRTERPQFLNLPYQDLPLLDIKALHKD 
NVSVNLFVVNKPWLFSLLWCAGVDSVTTNDCQLLQQMRYPIWLITPQTYLIIWVITNCVSTML
LLWTFLLQRRFVKKRGKTGLETAVLLTRINNFMME 

Important features of the protein: 
Transmembrane domains: 

amino acids 38-60, 83-107, 122-138, 156-173, 189-210, 484-506 
N-glycosylation sites. 

amino acids 349-353, 362-366, 415-419, 442-446 

N-myristoylation sites. 

amino acids 163-169, 413-419, 523-529 

Leucine zipper pattern. 

amino acids 93-115, 109-131 

Glutamine amidotransferases class-II active site.

amino acids 1-13 
FIGURE 141



GCCGCCGGCCCGGGCTGGAGCCGAGCGCAGCAGCCACCGCCGCCGCCGCGCCAGAAGTTTGGGTTGAACCGGAGC 

TGCCGGGAGGAAACTTTTTTCTTTTTTCCCCCTCCCTCCCGGGAGGAGGAGGAGGAGGAGGAGGGGAAGCTGCCG 

CCGGCGCCAAGGCTCGTGGGCTCGGGGTCGGCGCGGCCCGCAGAAGGGGCGGGGGCCTCGCCCCGCGAGGGGAGG 

CGCGCCCCGGGGGCCCCGAGAGGGGCGGTGAGGACCGCGGGCTGCTGGTGCGGCGGCGGCGGCGCGTGTGCCCCG 

CGCAGGGGAGGGCGCCCGCCCCGCTCCCGGCCCGGCTGCGAGGAGGAGGCGGCGGCGGCGCAGGAGGATGTACTT 

GGTGGCGGGGGACAGGGGGTTGGCCGGCTGCGGGCACCTCCTGGTCTCGCTGCTGGGGCTGCTGCTGCTGCTGGC 

GCGCTCCGGCACCCGGGCGCTGGTCTGCCTGCCCTGTGACGAGTCCAAGTGCGAGGAGCCCAGGAACTGCCCGGG 

GAGCATCGTGCAGGGCGTCTGCGGCTGCTGCTACACGTGCGCCAGCCAGAGGAACGAGAGCTGCGGCGGCACCTT 

CGGGATTTACGGAACCTGCGACCGGGGGCTGCGTTGTGTCATCCGCCCCCCGCTCAATGGCGACTCCCTCACCGA 

GTACGAAGCGGGCGTTTGCGAAGATGAGAACTGGACTGATGACCAACTGCTTGGTTTTAAACCATGCAATGAAAA 

CCTTATTGCTGGCTGCAATATAATCAATGGGAAATGTGAATGTAACACCATTCGAACCTGCAGCAATCCCTTTGA 

GTTTCCAAGTCAGGATATGTGCCTTTCAGCTTTAAAGAGAATTGAAGAAGAGAAGCCAGATTGCTCCAAGGCCCG 

CTGTGAAGTCCAGTTCTCTCCACGTTGTCCTGAAGATTCTGTTCTGATCGAGGGTTATGCTCCTCCTGGGGAGTG 

CTGTCCCTTACCCAGCCGCTGCGTGTGCAACCCCGCAGGCTGTCTGCGCAAAGTCTGCCAGCCGGGAAACCTGAA 

CATACTAGTGTCAAAAGCCTCAGGGAAGCCGGGAGAGTGCTGTGACCTCTATGAGTGCAAACCAGTTTTCGGCGT 

GGACTGCAGGACTGTGGAATGCCCTCCTGTTCAGCAGACCGCGTGTCCCCCGGACAGCTATGAAACTCAAGTCAG 

ACTAACTGCAGATGGTTGCTGTACTTTGCCAACAAGATGCGAGTGTCTCTCTGGCTTATGTGGTTTCCCCGTGTG 

TGAGGTGGGATCCACTCCCCGCATAGTCTCTCGTGGCGATGGGACACCTGGAAAGTGCTGTGATGTCTTTGAATG 

TGTTAATGATACAAAGCCAGCCTGCGTATTTAACAATGTGGAATATTATGATGGAGACATGTTTCGAATGGACAA 

CTGTCGGTTCTGTCGATGCCAAGGGGGCGTTGCCATCTGCTTCACTGCCCAGTGTGGTGAGATAAACTGCGAGAG 

GTACTACGTGCCCGAAGGAGAGTGCTGCCCAGTGTGTGAAGATCCAGTGTATCCTTTTAATAATCCCGCTGGCTG 

CTATGCCAATGGCCTGATCCTTGCCCACGGAGACCGGTGGCGGGAAGACGACTGCACATTCTGCCAGTGCGTCAA 

CGGTGAACGCCACTGCGTTGCGACCGTCTGCGGACAGACCTGCACAAACCCTGTGAAAGTGCCTGGGGAGTGTTG 

CCCTGTGTGCGAAGAACCAACCATCATCACAGTTGATCCACCTGCATGTGGGGAGTTATCAAACTGCACTCTGAC 

AGGGAAGGACTGCATTAATGGTTTCAAACGCGATCACAATGGTTGTCGGACCTGTCAGTGCATAAACACCGAGGA 

ACTATGTTCAGAAGGTAAACAAGGCTGCACCTTGAACTGTCCCTTCGGTTTCCTTACTGATGCCCAAAACTGTGA 

GATCTGTGAGTGCCGCCCAAGGCCCAAGAAGTGCAGACCCATAATCTGTGACAAGTATTGTCCACTTGGATTGCT 

GAAGAATAAGCACGGCTGTGACATCTGTCGCTGTAAGAAATGTCCAGAGCTCTCATGCAGTAAGATCTGCCCCTT 

GGGTTTCCAGCAGGACAGTCACGGCTGTCTTATCTGCAAGTGCAGAGAGGCCTCTGCTTCAGCTGGGCCACCCAT 

CCTGTCGGGCACTTGTCTCACCGTGGATGGTCATCATCATAAAAATGAGGAGAGCTGGCACGATGGGTGCCGGGA 

ATGCTACTGTCTCAATGGACGGGAAATGTGTGCCCTGATCACCTGCCCGGTGCCTGCCTGTGGCAACCCCACCAT 

TCACCCTGGACAGTGCTGCCCATCATGTGCAGATGACTTTGTGGTGCAGAAGCCAGAGCTCAGTACTCCCTCCAT 

TTGCCACGCCCCTGGAGGAGAATACTTTGTGGAAGGAGAAACGTGGAACATTGACTCCTGTACTCAGTGCACCTG 

CCACAGCGGACGGGTGCTGTGTGAGACAGAGGTGTGCCCACCGCTGCTCTGCCAGAACCCCTCACGCACCCAGGA 

TTCCTGCTGCCCACAGTGTACAGATCAACCTTTTCGGCCTTCCTTGTCCCGCAATAACAGCGTACCTAATTACTG 

CAAAAATGATGAAGGGGATATATTCCTGGCAGCTGAGTCCTGGAAGCCTGACGTTTGTACCAGCTGCATCTGCAT 

TGATAGCGTAATTAGCTGTTTCTCTGAGTCCTGCCCTTCTGTATCCTGTGAAAGACCTGTCTTGAGAAAAGGCCA 

GTGTTGTCCCTACTGCATAGAAGACAGAATTCCAAAGAAGGTGGTGTGCCACTTCAGTGGGAAGGCCTATGCCGA 

CGAGGAGCGGTGGGACCTTGACAGCTGCACCCACTGCTACTGCCTGCAGGGCCAGACCCTCTGCTCGACCGTCAG 

CTGCCCCCCTCTGCCCTGTGTTGAGCCCATCAACGTGGAAGGAAGTTGCTGCCCAATGTGTCCAGAAATGTATGT 

CCCAGAACCAACCAATATACCCATTGAGAAGACAAACCATCGAGGAGAGGTTGACCTGGAGGTTCCCCTGTGGCC 

CACGCCTAGTGAAAATGATATCGTCCATCTCCCTAGAGATATGGGTCACCTCCAGGTAGATTACAGAGATAACAG 

GCTGCACCCAAGTGAAGATTCTTCACTGGACTCCATTGCCTCAGTTGTGGTTCCCATAATTATATGCCTCTCTAT 

TATAATAGCATTCCTATTCATCAATCAGAAGAAACAGTGGATACCACTGCTTTGCTGGTATCGAACACCAACTAA 

GCCTTCTTCCTTAAATAATCAGCTAGTATCTGTGGACTGCAAGAAAGGAACCAGAGTCCAGGTGGACAGTTCCCA 

GAGAATGCTAAGAATTGCAGAACCAGATGCAAGATTCAGTGGCTTCTACAGCATGCAAAAACAGAACCATCTACA 

GGCAGACAATTTCTACCAAACAGTGTGAAGAAAGGCAACTAGGATGAGGTTTCAAAAGACGGAAGACGACTAAAT 

CTGCTCTAAAAAGTAAACTAGAATTTGTGCACTTGCTTAGTGGATTGTATTGGATTGTGACTTGATGTACAGCGC 

TAAGACCTTACTGGGATGGGCTCTGTCTACAGCAATGTGCAGAACAAGCATTCCCACTTTTCCTCAAAAAA 
FIGURE 142

MYLVAGDRGLAGCGHLLVSLLGLLLLLARSGTRALVCLPCDESKCEEPRNCPGSIVQGVCGCC 
YTCASQRNESCGGTFGIYGTCDRGLRCVIRPPLNGDSLTEYEAGVCEDENWTDDQLLGFKPCN 
ENLIAGCNIINGKCECNTIRTCSNPFEFPSQDMCLSALKRIEEEKPDCSKARCEVQFSPRCPE 
DSVLIEGYAPPGECCPLPSRCVCNPAGCLRKVCQPGNLNILVSKASGKPGECCDLYECKPVFG
VDCRTVECPPVQQTACPPDSYETQVRLTADGCCTLPTRCECLSGLCGFPVCEVGSTPRIVSRG 
DGTPGKCCDVFECVNDTKPACVFNNVEYYDGDMFRMDNCRFCRCQGGVAICFTAQCGEINCER 
YYVPEGECCPVCEDPVYPFNNPAGCYANGLILAHGDRWREDDCTFCQCVNGERHCVATVCGQT 
CTNPVKVPGECCPVCEEPTIITVDPPACGELSNCTLTGKDCINGFKRDHNGCRTCQCINTEEL
CSEGKQGCTLNCPFGFLTDAQNCEICECRPRPKKCRPIICDKYCPLGLLKNKHGCDICRCKKC
PELSCSKICPLGFQQDSHGCLICKCREASASAGPPILSGTCLTVDGHHHKNEESWHDGCRECY 
CLNGREMCALITCPVPACGNPTIHPGQCCPSCADDFVVQKPELSTPSICHAPGGEYFVEGETW 
NIDSCTQCTCHSGRVLCETEVCPPLLCQNPSRTQDSCCPQCTDQPFRPSLSRNNSVPNYCKND
EGDIFLAAESWKPDVCTSCICIDSVISCFSESCPSVSCERPVLRKGQCCPYCIEDRIPKKVVC
HFSGKAYADEERWDLDSCTHCYCLQGQTLCSTVSCPPLPCVEPINVEGSCCPMCPEMYVPEPT 
NIPIEKTNHRGEVDLEVPLWPTPSENDIVHLPRDMGHLQVDYRDNRLHPSEDSSLDSIASVVV
PIIICLSIIIAFLFINQKKQWIPLLCWYRTPTKPSSLNNQLVSVDCKKGTRVQVDSSQRMLRI 
AEPDARFSGFYSMQKQNHLQADNFYQTV 

Important features of the protein:
Signal peptide:
amino acids 1-34
Transmembrane domain: 
amino acids 940-962 
N-glycosylation sites. 

amino acids 71-75, 113-117, 330-334, 474-478, 746-750 
cAMP- and cGMP-dependent protein kinase phosphorylation site.
amino acids 992-996 
N-myristoylation site. 

amino acids 9-15, 58-64, 61-67, 75-81, 79-85, 362-368, 402-408, 407-413, 
439-445, 492-498, 511-517, 551-557, 558-564, 586-592, 606-612, 625-631, 
845-851 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 52-63, 844-855 
Cell attachment sequence.
amino acids 314-317 
Leucine zipper pattern. 

amino acids 3-25 

Eukaryotic thiol (cysteine) proteases cysteine active site. 

amino acids 57-69 

VWFC domain proteins.

amino acids 448-456, 382-390 

C-terminal cystine knot proteins

amino acids 60-86 
FIGURE 143

GACGTCTGGCCGGCTCCCGGCGAAGGGCAGCGGAGGAGCGGCCCAGAGCGCGCAGCTAGGGCA 

CTGGCGAAACCCCGGGACAGTCCCTCTCCGTGCGGGGGCGGCGCAGAGCAGTCCCATCCCCGG 

GGTCCCGGGCGCGGCTGACTGCCGGCTGGTTCCCTGCGCGCAGTAGCTCCCCGAGCCGGGCTG 

CACCGGAGGCGGCGAGATGGTCGCGCGCGTCGGCCTCCTGCTGCGCGCCCTGCAGCTGCTACT

GTGGGGCCACCTGGACGCCCAGCCCGCGGAGCGCGGAGGCCAGGAGCTGCGCAAGGAGGCGGA 

GGCATTCCTAGAGAAGTACGGATACCTCAATGAACAGGTCCCCAAAGCTCCCACCTCCACTCG 

ATTCAGCGATGCCATCAGAGCGTTTCAGTGGGTGTCCCAGCTACCTGTCAGCGGCGTGTTGGA

CCGCGCCACCCTGCGCCAGATGACTCGTCCCCGCTGCGGGGTTACAGATACCAACAGTTATGC 

GGCCTGGGCTGAGAGGATCAGTGACTTGTTTGCTAGACACCGGACCAAAATGAGGCGTAAGAA 

ACGCTTTGCAAAGCAAGGTAACAAATGGTACAAGCAGCACCTCTCCTACCGCCTGGTGAACTG 

GCCTGAGCATCTGCCGGAGCCGGCAGTTCGGGGCGCCGTGCGCGCCGCCTTCCAGTTGTGGAG 

CAACGTCTCAGCGCTGGAGTTCTGGGAGGCCCCAGCCACAGGCCCCGCTGACATCCGGCTCAC 

CTTCTTCCAAGGGGACCACAACGATGGGCTGGGCAATGCCTTTGATGGCCCAGGGGGCGCCCT 

GGCGCACGCCTTCCTGCCCCGCCGCGGCGAAGCGCACTTCGACCAAGATGAGCGCTGGTCCCT 

GAGCCGCCGCCGCGGGCGCAACCTGTTCGTGGTGCTGGCGCACGAGATCGGTCACACGCTTGG 

CCTCACCCACTCGCCCGCGCCGCGCGCGCTCATGGCGCCCTACTACAAGAGGCTGGGCCGCGA 

CGCGCTGCTCAGCTGGGACGACGTGCTGGCCGTGCAGAGCCTGTATGGGAAGCCCCTAGGGGG 

CTCAGTGGCCGTCCAGCTCCCAGGAAAGCTGTTCACTGACTTTGAGACCTGGGACTCCTACAG 

CCCCCAAGGAAGGCGCCCTGAAACGCAGGGCCCTAAATACTGCCACTCTTCCTTCGATGCCAT 

CACTGTAGACAGGCAACAGCAACTGTACATTTTTAAAGGGAGCCATTTCTGGGAGGTGGCAGC 

TGATGGCAACGTCTCAGAGCCCCGTCCACTGCAGGAAAGATGGGTCGGGCTGCCCCCCAACAT 

TGAGGCTGCGGCAGTGTCATTGAATGATGGAGATTTCTACTTCTTCAAAGGGGGTCGATGCTG 

GAGGTTCCGGGGCCCCAAGCCAGTGTGGGGTCTCCCACAGCTGTGCCGGGCAGGGGGCCTGCC 

CCGCCATCCTGACGCCGCCCTCTTCTTCCCTCCTCTGCGCCGCCTCATCCTCTTCAAGGGTGC 

CCGCTACTACGTGCTGGCCCGAGGGGGACTGCAAGTGGAGCCCTACTACCCCCGAAGTCTGCA 

GGACTGGGGAGGCATCCCTGAGGAGGTCAGCGGCGCCCTGCCGAGGCCCGATGGCTCCATCAT 

CTTCTTCCGAGATGACCGCTACTGGCGCCTCGACCAGGCCAAACTGCAGGCAACCACCTCGGG 

CCGCTGGGCCACCGAGCTGCCCTGGATGGGCTGCTGGCATGCCAACTCGGGGAGCGCCCTGTT 

CTGAAGGCACCTCCTCACCTCAGAAACTGGTGGTGCTCTCAGGGCAAAATCATGTTCCCCACC 

CCCGGGGCAGAACCCCTCTTAGAAGCCTCTGAGTCCCTCTGCAGAAGACCGGGCAGCAAAGCC 

TCCATCTGGAAGTCTGTCTGCCTTTGTTCCTTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 144

MVARVGLLLRALQLLLWGHLDAQPAERGGQELRKEAEAFLEKYGYLNEQVPKAPTSTRFSDAI 
RAFQWVSQLPVSGVLDRATLRQMTRPRCGVTDTNSYAAWAERISDLFARHRTKMRRKKRFAKQ 
GNKWYKQHLSYRLVNWPEHLPEPAVRGAVRAAFQLWSNVSALEFWEAPATGPADIRLTFFQGD 
HNDGLGNAFDGPGGALAHAFLPRRGEAHFDQDERWSLSRRRGRNLFVVLAHEIGHTLGLTHSP
APRALMAPYYKRLGRDALLSWDDVLAVQSLYGKPLGGSVAVQLPGKLFTDFETWDSYSPQGRR 
PETQGPKYCHSSFDAITVDRQQQLYIFKGSHFWEVAADGNVSEPRPLQERWVGLPPNIEAAAV 
SLNDGDFYFFKGGRCWRFRGPKPVWGLPQLCRAGGLPRHPDAALFFPPLRRLILFKGARYYVL 
ARGGLQVEPYYPRSLQDWGGIPEEVSGALPRPDGSIIFFRDDRYWRLDQAKLQATTSGRWATE 
LPWMGCWHANSGSALF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation sites. 

amino acids 164-168, 355-359 

N-myristoylation sites. 

amino acids 92-98, 153-159, 193-199, 202-208, 288-294, 368-374,
509-515 

Amidation site. 

amino acids 312-316 

Neutral zinc metallopeptidases, zinc-binding region signature.

amino acids 237-247 

Matrixins cysteine switch 

amino acids 231-262, 271-284

Hemopexin domain protein 

amino acids 66-108, 231-262 
FIGURE 145



GCCGGCTAGGGCGCCGGAGCCGCACGCAGCCGCGGGGCTCCGAGAGGCGCGCACTGGGGCTGGGACTGCGCGGCG 

CCGCCGCTGCGAGCGCCACTGAGCGGTCGCGCAACTTCGGAGGCACAGCGCCGGAGCCAGGCGAGCGCTCAGAGA 

CCCGGAGCCAGAGGGGCGCGCCGGAGCCTCGTTCGAGAGCCGGCGCCAGGCACCCACCGCGCTCCGAGTGCCAGG 

CGGCCCTCCGCGCAGCGTGGCTTCCGCTGCCCCCACGGAAGGCACGGGCTGGCGCTGCCGGGCGCCGGGGAGGAC 

GGCGAGGAGGAGGCGGCGGCGGCGGAGACGGCGGCGGCGAGACTGGGGCCAGGGAGACAGCCCTGGGGGAGAGGC 

GCCCGAACCAGGCCGCGGGAGCATGGGGGCCCGGAGCGGAGCTCGGGGCGCGCTGCTGCTGGCACTGCTGCTCTG 

CTGGGACCCGAGGCTGAGCCAAGCAGGCACTGATTCTGGCAGCGAGGTGCTCCCTGACTCCTTCCCGTCAGCGCC 

AGCAGAGCCGCTGCCCTACTTCCTGCAGGAGCCACAGGACGCCTACATTGTGAAGAACAAGCCTGTGGAGCTCCG 

CTGCCGCGCCTTCCCCGCCACACAGATCTACTTCAAGTGCAACGGCGAGTGGGTCAGCCAGAACGACCACGTCAC 

ACAGGAAGGCCTGGATGAGGCCACCGGCCTGCGGGTGCGCGAGGTGCAGATCGAGGTGTCGCGGCAGCAGGTGGA 

GGAGCTCTTTGGGCTGGAGGATTACTGGTGCCAGTGCGTGGCCTGGAGCTCCGCAGGCACCACCAAGAGTCGCCG 

AGCCTACGTCCGCATCGCCTACCTGCGCAAGAACTTCGATCAGGAGCCTCTGGGCAAGGAGGTGCCCCTGGACCA 

TGAGGTTCTCCTGCAGTGCCGCCCGCCGGAGGGGGTGCCTGTGGCCGAGGTGGAATGGCTCAAGAATGAGGATGT 

CATCGACCCCACCCAGGACACCAACTTCCTGCTCACCATCGACCACAACCTCATCATCCGCCAGGCCCGCCTGTC 

GGACACTGCCAACTATACCTGCGTGGCCAAGAACATCGTGGCCAAACGCCGGAGCACCACTGCCACCGTCATCGT 

CTACGTGAATGGCGGCTGGTCCAGCTGGGCAGAGTGGTCACCCTGCTCCAACCGCTGTGGCCGAGGCTGGCAGAA 

GCGCACCCGGACCTGCACCAACCCCGCTCCACTCAACGGAGGGGCCTTCTGCGAGGGCCAGGCATTCCAGAAGAC 

CGCCTGCACCACCATCTGCCCAGTCGATGGGGCGTGGACGGAGTGGAGCAAGTGGTCAGCCTGCAGCACTGAGTG 

TGCCCACTGGCGTAGCCGCGAGTGCATGGCGCCCCCACCCCAGAACGGAGGCCGTGACTGCAGCGGGACGCTGCT 

CGACTCTAAGAACTGCACAGATGGGCTGTGCATGCAAAATAAGAAAACTCTAAGCGACCCCAACAGCCACCTGCT 

GGAGGCCTCAGGGGATGCGGCGCTGTATGCGGGGCTCGTGGTGGCCATCTTCGTGGTCGTGGCAATCCTCATGGC 

GGTGGGGGTGGTGGTGTACCGCCGCAACTGCCGTGACTTCGACACAGACATCACTGACTCATCTGCTGCCCTGAC 

TGGTGGTTTCCACCCCGTCAACTTTAAGACGGCAAGGCCCAGCAACCCGCAGCTCCTACACCCCTCTGTGCCTCC 

TGACCTGACAGCCAGCGCCGGCATCTACCGCGGACCCGTGTATGCCCTGCAGGACTCCACCGACAAAATCCCCAT 

GACCAACTCTCCTCTGCTGGACCCCTTACCCAGCCTTAAGGTCAAGGTCTACAGCTCCAGCACCACGGGCTCTGG 

GCCAGGCCTGGCAGATGGGGCTGACCTGCTGGGGGTCTTGCCGCCTGGCACATACCCTAGCGATTTCGCCCGGGA 

CACCCACTTCCTGCACCTGCGCAGCGCCAGCCTCGGTTCCCAGCAGCTCTTGGGCCTGCCCCGAGACCCAGGGAG 

CAGCGTCAGCGGCACCTTTGGCTGCCTGGGTGGGAGGCTCAGCATCCCCGGCACAGGGGTCAGCTTGCTGGTGCC 

CAATGGAGCCATTCCCCAGGGCAAGTTCTACGAGATGTATCTACTCATCAACAAGGCAGAAAGTACCCTCCCGCT 

TTCAGAAGGGACCCAGACAGTATTGAGCCCCTCGGTGACCTGTGGACCCACAGGCCTCCTGCTGTGCCGCCCCGT 

CATCCTCACCATGCCCCACTGTGCCGAAGTCAGTGCCCGTGACTGGATCTTTCAGCTCAAGACCCAGGCCCACCA 

GGGCCACTGGGAGGAGGTGGTGACCCTGGATGAGGAGACCCTGAACACACCCTGCTACTGCCAGCTGGAGCCCAG 

GGCCTGTCACATCCTGCTGGACCAGCTGGGCACCTACGTGTTCACGGGCGAGTCCTATTCCCGCTCAGCAGTCAA 

GCGGCTCCAGCTGGCCGTCTTCGCCCCCGCCCTCTGCACCTCCCTGGAGTACAGCCTCCGGGTCTACTGCCTGGA 

GGACACGCCTGTAGCACTGAAGGAGGTGCTGGAGCTGGAGCGGACTCTGGGCGGATACTTGGTGGAGGAGCCGAA 

ACCGCTAATGTTCAAGGACAGTTACCACAACCTGCGCCTCTCCCTCCATGACCTCCCCCATGCCCATTGGAGGAG 

CAAGCTGCTGGCCAAATACCAGGAGATCCCCTTCTATCACATTTGGAGTGGCAGCCAGAAGGCCCTCCACTGCAC 

TTTCACCCTGGAGAGGCACAGCTTGGCCTCCACAGAGCTCACCTGCAAGATCTGCGTGCGGCAAGTGGAAGGGGA 

GGGCCAGATATTCCAGCTGCATACCACTCTGGCAGAGACACCTGCTGGCTCCCTGGACACTCTCTGCTCTGCCCC 

TGGCAGCACTGTCACCACCCAGCTGGGACCTTATGCCTTCAAGATCCCACTGTCCATCCGCCAGAAGATATGCAA 

CAGCCTAGATGCCCCCAACTCACGGGGCAATGACTGGCGGATGTTAGCACAGAAGCTCTCTATGGACCGGTACCT 

GAATTACTTTGCCACCAAAGCGAGCCCCACGGGTGTGATCCTGGACCTCTGGGAAGCTCTGCAGCAGGACGATGG 

GGACCTCAACAGCCTGGCGAGTGCCTTGGAGGAGATGGGCAAGAGTGAGATGCTGGTGGCTGTGGCCACCGACGG 

GGACTGCTGAGCCTCCTGGGACAGCGGGCTGGCAGGGACTGGCAGGAGGCAGGTGCAGGGAGGCCTGGGGCAGCC

TCCTGATGGGGATGTTTGGCCTCTGCTTCCTCCCAGTTCACAGCCAGAGTTGCCTCTCCTCCTCCTCTTCCCCAA 

CCCCCAGACCATGACCAGCCTTAGAAAATCCATGTACTCTGTTGTTAGAGGGCCCAGAGTTCCTTCTCCACCCCC 

GCTCTCTCTCTCTTGGCCTGAGATCTCTGTGCAGGAACCAAGATGGGGCTGAAGCCTCTGGAGGCAGTTGGTTGG 

GGGCGGGCAGGCAGGAGGCCCTCCCTCCACCCCCCCACCCTCAGCCCGGCAACTTCTGGGTTCCGTGGGTTTTAG 

TTCCGTTCTTCGTTTTCTTCCTCCGTTATTGATTTCTCCTTTCTCCCTAAGCCCCCTTCTGCTTCCACGCCCTTT 

TCCTCTTTGAAGAGTCAAGTACAATTCAGACAAACTGCTTTCTCCTGTCCAAAAGCAAAAAGGCAAAGGAAAGAA 

AGAAAGCTTCAGACCGCTAGTAAGGCTCAAAGAAGAAGAAAAACACCAAAACCACAAGGGAAAAGAAAAACCCAG 

TTTCTTAGGAAACGCAAACGATTTATTATCCAGATTATTTGGATAAGTCCTTTTTAAAA 
FIGURE 146



MGARSGARGALLLALLLCWDPRLSQAGTDSGSEVLPDSFPSAPAEPLPYFLQEPQDAYIVKNK 
PVELRCRAFPATQIYFKCNGEWVSQNDHVTQEGLDEATGLRVREVQIEVSRQQVEELFGLEDY 
WCQCVAWSSAGTTKSRRAYVRIAYLRKNFDQEPLGKEVPLDHEVLLQCRPPEGVPVAEVEWLK 
NEDVIDPTQDTNFLLTIDHNLIIRQARLSDTANYTCVAKNIVAKRRSTTATVIVYVNGGWSSW 
AEWSPCSNRCGRGWQKRTRTCTNPAPLNGGAFCEGQAFQKTACTTICPVDGAWTEWSKWSACS 
TECAHWRSRECMAPPPQNGGRDCSGTLLDSKNCTDGLCMQNKKTLSDPNSHLLEASGDAALYA 
GLVVAIFVVVAILMAVGVVVYRRNCRDFDTDITDSSAALTGGFHPVNFKTARPSNPQLLHPSV 
PPDLTASAGIYRGPVYALQDSTDKIPMTNSPLLDPLPSLKVKVYSSSTTGSGPGLADGADLLG 
VLPPGTYPSDFARDTHFLHLRSASLGSQQLLGLPRDPGSSVSGTFGCLGGRLSIPGTGVSLLV 
PNGAIPQGKFYEMYLLINKAESTLPLSEGTQTVLSPSVTCGPTGLLLCRPVILTMPHCAEVSA 
RDWIFQLKTQAHQGHWEEVVTLDEETLNTPCYCQLEPRACHILLDQLGTYVFTGESYSRSAVK
RLQLAVFAPALCTSLEYSLRVYCLEDTPVALKEVLELERTLGGYLVEEPKPLMFKDSYHNLRL 
SLHDLPHAHWRSKLLAKYQEIPFYHIWSGSQKALHCTFTLERHSLASTELTCKICVRQVEGEG 
QIFQLHTTLAETPAGSLDTLCSAPGSTVTTQLGPYAFKIPLSIRQKICNSLDAPNSRGNDWRM 
LAQKLSMDRYLNYFATKASPTGVILDLWEALQQDDGDLNSLASALEEMGKSEMLVAVATDGDC 

Important features of the protein:
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 374-395 

N-glycosylation sites. 

amino acids 222-225, 347-350 
Glycosaminoglycan attachment site. 

amino acids 492-495

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 233-236, 234-237 

Casein kinase II phosphorylation sites. 

amino acids 30-33, 87-90, 251-254, 341-344, 359-362, 629-632, 651-654,
706-709, 757-760, 827-830, 925-928, 941-944
Tyrosine kinase phosphorylation sites.
amino acids 216-223, 773-780
N-myristoylation sites.

amino acids 2-7, 6-11, 27-32, 96-101, 137-142, 179-184, 247-252, 281-286,
334-339, 379-384, 491-496, 495-500, 509-514, 542-547, 547-552, 550-555, 553-558,
560-565, 611-616, 785-790, 834-839, 844-849
Prokaryotic membrane lipoprotein lipid attachment site.
amino acids 541-551 

ATP/GTP-binding site motif A (P-loop).

amino acids 926-933 

Growth factor and cytokines receptors family signature 2. 

amino acids 306-312 
FIGURE 147



GAGAGGGACAGAGGCTGGAGAAGGATGTATGGCCTGCCCTGGGCTTGTCTGTTCCCTCCTGAGCCTGAGCCCCTT 

ACCTTCCTGACCCCATCAAGCACACACTGGCTCTGCTGGCTCCCCTGCTGGGCCTGGGCCTGGGGCTGGCCCTGA 

GTCAGCTGGCTGCAGGGGCCACAGACTGCAAGTTCCTTGGCCCGGCAGAGCACCTGACATTCACCCCAGCAGCCA 

GGGCCCGGTGGCTGGCCCCTCGAGTTCGTGCGCCAGGACTCCTGGACTCCCTCTATGGCACCGTGCGCCGCTTCC 

TCTCGGTGGTGCAGCTCAATCCTTTCCCTTCAGAGTTGGTAAAGGCCCTACTGAATGAGCTGGCCTCCGTGAAGG 

TGAATGAGGTGGTGCGGTACGAGGCGGGCTACGTGGTATGCGCTGTGATCGCGGGCCTCTACCTGCTGCTGGTGC 

CCACTGCCGGGCTTTGCTTCTGCTGCTGCCGCTGCCACCGGCGCTGCGGGGGACGAGTGAAGACAGAGCACAAGG 

CGCTGGCCTGTGAGCGCGCGGCCCTCATGGTCTTCCTGCTGCTGACCACCCTCTTGCTGCTGATTGGTGTGGTCT 

GTGCCTTTGTCACCAACCAGCGCACGCATGAACAGATGGGCCCCAGCATCGAGGCCATGCCTGAGACCCTGCTCA 

GCCTCTGGGGCCTGGTCTCTGATGTCCCCCAAGAGCTGCAGGCCGTGGCACAGCAATTCTCCCTGCCCCAGGAGC 

AAGTCTCAGAGGAGCTGGATGGTGTTGGTGTGAGCATTGGGAGCGCGATCCACACTCAGCTCAGGAGCTCCGTGT 

ACCCCTTGCTGGCGGCCGTGGGCAGTTTGGGCCAGGTCCTGCAGGTCTCCGTGCACCACCTGCAAACCTTGAATG 

CTACAGTGGTAGAGCTGCAGGCCGGGCAGCAGGACCTGGAGCCAGCCATCCGGGAACACCGGGACCGCCTCCTTG 

AGCTGCTGCAGGAGGCCAGGTGCCAGGGAGATTGTGCAGGGGCCCTGAGCTGGGCCCGCACCCTGGAGCTGGGTG 

CTGACTTCAGCCAGGTGCCCTCTGTGGACCATGTCCTGCACCAGCTAAAAGGTGTCCCCGAGGCCAACTTCTCCA 

GCATGGTCCAGGAGGAGAACAGCACCTTCAACGCCCTTCCAGCCCTGGCTGCCATGCAGACATCCAGCGTGGTGC 

AAGAGCTGAAGAAGGCAGTGGCCCAGCAGCCGGAAGGGGTGAGGACACTGGCTGAAGGGTTCCCGGGCTTGGAGG 

CAGCTTCCCGCTGGGCCCAGGCACTGCAGGAGGTGGAGGAGAGCAGCCGCCCCTACCTGCAGGAGGTGCAGAGAT 

ACGAGACCTACAGGTGGATCGTGGGCTGCGTGCTGTGCTCCGTGGTCCTATTCGTGGTGCTCTGCAACCTGCTGG 

GCCTCAATCTGGGCATCTGGGGCCTGTCTGCCAGGGACGACCCCAGCCACCCAGAAGCCAAGGGCGAGGCTGGAG 

CCCGCTTCCTCATGGCAGGTGTGGGCCTCAGCTTCCTCTTTGCTGCACCCCTCATCCTCCTGGTGTTCGCCACCT 

TCCTGGTGGGTGGCAACGTGCAGACGCTGGTGTGCCGGAGCTGGGAGAACGGCGAGCTCTTTGAGTTTGCAGACA 

CCCCAGGGAACCTGCCCCCGTCCATGAACCTGTCGCAACTTCTTGGCCTGAGGAAGAACATCAGCATCCACCAAG 

CCTATCAGCAGTGCAAGGAAGGGGCAGCGCTCTGGACAGTCCTGCAGCTCAACGACTCCTACGACCTGGAGGAGC 

ACCTGGATATCAACCAGTATACCAACAAGCTACGGCAGGAGTTGCAGAGCCTGAAAGTAGACACACAGAGCCTGG 

ACCTGCTGAGCTCAGCCGCCCGCCGGGACCTGGAGGCCCTGCAGAGCAGTGGGCTTCAGCGCATCCACTACCCCG 

ACTTCCTCGTTCAGATCCAGAGGCCCGTGGTGAAGACCAGCATGGAGCAGCTGGCCCAGGAGCTGCAAGGACTGG 

CCCAGGCCCAAGACAATTCTGTGCTGGGGCAGCGGCTGCAGGAGGAGGCCCAAGGACTCAGAAACCTTCACCAGG 

AGAAGGTCGTCCCCCAGCAGAGCCTTGTGGCAAAGCTCAACCTCAGCGTCAGGGCCCTGGAGTCCTCTGCCCCGA 

ATCTCCAGCTGGAGACCTCAGATGTCCTAGCCAATGTCACCTACCTGAAAGGAGAGCTGCCTGCCTGGGCAGCCA 

GGATCCTGAGGAATGTGAGTGAGTGTTTCCTGGCCCGGGAGATGGGCTACTTCTCCCAGTACGTGGCCTGGGTGA

GAGAGGAGGTGACTCAGCGCATTGCCACCTGCCAGCCCCTCTCCGGAGCCCTGGACAACAGCCGTGTGATCCTGT 

GTGACATGATGGCTGACCCCTGGAATGCCTTCTGGTTCTGCCTGGCATGGTGCACCTTCTTCCTGATCCCCAGCA 

TCATCTTTGCCGTCAAGACCTCCAAATACTTCCGTCCTATCCGGAAACGCCTCAGCTCCACCAGCTCTGAGGAGA 

CTCAGCTCTTCCACATCCCCCGGGTTACCTCCCTGAAGCTGTAGGGCCTTGTGGGGTGAGGTGACCCTGAGGCTG 

CCTGTCCTCCCCTTTGATTTAGCCTGGGCCACAGGACTTCGGTAGCTCTTGCCCCAGAGCCCAGGCTGGCATCCA 

GGCCTGGACTGTCCCCAGTTCCGGCTTACCTGGCCCCACCTTGCCTGCTCCTTTCCACCCCTTTCTGCTCACGAC 

CCCCATCATTCACGCTCAGAATCACATGGGACTTCTGTGCAGCTGCAGAGCCAGCAAGTCCCTACAGGTGTCACC 

CGTTACCCCCATGCTGGTGGCATCCTCACAGGAAGAGCCTGTTCTCCACCTGCTGGAGCCTGGACCCTGGGGTGG 

GACAGAGGCCTCGTCCAACCCCACTCCCCTTCCCGTGTGTCTTCCCCCTGCCAAGCCTCCCCCTGCCAAGCCTCC 

CCCTGCCCCTCTCTGAGCCCCTCGCCCCCCACACCGTCCTCATCTGGCCTCCCCCCTGGCCCCCACTTCCCTCTT 

ATGCCCTTCCTGGCCCTTTGCTTCCTCCCTTAGTCCCCTCTTCACCATATCTCCACTGCTACCTTGCTGGCCCCA 

GAGACCACCCTGCCCAACCAAACCACTCAGGTAACGCCACTAATCAGGCAGGGGCCACCATGGCCTAGGTCTGGG 

CTGGCTGCAGGCCCTGCCTCATGGCCTCTGAGCCCTCCACTGCCCCAGGGCCTTGGGCCCTCTGCAGATCTCATC 

CAGGATTTATTGTTGTCCAGTGGGGTGAGGGAGGCCTGTCTGAAGGCCGAGCCTCCCTGCCTGCACCCAAGTTAG 

AAATGGGGGTACCAGCACTTAGCTTCTCTCTGAGTGCTGGCTCCCAAGGAAGGGACCTGGGACCTGGGCCACAGT 

GGGGGCTTGCCCTTACCTCTTCAGAAGGAAGCATCTTCCACAGCCCCCACCCAACTTTCTTAGGAGTGATCTGGT 

GGCCAGAACAGGATTTTGCACGGCCCCTTTTATCCTGCGCATGTGGCCTAGGGTCATCCCCAGCCCATCCCTGTG 

TCAGCCCTGAGTGCTGGACACTGCGTTCCAGAAATGAGGAAGAGGAGAGAGAAGAGATGGACAGACCTCAGATCC 

ATTAAAGTGTTCTCACTTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



WO 01/40466

PCT/US00/32678



148/550 



FIGURE 148 



MKHTLALLAPLLGLGLGLALSQLAAGATDCKFLGPAEHLTFTPAARARWLAPRVRAPGLL 
DSLYGTVRRFLSWQLNPFPSELVKALLNELASVKVNEWRYEAGYVVCAVIAGLYLLLV 
PTAGLCFCCCRCHRRCGGRVKTEHKALACERAALMVFLLLTTLLLLIGVVCAFVTNQRTH 
EQMGPSIEAMPETLLSLWGLVSDVPQELQAVAQQFSLPQEQVSEELDGVGVSIGSAIHTQ 
LRSSVYPLLAAVGSLGQVLQVSVHHLQTLNATVVELQAGQQDLEPAIREHRDRLLELLQE 
ARCQGDCAGALSWARTLELGADFSQVPSVDHVLHQLKGVPEANFSSMVQEENSTFNALPA 
LAAMQTSSWQELKKAVAQQPEGVRTLAEGFPGLEAASRWAQALQEVEESSRPYLQEVQR 
YETYRWIVGCVLCSWLFVVLCNLLGLNLGIWGLSARDDPSHPEAKGEAGARTLMAGVGL 
SFLFAAPLILLVFATFLVGGNVQTLVCRSWENGELFEFADTPGNLPPSMNLSQLLGLRKN 
ISIHQAYQQCKEGAALWTVLQLNDSYDLEEHLDINQYTNKLRQELQSLKVDTQSLDLLSS 
AARRDLEALQSSGLQRIHYPDFLVQIQRPVVKTSMEQLAQELQGLAQAQDNSVLGQRLQE
EAQGLRNLHQEKVVPQQSLVAKLNLSVRALESSAPNLQLETSDVLANVTYLKGELPAWAA 
RILRNVSECFLAREMGYFSQYVAWVREEVTQRIATCQPLSGALDNSRVILCDMMADPWNA 
FWFCLAWCTFFLIPSIIFAVKTSKYFRPIRKRLSSTSSEETQLFHIPRVTSLKL 

Signal peptide: 

amino acids 1-17 
Transmembrane domain: 

amino acids 105-125, 153-173, 428-449, 476-500, 778-797 
N-glycosylation sites: 

amino acids 270-273, 343-347, 352-356, 530-534, 540-546, 563-567, 
684-688, 707-711, 725-729 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 811-815 

Tyrosine kinase phosphorylation site. 

amino acids 95-103 
N-myristoylation sites. 

amino acids 13-19, 15-21, 17-23, 26-32, 58-64, 124-130, 168-174, 
228-234, 230-236, 320-326, 338-344, 393-399, 429-435, 446-452, 
477-483, 500-506, 536-542, 644-650, 761-767 
Phospholipase A2 histidine active site. 

amino acids 129-137 

4Fe-4S ferredoxins, iron-sulfur binding region signature. 

amino acids 126-138 

Mitochondrial energy transfer proteins signature. 

amino acids 80-89 
FIGURE 149 

CACAGCTCCCTTCCCAGGACGTGAAAATCTGCCTTCTCACCATGAGGCTTCTAGTCCTTTCCA 
GCCTGCTCTGTATCCTGCTTCTCTGCTTCTCCATCTTCTCCACAGAAGGGAAGAGGCGTCCTG 
CCAAGGCCTGGTCAGGCAGGAGAACCAGGCTCTGCTGCCACCGAGTCCCTAGCCCCAACTCAA 
CAAACCTGAAAGGACATCATGTGAGGCTCTGTAAACCATGCAAGCTTGAGCCAGAGCCCCGCC 
TTTGGGTGGTGCCTGGGGCACTCCCACAGGTGTAGCACTCCCAAAGCAAGACTCCAGACAGCG 
GAGAACCTCATGCCrGGCACCTGAGGTACCCAGCAGCCTCCTGTCTCCCCTTTCAGCCTTCAC 
AGCAGTGAGCTGCAATGTTGGAGGGCTTCATCTCGGGCTGCAAGGACCCTGGGAAAGTTCCAG 
AACTCCACGTCCTTGTCTCAATTGTGCCATCAACTTTCAGAGCTATCATGAGCCAACCTCACC 
CCACAGGGCCTCAGTCGCCACCATGTGGGCCTCTCCAGTGCAAACCACCGAGCATTCCACCAT 
GACCGGTCACAGCTACAAATCCAGAGACCATCAATCCTGCTAGAGTGCAGGGTGGCAAGCACC 
CAAGGGTGGCTGACCAAGACTGCAGAGTCTCCTCCATCTTCAGGTCCATTCAGCCTCCTGGCA 
TTTAACTACCAGCATCCAGTGGTCCCCAAGGAATCCCTTCCTAGCCTCCTGACATGAGTCTGC 
TGGAAAGAGCATCCAAACAAACAAGTAATAAATAAATAAATAAACTCA 
FIGURE 150 

MRLLVLSSLLCILLLCFSIFSTEGKRRPAKAWSGRRTRLCCHRVPSPNSTNLKGHHVRLCKPC 
KLEPEPRLWVVPGALPQV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation site. 

amino acids 48-52 



Amidation sites. 

amino acids 23-27, 33-37 
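
The feature positions listed for FIGURE 150 can be cross-checked with a short motif scan. The sketch below is only an illustration: it assumes the widely used PROSITE-style consensus patterns for N-glycosylation (N-{P}-[ST]-{P}) and amidation (x-G-[RK]-[RK]). The listing does not state which tool or patterns produced its annotations, and the names FIGURE_150, N_GLYCOSYLATION, AMIDATION and find_sites are hypothetical.

```python
import re

# Protein sequence of FIGURE 150, copied from the listing with line breaks removed.
FIGURE_150 = (
    "MRLLVLSSLLCILLLCFSIFSTEGKRRPAKAWSGRRTRLCCHRVPSPNSTNLKGHHVRLCKPC"
    "KLEPEPRLWVVPGALPQV"
)

# Assumed consensus patterns (PROSITE style), not taken from the patent text.
# A lookahead is used so that overlapping sites are all reported.
N_GLYCOSYLATION = re.compile(r"(?=(N[^P][ST][^P]))")  # N-{P}-[ST]-{P}
AMIDATION = re.compile(r"(?=(.G[RK][RK]))")           # x-G-[RK]-[RK]


def find_sites(pattern, sequence):
    """Return (start, end, residues) for each match, using 1-based inclusive positions."""
    sites = []
    for match in pattern.finditer(sequence):
        residues = match.group(1)
        start = match.start() + 1           # convert 0-based index to 1-based position
        end = match.start() + len(residues)  # inclusive end of the 4-residue motif
        sites.append((start, end, residues))
    return sites


if __name__ == "__main__":
    print("N-glycosylation:", find_sites(N_GLYCOSYLATION, FIGURE_150))
    print("Amidation:", find_sites(AMIDATION, FIGURE_150))
```

Under these assumed patterns the scan reports one N-glycosylation motif starting at residue 48 and amidation motifs starting at residues 23 and 33, which agrees with the start positions given in the figure. The figure's ranges end one residue later than the four-residue motifs counted here.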
FIGURE 151 

CACCGGAGGGCAGGCAGCTGACGGAGCTGCGCTGCGTTCGCCTCGTTTGCCTCGCGCCCTCCA 

CTGGAGCTGTTCGCGCCTCCCGGCTCCCACCGCAGCCCACCCGGCAGAGGAGTCGCTACCAGC 

GCCCAGTGCGCTCTGTCAGTCCGCAAACTCCTTGCCGCCCGCCCCGGGCTGGGCACCAAATAC 

CAGGCTACCATGGTCTACAAGACTCTCTTCGCTCTTTGCATCTTAACTGCAGGATGGAGGGTA 

CAGAGTCTGCCTACATCAGCTCCTTTGTCTGTTTCTCTTCCGACAAACATTGTACCACCGACC 

ACCATCTGGACTAGCTCTCCACAAAACACTGATGCAGACACTGCCTCCCCATCCAACGGCACT 

CACAACAACTCGGTGCTCCCAGTTACAGCATCAGCCCCAACATCTCTGCTTCCTAAGAACATT 

TCCATAGAGTCCAGAGAAGAGGAGATCACCAGCCCAGGTTCGAATTGGGAAGGCACAAACACA 

GACCCCTCACCTTCTGGGTTCTCGTCAACAAGCGGTGGAGTCCACTTAACAACCACGTTGGAG 

GAACACAGCTCGGGCACTCCTGAAGCAGGCGTGGCAGCTACACTGTCGCAGTCCGCTGCTGAG 

CCTCCCACACTCATCTCCCCTCAAGCTCCAGCCTCATCACCCTCATCCCTATCAACCTCACCA 

CCTGAGGTCTTTTCTGCCTCCGTTACTACCAACCATAGCTCCACTGTGACCAGCACCCAACCC 

ACTGGAGCTCCAACTGCACCAGAGTCCCCGACAGAGGAGTCCAGCTCTGACCACACACCCACT 

TCACATGCCACAGCTGAGCCAGTGCCCCAGGAGAAAACACCCCCAACAACTGTGTCAGGCAAA 

GTGATGTGTGAGCTCATAGACATGGAGACCACCACCACCTTTCCCAGGGTGATCATGCAGGAA 

GTAGAACATGCATTAAGTTCAGGCAGCATCGCCGCCATTACCGTGACAGTCATTGCCGTGGTG 

CTGCTGGTGTTTGGAGTTGCAGCCTACCTAAAAATCAGGCATTCCTCCTATGGAAGACTTTTG 

GACGACCATGACTACGGGTCCTGGGGAAACTACAACAACCCTCTGTACGATGACTCCTAACAA 

TGGAATATGGCCTGGGATGAGGATTAACTGTTCTTTATTTATAAGTGCTTATCCAGTAGAATT 

AATAAGTACCTGATGCGCATTGAACGACAATCTTAAGCCCTGTTTTGTTGGTATGGTTGTTTT 

TGTTTTCCTCCCTCTCCTCTGGCTGCTACAACTTCCCCTTTCTGGTACAAGAAGAACCATTCT 

TTAAAGGTGAGTGGAGGCTGATTTGCAGCTGAAGTGGGCCAGCCTTGCACCAGCCAGGCCAGA 

CCACCATGGTGAAGGCTTCTTTCCCCACTGCAGGACCCACTTTGAGAAGGATCGAGGAGGAGG 

ATTTGGGTTGTTTTGTTAGGGGTTACTTTCAGGGGAACATTTCATTTGTGTTATTTCTTAAAC 

TTCTATTTAGGAAATTACATTAAGTATTAATGAGGGGAAAGGAAATGAGCTCTACGAGGATTT 

CACCTTGCATGGGAGAGAGCAGGGTTTTCTCAGATTCCTTTTTAATCTCTATTTATCTGGTTG 

TTTCTGACAGGATGCTGCCTGCTTGGCTCTACGAGCTGGAAAGCAGCTTCTTAGCTGCCTAAT 

TAATGAAAGATGAAAATAGGAAGTGCCCTGGAGGGGGCCAGCAGGTCACGGGGCAGAATCTCT 

CAGGTTGCTGTGGGATCTCAGTGTGCCCCTACCTGTTCTCCCCTCCAGGCCACCTGTCTCTGT 

AAAGGATGTCTGCTCTGTTCAAAAGGCAGCTGGGATCCCAGCCCACAAGTGATCAGCAGAGTT 

GCATTTCCAAAGAAAAAGGCTATGAGATGAGCTGAGTTATAGAGAGAAAGGGAGAGGCATGTA 

CGGTGTGGGGAAGTGGAAGAGAAGCTGGCGGGGGAGAAGGAGGCTAACCTGCACTGAGTACTT 

CATTAGGACAAGTGAGAATCAGCTATTGATAATGGCCAGAGATATCCACAGCTTGGAGGAGCC 

CAGAGACTGTTTGCTTTATACCCACACAGCAACTGGTCCACTGCTTTACTGTCTGTTGGATAA 

TGGCTGTAAAATGTTTAAAAAC 
FIGURE 152 

MVYKTLFALCILTAGWRVQSLPTSAPLSVSLPTNIVPPTTIWTSSPQNTDADTASPSNGTHNN 
SVLPVTASAPTSLLPKNISIESREEEITSPGSNWEGTNTDPSPSGFSSTSGGVHLTTTLEEHS 
SGTPEAGVAATLSQSAAEPPTLISPQAPASSPSSLSTSPPEVFSASVTTNHSSTVTSTQPTGA 
PTAPESPTEESSSDHTPTSHATAEPVPQEKTPPTTVSGKVMCELIDMETTTTFPRVIMQEVEH 
ALSSGSIAAITVTVIAVVLLVFGVAAYLKIRHSSYGRLLDDHDYGSWGNYNNPLYDDS 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 258-278 

N-glycosylation sites. 

amino acids 58-61, 62-65, 80-83, 176-179 
Casein kinase II phosphorylation sites. 

amino acids 49-52, 85-88, 95-98, 100-103, 120-123, 121-124, 
141-144, 164-167, 191-194, 195-198, 200-203 

Tyrosine kinase phosphorylation site. 

amino acids 289-296 



N-myristoylation sites. 

amino acids 59-64, 115-120, 128-133, 133-138, 257-262, 297-302 
FIGURE 153 



ACGTCACTGTCTTGAAGCAGCAGTAGCCTGGGAAGTGAGGCAGGAGGAATTGAGAGGCAGGAA 
GGGNGCTGGAGACACAGCTGAGCCTGGAAATGAGAGTGGGCATCGCCGTGGTCATCATGACTC 
CTCTGCGGCGTGGTCACCATGTTGGTTCACTGTGTTGGGCTCTTATTGACGGGTCTCCTGCTA 
GGCCTGACCTTGGGTGCCGGAGCCCTGCTGGCTTCTGAGCCTATCTACCAACCACCTTCAGCC 
TGGGTGCCAGCTGGGGGGCTGGTGGGGCTGGCGCTGCTGGGAGCCCTGCTCACACTTCGGTGG 
CCACGTCCATTCACAGTTCTGGGCACAACCCTGCTGGGTTCTGCAGTGCTTGTGGCCTGTGTT 
GACTACTTCCTGGAGGGGCTGGCACTGGGGAGTTGGCTGGGCCAACGCCTGCAGACACTTCCA 
GCCTTGCCTTCTCTCTGCTGATATAGCTGGGTCTTACTGGGGATCTGGCCAGCCTTGGGGGCC 
CTTGGAGCCCTGGCCCAGTGGAAGCTCGTGCCTGAGGAACATGGAGGCCACGCTAATGGGTCT 
GTTCCTGGTTTCCCAGATGCATAAAGGAAGACATATCCCTCCCCTGGGCAGCAAGGCTACAAT 
GGGAGGGAGGGAGAACATGGGAGCATGTGAATAAAATGGCATTAAATACTGAAAAAAAAAAAA 
AAAAfliAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 154 



MLVHCVGLLLTGLLLGLTLGAGALLASEPIYQPPSAWVPAGGLVGLALLGALLTLRWPRPFTV 
LGTTLLGSAVLVACVDYFLEGLALGSWLGQRLQTLPALPSLC 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 38-55, 60-78 

N-myristoylation sites. 

amino acids 7-13, 12-18, 16-22, 22-28, 41-47, 50-56, 84-90, 88-94 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 67-78 
FIGURE 155 

TGCAATTAAAGGAGTCGGGTCTCTAACTGTTGATCTGTTTTTTTCCCTTCTGAGCAATGGAGC 

TTACCATCTTTATCCTGAGACTGGCCATTTACATCCTGACATTTCCCTTGTACCTGCTGAACT 

TTCTGGGCTTGTGGAGCTGGATATGCAAAAAATGGTTCCCCTACTTCTTGGTGAGGTTCACTG 

TGATATACAACGAACAGATGGCAAGCAAGAAGCGGGAGCTCTTCAGTAACCTGCAGGAGTTTG 

CGGGCCCCTCCGGGAAACTCTCCCTGCTGGAAGTGGGCTGTGGCACGGGGGCCAACTTCAAGT 

TCTACCCACCTGGGTGCAGGGTGACCTGTATTGACCCCAACCCCAACTTTGAGAAGTTTTTGA 

TCAAGAGCATTGCAGAGAACCGACACCTGCAGTTTGAGCGCTTTGTGGTAGCTGCCGGGGAGA 

ACATGCACCAGGTGGCTGATGGCTCTGTGGATGTGGTGGTCTGCACCCTGGTGCTGTGCTCTG 

TGAAGAACCAGGAGCGGATTCTCCGCGAGGTGTGCAGAGTGCTGAGACCGGGAGGGGCTTTCT 

ATTTCATGGAGCATGTGGCAGCTGAGTGTTCGACTTGGAATTACTTCTGGCAACAAGTCCTGG 

ATCCTGCCTGGCACCTTCTGTTTGATGGGTGCAACCTGACCAGAGAGAGCTGGAAGGCCCTGG 

AGCGGGCCAGCTTCTCTAAGCTGAAGCTGCAGCACATCCAGGCCCCACTGTCCTGGGAGTTGG 

TGCGCCCTCATATCTATGGATATGCTGTGAAATAGTGTGAGCTGGCAGTTAAGAGCTGAATGG 

CTCAAAGAATTTAAAGCTTCAGTTTTACATTTAAAATGCTAAGTGGGAGAAGAGAAACCTTTT 

TTTTGGGGGGCGGTTTTTTTGGTTTGTTGTTGGTTTTTTTTTTTTTTTTGGCAGGAGAATCTC 

TTGAACCCAGAAGGCGAAGGTTGCAGTGAACCGAGATCATGCCATTGTACTCTAGCCTGGGTG 

ACAAGAGCAAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAAAAAGAAGTAGAGACAGGGAGAC 

GGGGTCTCACTGTGTTGCCTAGGCCGGTCTTGAACTCCTGGGCTCAAGTGATTCTCCCACCTT 

GACCTCCTAAATTGTTGGGATTACAGGTGTGAGACAGTGCACCTGGCCGAAATAGCTCAAGTT 

TCTGAAAAACAAATCTGAATCTATTTGTTATTCTTAGCGTCACTGGTCTGGCTTTCAGAATTA 

ACATACAAGGTTGCCACACCTAGTTCTGCCCAGCTTTATGTCTTTTATTCCAGTATTCCACCA 

AAGTTTGTTTTCCTGCATTCCAGTTCTCAAGTCTTAAGATAAAGATTGTACTTGACAGTTTAG 

TATATCCATAAAACTATTTGAGGTGGTTAAGGTTCTTGGGTTCATTTTCCTTAATACTTTGCT 

GAATATTGTAGATTGTAGGCAATGAAAAAGTCTACTAAATTAGGAAAACCTTGAATAATTAGG 

TATCCTAGGTAAGAGCCCCTAAACATCAAGCAATCTGTGAGTCTGTAAAGAAATAAATATTTT 

TTGGATTATTCTTATCTAATTCCACCCCTGTTGGAAGATGATTTCTTTGTTCTTTGCAACTAT 

GGAAGCTGTGAAAATCATCACAAGTGCCTCTGAAAGCGAGTGTTAGGTTGGTTAGAGGGTTTA 

ATATTTTCTGCAATGGTTTGTAGGAATTTTAATAAATGTAGTATATTTTCTGAGATGATTTTG 

TAAAAGTACTATTTTAAATATCAAATCAACCAATAAATTCACATTTGTGTTAGGAACAAAA 
FIGURE 156 

MELTIFILRLAIYILTFPLYLLNFLGLWSWICKKWFPYFLVRFTVIYNEQMASKKRELFSNLQ 
EFAGPSGKLSLLEVGCGTGANFKFYPPGCRVTCIDPNPNFEKFLIKSIAENRHLQFERFVVAA 
GENMHQVADGSVDVVVCTLVLCSVKNQERILREVCRVLRPGGAFYFMEHVAAECSTWNYFWQQ 
VLDPAWHLLFDGCNLTRESWKALERASFSKLKLQHIQAPLSWELVRPHIYGYAVK 

Signal peptide: 

amino acids 1-29 

N-glycosylation site. 

amino acids 203-207 



N-myristoylation sites. 

amino acids 78-84, 80-86, 91-97, 201-207 
FIGURE 157 



CCGCTGAGATGTACGAACTTCCGGTTCTCCGGGCAGCTGCCACTGCTGTAGCTTCTGCCACCT 

GCCACGACCGGGCCTCTCCCTGGCGTTTGGTCACCTCTGCTTCATTCTCCACCGCGCCTATGG 

TCCCTCTTGGAGCCAGCGTGGCGGGCCTGGCGGCTCCCGGGTGGTGAGAGAGCGGTCCGGGAA 

CGATGAAGGCCTCGCAGTGCTGCTGCTGTCTCAGCCACCTCTTGGCTTCCGTCCTCCTCCTGC 

TGTTGCTGCCTGAACTAAGCGGGCCCCTGGCAGTCCTGCTGCAGGCAGCCGAGGCCGCGCCAG 

GTCTTGGGCCTCCTGACCCTAGACCACGGACATTACCGCCGCTGCCACCGGGCCCTACCCCTG 

CCCAGCAGCCGGGCCGTGGTCTGGCTGAAGCTGCGGGGCCGCGGGGCTCCGAGGGAGGCAATG 

GCAGCAACCCTGTGGCCGGGCTTGAGACGGACGATCACGGAGGGAAGGCCGGGGAAGGCTCGG 

TGGGTGGCGGCCTTGCTGTGAGCCCCAACCCTGGCGACAAGCCCATGACCCAGCGGGCCCTGA 

CCGTGTTGATGGTGGTGAGCGGCGCGGTGCTGGTGTACTTCGTGGTCAGGACGGTCAGGATGA 

GAAGAAGAAACCGAAAGACTAGGAGATATGGAGTTTTGGACACTAACATAGAAAATATGGAAT 

TGACACCTTTAGAACAGGATGATGAGGATGATGACAACACGTTGTTTGATGCCAATCATCCTC 

GAAGA^AGAATGTGCCTTTTGATGAAAGAACTTTATCTTTCTACAATGAAGAGTGGAATTTC 

TATGTTTAAGGAATAAGAAGCCACTATATCAATGTTGGGGGGGTATTTAAGTTACATATATTT 

TAACAACCTTTAATTTGCTGTTGCAATAAATACCGTATCCTTTTATTATATCTTTATATGTAT 

AGAAGTACTCTATTAATGGGCTCAGAGATGTTGGGGATAAAGTATACTGTAATAATTTATCTG 

TTTGAAAATTACTATAAAACGGTGTTTTCTGGTCGGTTTTTGTTTCCTGCTTACCATATGATT 

GTAAATTGTTTTATGTATTAATCAGTTAATGCTAATTATTTTTGCTGATGTCATATGTTAAAG 

AGCTATAAATTCCAACAACCAACTGGTGTGTAAAAATAATTTAAAATTTCCTTTACTGAAAGG 

TATTTCCCATTTTTGTGGGGAAAAGAAGCCAAATTTATTACTTTGTGTTGGGGTTTTTAAAAT 

ATTAAGAAATGTCTAAGTTATTGTTTGCAAAACAATAAATATGATTTTAAATTCTCTTAAAAA 
AAAAA 






FIGURE 158 



MKASQCCCCLSHLLASVLLLLLLPELSGPLAVLLQAAEAAPGLGPPDPRPRTLPPLPPGPTPA 
QQPGRGLAEAAGPRGSEGGNGSNPVAGLETDDHGGKAGEGSVGGGLAVSPNPGDKPMTQRALT 
VLMVVSGAVLVYFVVRTVRMRRRNRKTRRYGVLDTNIENMELTPLEQDDEDDDNTLFDANHPRR 

Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 124-140 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 



amino acids 69-75, 78-84, 81-87, 97-103, 103-109, 106-112, 
157-160 
FIGURE 159 

GCTGCAGGCGGCGACGGCTACACCATGGGCCGGCTGCTGCGGGCCGCCCGGCTGCCGCCGCTG 

CTTTCGCCGCTGCTGCTTCTGCTGGTTGGGGGAGCGTTCCTGGGTGCCTGTGTGGCTGGGTCT 

GATGAGCCTGGCCCAGAGGGCCTCACCTCCACCTCCCTGCTAGACCTCCTGCTGCCCACTGGC 

TTGGAGCCACTGGACTCAGAGGAGCCTAGTGAGACCATGGGCCTGGGAGCTGGGCTGGGAGCC 

TCTGGCTCAGGCTTCCCCAGCGAAGAGAATGAAGAGTCTCGGATTCTGCAGCCACCACAGTAC 

TTCTGGGAAGAGGAGGAAGAGCTGAATGACTCAAGTCTGGACCTGGGACCCACTGCAGATTAT 

GTTTTTCCTGACTTAACTGAGAAGGCAGGTTCCATTGAAGACACTAGCCAGGCTCAAGAGCTG 

CCAAACCTCCCCTCTCCCTTGCCCAAGATGAATCTGGTTGAGCCTCCCTGGCATATGCCTCCC 

AGAGAGGAGGAAGAAGAGGT^GAGGAAGAGGAGGAGAGGGAGAAGGAAGAGGTAGAGAAACAA 

GAGGAGGAGGAAGAGGAGGAGCTGCTCCCTGTGAATGGATCCCAAGAAGAAGCCAAGCCTCAG 

GTCCGTGACTTTTCTCTCACCAGCAGCAGCCAGACCCCAGGGGCCACCAAAAGCAGGCATGAA 

GACTCCGGGGACCAGGCCTCATCAGGTGTGGAGGTGGAGAGCAGCATGGGGCCCAGCTTGCTG 

CTGCCTTCAGTCACCCCAACTACAGTGACTCCGGGGGACCAGGACTCCACCAGCCAAGAGGCA 

GAGGCCACAGTGCTGCCAGCTGCAGGGCTTGGGGTAGAGTTCGAGGCTCCTCAGGAAGCAAGC 

GAGGAAGCCACTGCAGGAGCAGCTGGTTTGTCTGGCCAGCACGAGGAGGTGCCGGCCTTGCCT 

TCATTCCCTCAAACCACAGCTCCCAGTGGGGCCGAGCACCCAGATGAAGATCCCCTTGGCTCT 

AGAACCTCAGCCTCTTCCCCACTGGCCCCTGGAGACATGGAACTGACACCTTCCTCTGCTACC 

TTGGGACAAGAAGATCTCAACCAGCAGCTCCTAGAAGGGCAGGCAGCTGAAGCTCAATCCAGG 

ATACCCTGGGATTCTACGCAGGTGATCTGCAAGGACTGGAGCAATCTGGCTGGGAAAAACTAC 

ATCATTCTGAACATGACAGAGAACATAGACTGTGAGGTGTTCCGGCAGCACCGGGGGCCACAG 

CTCCTGGCCCTGGTGGAAGAGGTGCTGCCCCGCCATGGCAGTGGCCACCATGGGGCCTGGCAC 

ATCTCTCTGAGCAAGCCCAGCGAGAAGGAGCAGCACCTTCTCATGACACTGGTGGGCGAGCAG 

GGGGTGGTGCCCACTCAAGATGTCCTTTCCATGCTGGGTGACATCCGCAGGAGCCTGGAGGAG 

ATTGGCATCCAGAACTATTCCACAACCAGCAGCTGCCAGGCGCGGGCCAGCCAGGTGCGCAGC 

GACTACGGCACGCTCTTCGTGGTGCTGGTGGTCATTGGGGCCATCTGCATCATCATCATTGCG 

CTTGGCCTGCTCTACAACTGCTGGCAGCGCCGGCTGCCCAAGCTCAAGCACGTGTCGCACGGC 

GAGGAGCTGCGCTTCGTGGAGAACGGCTGCCACGACAACCCCACGCTGGACGTGGCCAGCGAC 

AGCCAGTCGGAGATGCAGGAGAAGCACCCCAGCCTGAACGGCGGCGGGGCCCTCAACGGCCCG 

GGGAGCTGGGGGGCGCTCATGGGGGGCAAGCGGGACCCCGAGGACTCGGACGTGTTCGAGGAG 

GACACGCACCTGTGAGCGCAGCCGAGGCGCAGGCCGAGTGGGCCGCCAGGACCAAGCGAGGTG 

GACCCCGAAACGGACGGCCCGGAGCCCGCACCAGCCCCGCGCCTACCCGGGCCGCCCCCGCGG 

CCTGGCCCTCGGCGCGGGCTCCTTCCCGCTTCCCCCGACTTCACACGGCGGCTTCGGACCAAC 

TCCCTCACTCCCGCCCGAGGGGCAGGCCTCAAAGCCCGCCTTGGCCCCGCTTTCCCGCCCCTG 

AACCCCGGCCCCGCGGGCGGCGGGCGGCGCTTCCTGCGCCCCGGGACTCAATTAAACCCGCCC 

GGAGACCACGCCGGGCCCAGCAATiA 
FIGURE 160 

MGRLLRAARLPPLLSPLLLLLVGGAFLGACVAGSDEPGPEGLTSTSLLDLLLPTGLEPLDSEE 
PSETMGLGAGLGASGSGFPSEENEESRILQPPQYFWEEEEELNDSSLDLGPTADYVFPDLTEK 
AGSIEDTSQAQELPNLPSPLPKMNLVEPPWHMPPREEEEEEEEEEEREKEEVEKQEEEEEEEL 
LPVNGSQEEAKPQVRDFSLTSSSQTPGATKSRHEDSGDQASSGVEVESSMGPSLLLPSVTPTT 
VTPGDQDSTSQEAEATVLPAAGLGVEFEAPQEASEEATAGAAGLSGQHEEVPALPSFPQTTAP 
SGAEHPDEDPLGSRTSASSPLAPGDMELTPSSATLGQEDLNQQLLEGQAAEAQSRIPWDSTQV 
ICKDWSNLAGKNYIILNMTENIDCEVFRQHRGPQLLALVEEVLPRHGSGHHGAWHISLSKPSE 
KEQHLLMTLVGEQGVVPTQDVLSMLGDIRRSLEEIGIQNYSTTSSCQARASQVRSDYGTLFVV 
LVVIGAICIIIIALGLLYNCWQRRLPKLKHVSHGEELRFVENGCHDNPTLDVASDSQSEMQEK 
HPSLNGGGALNGPGSWGALMGGKRDPEDSDVFEEDTHL 

Signal peptide: 

amino acids 1-29 

Transmembrane domain: 

amino acids 499-521 

N-glycosylation sites. 

amino acids 106-110, 193-197, 395-399, 480-484 

Glycosaminoglycan attachment site. 

amino acids 77-81 

N-myristoylation sites. 

amino acids 24-30, 28-34, 41-47, 69-75, 71-77, 73-79, 75-81, 
216-222, 327-333, 455-461, 519-525, 574-580, 581-587, 584-590 



Amidation site. 

amino acids 588-592 



FIGURE 161 

CCAGGGCGGAGCGCAGCTGCGCCGGGCTTGGGCGCCTGGGGCCGCCGCTCCCCACCGTCGTTT 
TCCCCACCGAGGCCGAGGCGTCCCGGAGTCATGGCCGGCCTGAACTGCGGGGTCTCTATCGCA 
CTGCTAGGGGTTCTGCTGCTGGGTGCGGCGCGCCTGCCGCGCGGGGCAGAAGCTTTTGAGATT 
GCTCTGCCACGAGAT^GCAACATTACAGTTCTCATAAAGCTGGGGACCCCGACTCTGCTGGCA 
AAACCCTGTTACATCGTCATTTCTAAAAGACATATAACCATGTTGTCCATCAAGTCTGGAGAA 
AGAATAGTCTTTACCTTTAGCTGCCAGAGTCCTGAGAATCACTTTGTCATAGAGATCCAGAAA 
AATATTGACTGTATGTCAGGCCCATGTCCTTTTGGGGAGGTTCAGCTTCAGCCCTCGACATCG 
TTGTTGCCTACCCTCAACAGAACTTTCATCTGGGATGTCAAAGCTCATAAGAGCATCGGTTTA 
GAGCTGCAGTTTTCCATCCCTCGCCTGAGGCAGATCGGTCCGGGTGAGAGCTGCCCAGACGGA 
GTCACTCACTCCATCAGCGGCCGAATCGATGCCACCGTGGTCAGGATCGGAACCTTCTGCAGC 
AATGGCACTGTGTCCCGGATCAAGATGCAAGAAGGAGTGAAAATGGCCTTACACCTCCCATGG 
TTCCACCCCAGAAATGTCTCCGGCTTCAGCATTGCAAACCGCTCATCTATAAAACGTCTGTGC 
ATCATCGAGTCTGTGTTTGAGGGTGAAGGCTCAGCAACCCTGATGTCTGCCAACTACCCAGAA 
GGCTTCCCTGAGGATGAGCTCATGACGTGGCAGTTTGTCGTTCCTGCACACCTGCGGGCCAGC 
GTCTCCTTCCTCAACTTCAACCTCTCCAACTGTGAGAGGAAGGAGGAGCGGGTTGAATACTAC 
ATCCCGGGCTCCACCACCAACCCCGAGGTGTTCAAGCTGGAGGACAAGCAGCCTGGGAACATG 
GCGGGGAACTTCAACCTCTCTCTGCAAGGCTGTGACCAAGATGCCCAAAGTCCAGGGATCCTC 
CGGCTGCAGTTCCAAGTTTTGGTCCAACATCCACAAAATGAAAGCAGTGAGl^CCCCACTT 
TCCTTTTTCTTCCTCCTCCAGCACCTTCGTTGTTTCCTGGGTAGTCTGCCTGGGTGAGGCTCC 
CTTCCTGTTTCTCATCTGTGGCTTCTGAAACACTTAGACTCTGGACCCAGCAAGAGTTTCAGG 
AAGTGGGTTGCTAGGCAGTTAGACAGGCTTGTTGGTGAACACCCGGTATGTAGTTCCATTTCA 
GCACAATAAAAAGAAATCTTGCATTCAAGATGCTAAATTGTTTTTAACGAAAA 
FIGURE 162 

MAGLNCGVSIALLGVLLLGAARLPRGAEAFEIALPRESNITVLIKLGTPTLLAKPCYIVISKR 
HITMLSIKSGERIVFTFSCQSPENHFVIEIQKNIDCMSGPCPFGEVQLQPSTSLLPTLNRTFI 
WDVKAHKSIGLELQFSIPRLRQIGPGESCPDGVTHSISGRIDATVVRIGTFCSNGTVSRIKMQ 
EGVKMALHLPWFHPRNVSGFSIANRSSIKRLCIIESVFEGEGSATLMSANYPEGFPEDELMTW 
QFVVPAHLRASVSFLNFNLSNCERKEERVEYYIPGSTTNPEVFKLEDKQPGNMAGNFNLSLQG 
CDQDAQSPGILRLQFQVLVQHPQNESSE 

Signal peptide: 

amino acids 1-29 

N-glycosylation sites. 

amino acids 39-43, 122-126, 180-184, 205-209, 213-217, 270-274, 
310-314, 339-343 

Tyrosine kinase phosphorylation site. 

amino acids 276-284 

N-myristoylation sites. 

amino acids 3-9, 7-13, 158-164, 175-181, 191-197, 303-309 
FIGURE 163 

CAACCACACACCTGGGGAATTGCTGGCCTGACTTCTGACCCCTGACTCCTCATACCCTTCCTC 

CAGAGCATGACATTTGACCACCAACTGAAACCTGACCTCTGACCCCAGACCACTGGCCCTTCC 

CCCGCCCTGTGGTGACTTCATAAAGGTTACTAGCTTCTCCCCTGGCCTTGAGACCCACACGAT 

GGCCCTGCTGGCTCTGGCCAGTGCCGTCCCCTCTGCCCTGCTGGCCCTGGCTGTCTTCAGGGT 

GCCCGCCTGGGCCTGTCTCCTCTGCTTCACAACCTACTCTGAGCGCCTCCGCATCTGCCAGAT 

GTTTGTTGGGATGCGGAGCCCCAAGCTTGAAGAGTGTGAGGAGGCCTTCACGGCCGCCTTCCA 

GGGCCTCTCTGACACCGAAATCAGTGAGGAGACCATCCACACTTCATCAGTGTCCTGGGGAAG 

GTGCAGAGGGAGGGCAGGAGAGGCCCAGAGGGTCAGGCTGAGGGACAGACAGAGAGAAACAGT 

CAGAGGAGAAAGGCTCAAAGACCATGAGAACAACAGAGACTTAGGGACAGAGAGACACAGACA 

GGGGAAGACAGCAGGGCAAAGACTCAGAGAGGGGAGGATGGAGAGTCAGAGAGGGGAAGATGG 

AGACTCAGAGAGAGGGGAGGATGGAGACTCAGAGAGAGAGGAAGATGGAGACTCAGAGGGAAA 

GATGGAGACTCAGGAGTATGGAGAGTCAGAGAGGGGAGGATGGACACTCAGGGGAGGATGGAG 

AGTCAGGAGGATGGAGACTCATAGAAAGGGGAGGATGGAGAGTCAGGAGAGGTTGGAGACTGG 

AGAGGGAATAGAGACCCAGAAAGGGGAGGATGGAGACTCAGAGGGTGGAAGATGGAGACTCAA 

AGAGGATGGAAACCCAGAGAGAGGAGGACAGAGATGAGGCAGAGACTAGGGGAAGCAGGATAG 

CGACTGGTCGGGGGCAGAGACTCAGGGAGGATAGAGACTCACAGAGAGGTGAGGATAGAGACT 

TGGGAGGGACTCAGGAAGCATAGCGACTGTGGGGCAAAGAGTCAGAGAGGGGAGGATACAGAC 

TTGGGAGGGCAGAGACTCAGAAACAGAATGTTCGCATTAGGGACATGGTGTTGCGGGGAGCTG 

CCTCCCCCAGCCCCTGCTCCCTCCCTCACCGCCAGACTATGATGAGAGAAGCCACCTGCATGA 

CACCTTCACCCAGATGACCCATGCCCTGCAGGAGCTGGCTGCTGCCCAGGGATCCTTTGAGGT 

TGCCTTCCCTGATGCTGCAGAGAAAATGAAGAAGGTCATTACACAGCTTAAAGAAGCCCAGGC 

TTGCATCCCTCCCTGCGGTCTCCAGGAGTTCGCCCGGCGTTTCCTCTGCAGCGGGTGCTACTC 

TAGGGTCTGCGACCTCCCGCTGGACTGCCCAGTTCAGGATGTGACAGTGACTCGGGGCGACCA 

GGCTATGTTTTCTTGCATCGTAAACTTCCAGCTGCCAAAGGAGGAGATCACCTATTCCTGGAA 

GTTCGCAGGAGGAGGTCTCCGGACTCAGGACTTGTCCTATTTCCGAGATATGCCGCGGGCCGA 

AGGATACCTGGCGCGGATCCGGCCGGCTCAGCTCACGCACCGCGGGACGTTCTCCTGCGTGAT 

CAAGCAAGACCAGCGCCCCCTGGCCCGGCTCTACTTCTTTCTTAACGTCCTCGGGGCCCTCGC 

ATCAGCGAGTGCGACAGTGTTGGCGTGGTGAGTTCTGGGGACTCCGGAGCCCCAGCATCTAGC 

TCCCCGCTGTCTCAGATCCCACCGAGAAGTCTGGGTTCCCAGCAACCTCCAACCCAGGAGGAT 

GTTCTTTCGATGGTACTGCAGTGGCAACTAACAAAGGTATCTTTCCTCCTTCCCTATCCTATT 

TCCATCCTGAAAATAAAGAATATATTTCAACTCTAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 
FIGURE 164 

MALLALASAVPSALLALAVFRVPAWACLLCFTTYSERLRICQMFVGMRSPKLEECEEAFTAAF 
QGLSDTEISEETIHTSSVSWGRCRGRAGEAQRVRLRDRQRETVRGERLKDHENNRDLGTERHR 
QGKTAGQRLREGRMESQRGEDGDSERGEDGDSEREEDGDSEGKMETQEYGESERGGWTLRGGW 
RVRRMETHRKGRMESQERLETGEGIETQKGEDGDSEGGRWRLKEDGNPERGGQR 

Signal peptide: 

amino acids 1-26 



N-myristoylation site. 

amino acids 65-71 
FIGURE 165 

CAGAATCGCAGATTGCCAGCCCTTTTCCCGACCCCTACGGAAAGACGAGTCCAGGGGCCGTCC 

TGGCGAGGTCAAAACATTTAGTCTGGTCTTTTCAGCGTGGACCCTGCCAGCAGCCAGGCCATG 

GAGCTCTCTGATGTCACCCTCATTGAGGGTGTGGGTAATGAGGTGATGGTGGTGGCAGGTGTG 

GTGGTGCTGATTCTAGCCTTGGTCCTAGCTTGGCTCTCTACCTACGTAGCAGACAGCGGTAGC 

AACCAGCTCCTGGGCGCTATTGTGTCAGCAGGCGACACATCCGTCCTCCACCTGGGGCATGTG 

GACCACCTGGTGGCAGGCCAAGGCAACCCCGAGCCAACTGAACTCCCCCATCCATCAGAGGGT 

AATGATGAGAAGGCTGAAGAGGCGGGTGAAGGTCGGGGAGACTCCACTGGGGAGGCTGGAGCT 

GGGGGTGGTGTTGAGCCCAGCCTTGAGCATCTCCTTGACATCCAAGGCCTGCCCAAAAGACAA 

GCAGGTGCAGGCAGCAGCAGTCCAGAGGCCCCCCTGAGATCTGAGGATAGCACCTGCCTCCCT 

CCCAGCCCTGGCCTCATCACTGTGCGGCTCAAATTCCTCAATGATACCGAGGAGCTGGCTGTG 

GCTAGGCCAGAGGATACCGTGGGTGCCCTGAAGAGCAAATACTTCCCTGGACAAGAAAGCCAG 

ATGAAACTGATCTACCAGGGCCGCCTGCTACAAGACCCAGCCCGCACACTGCGTTCTCTGAAC 

ATTACCGACAACTGTGTGATTCACTGCCACCGCTCACCCCCAGGGTCAGCTGTTCCAGGCCCC 

TCAGCCTCCTTGGCCCCCTCGGCCACTGAGCCACCCAGCCTTGGTGTCAATGTGGGCAGCCTC 

ATGGTGCCTGTCTTTGTGGTGCTGTTGGGTGTGGTCTGGTACTTCCGAATCAATTACCGCCAA 

TTCTTCACAGCACCTGCCACTGTCTCCCTGGTGGGAGTCACCGTCTTCTTCAGCTTCCTAGTA 

TTTGGGATGTATGGACGASaaGGACATAGGAAGAAAATGAAAGGCATGGTCTTTCTCCTTTAT 

GGCCTCCCCACTTTTCCTGGCCAGAGCTGGGCCCAAGGGCCGGGGAGGGAGGGGTGGAAAGGA 

TGTGATGGAAATCTCCTCCATAGGACACAGGAGGCAAGTATGCGGCCTCCCCTTCTCATCCAC 

AGGAGTACAGATGTCCCTCCCGTGCGAGCACAACTCAGGTAGAAATGAGGATGTCATCTTCCT 

TCACTTTTAGGGTCCTCTGAAGGAGTTCAAAGCTGCTGGCCAAGCTCAGTGGGGAGCCTGGGC 

TCTGAGATTCCCTCCCACCTGTGGTTCTGACTCTTCCCAGTGTCCTGCATGTCTGCCCCCAGC 

ACCCAGGGCTGCCTGCAAGGGCAGCTCAGCATGGCCCCAGCACAACTCCGTAGGGAGCCTGGA 

GTATCCTTCCATTTCTCAGCCAAATACTCATCTTTTGAGACTGAAATCACACTGGCGGGAATG 

AAGATTGTGCCAGCCTTCTCTTATGGGCACCTAGCCGCCTTCACCTTCTTCCTCTACCCCTTA 

GCAGGAATAGGGTGTCCTCCCTTCTTTCAAAGCACTTTGCTTGCATTTTATTTTATTTTTTTA 

AGAGTCCTTCATAGAGCTCAGTCAGGAAGGGGATGGGGCACCAAGCCAAGCCCCCAGCATTGG 

GAGCGGCCAGGCCACAGCTGCTGCTCCCGTAGTCCTCAGGCTGTAAGCAAGAGACAGCACTGG 

CCCTTGGCCAGCGTCCTACCCTGCCCAACTCCAAGGACTGGGTATGGATCGCTGGGCCCTAGG 

CTCTTGCTTCTGGGGCTATTGGAGGGTCAGTGTCTGTGACTGAATAAAGTTCCATTTTGTGGA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 166 

MELSDVTLIEGVGNEVMVVAGVVVLILALVLAWLSTYVADSGSNQLLGAIVSAGDTSVLHLGH 
VDHLVAGQGNPEPTELPHPSEGNDEKAEEAGEGRGDSTGEAGAGGGVEPSLEHLLDIQGLPKR 
QAGAGSSSPEAPLRSEDSTCLPPSPGLITVRLKFLNDTEELAVARPEDTVGALKSKYFPGQES 
QMKLIYQGRLLQDPARTLRSLNITDNCVIHCHRSPPGSAVPGPSASLAPSATEPPSLGVNVGS 
LMVPVFVVLLGVVWYFRINYRQFFTAPATVSLVGVTVFFSFLVFGMYGR 

Signal peptide: 

amino acids 1-36 

Transmembrane domains: 

amino acids 246-267, 275-301 

N-glycosylation sites. 

amino acids 162-166, 211-215 

N-myristoylation sites. 

amino acids 48-54, 105-111, 109-115, 129-135, 177-183, 247-253 



Cell attachment sequence. 

amino acids 97-100. 
FIGURE 167 

GGCGGCTGTGTGTCGCCGGAGCCGAAGCGCGCAGGCCCGTCCCGGTGGCCGGGGAGCGGGCGGGTGGGGGCGCCA 

TGTGGTTCATGTACCTGCTGAGCTGGCTGTCGCTCTTCATCCAGGTGGCCTTCATCACGCTGGCTGTCGCGGCTG 

GACTCTATTACCTGGCAGAACTGATAGAAGAATACACAGTGGCCACCAGCAGGATCATAAAATACATGATCTGGT 

TCTCCACCGCTGTACTGATTGGCCTCTACGTCTTTGAGCGCTTCCCCACCAGCATGATTGGAGTGGGCCTATTCA 

CCAACCTCGTCTACTTTGGCCTCCTCCAGACCTTCCCCTTCATCATGCTGACCTCGCCTAACTTCATCCTGTCGT 

GTGGACTAGTGGTGGTGAATCATTACCTAGCATTTCAGTTTTTTGCAGAAGAATATTATCCCTTCTCAGAGGTCC 

TGGCCTATTTCACTTTCTGCCTGTGGATAATTCCGTTTGCGTTTTTTGTGTCACTTTCGGCCGGGGAGAACGTCC 

TGCCCTCTACCATGCAGCCAGGAGATGATGTCGTCTCCAATTATTTCACCAAAGGCAAGCGGGGCAAACGCTTAG 

GGATCCTGGTTGTCTTCTCCTTCATCAAAGAGGCCATTCTACCCAGTCGTCAGAAGATATACTGACCCCCATGCA 

GGCAGGATGTGGGGGGCAAGATCAGGAGAGTCAGGCCCCTGGGCCTCTATGCCAGGTGGGGACCAGAAGTCGGGA 

AGGCACCTACCACCTGCCCTGGCTTTCTTCCCCTCAACTCTGGAGCCCCATCCCCACCCTCCTTGGGGGGCTCAG 

CTTGGCTCAGATCTGATGCTTCAAGAGGCTGTAACCTCAGAGGGCACCAAGGAGGGTGGCAGAGCCTGCTTAGCC 

AGGAGGCCGAGGTCCCTCAGTCCTCCCCTGTCCCTTCCAAGGTGGGTCAGGAGGTTCTGGCCCCGCTGGGGCAGG 

CAGGGCAGGGTCTGTGAAGCTTAAGAGCAGATGGTGACAAGTTCTCTGGGCAGGTGGCCATGGGGAGGGGCCATG 

GCTTGGCATGTCCAACAGAAATAGTTTTTGCTGTTGAACGGTGATTTCTGTCCAAGTGCAGATTTCCGTTTGAAT 

AAAGCTTCGCTTCTAGGTGGCACTGTTTGCCTTAATACCCTGACAGTTCATCTTCCTTTCTTCCTGCTAACCTTC 

TGCTCTGGACTGGACTCACTTTTCTGCTCCAGGGACTCCTTTTCTGGGTTTGGGTCTTGCCCTTCCCAAGGGACT 

GTTCTTGTGGCCCTTAATGGGAAGGGGGCAGGGGTGAGGAGCTGAGCCTGCTCAAGGAGTGGGAAGTGGGGCTAT 

AGGCAGCCTCTCTGATGCACTCTCTTCCATCTCTTTCCCCAAGGCTCCGTGACTGTCAAACTGGGAGTAGGAGAG 

GGGACAATTTAGGACTGGGCTAGATTTTCAGAAGAACATCTACAATATCCTATTTATAAATCTTCCTCTGGGAAA 

AGGAGTGGTTTCTGGCTGAATACTATCTTAGGCTCAAGGAGAAACAAAATAAAAATTAGCTTCCAGGCAGCCTGT 

TTTTAAAGAAATGGGACTAATGGGAGAAGCTGTTTGTCACTCTAAGAGCATCCAAGCCCTGGCCCGTCTGTGCAC 

TCTTGGCTCCTGGGGAGATATATCTGCCTTCTAAGAAGGCAGGCCAGGTCTTGGGCACAGACCTGCATTTGTTGA 

CCTTGCACTCCAACTATAGTGCCTTGCAAGTGCTCAACAGTACATATTGGAATGAAGTCCCTATGAGAGCCATTT 

CTGGCCATGTTCTATACCTCAAAGTGAGGCTGGCAGGTACAGAGATGAACTGTACACATGTGATACATTTAAGCC 

ACTGGAAAAACCCCTGTGCTTGAAAATATTTCCTCTATATCATGCCTGGAGTTCCATCATAGCCCTTCATTTCCT 

TGGCTTTAGCATTTACCTTCTCTTAAGAATACCAGCTTTCCCCTTTCCCTGAGAGGAAGAGCACATGTTGGTCTC 

CTCTTAGTGTGAACGAGATTGCCAGGCCCTTTTCTCCTATGCACACCAGGATAGACAAGGCAGGGGATACTGGCA 

GCCTGCATCATCCTCCCATTGGGCTGACAGCTGGCCCTACTTTCCTCCCTCTGCTGCTTGGTCCCTCACCTTGAT 

GATGTGGCTTCGCCCCCTCCACTCTACTGCCAGTGTTCTCCCAGGGGTTGCTAAATCCAGCAGACCCCTTTCCTG 

TCTTACTAGATCTGGGCAGCATTTGACATGGCTGATCACCCCTTGCTTCTTGGATGGCACTTCCCTGGCACCTCT 

GTGGCTAGTTGTCCTACCTCCCTGGCTGTTCCTTTCAGGCTTCCGTGCAGGCTTCTCCACTTGCCCATGCACAGT 

AGGGTCTTTCAGGGTTCTGCTGTGGGCTCCCTAGGGAAGCCCATCCATCTGGATGGTTTCAAGGATGGTGAGGAA 

TTTAGAGTTGACCTCCAGCCCCAACATCCTTCCTGATCACCTGAACCACAGTTTTGCTGCCCTCTAGGTGCACAG 

ACAATTCAGGTCCATGGCCCAGATGGTACTTGCTGTCTTCTGCAAACCTGCCCCTTCTGGGTACTTCCCTTGACC 

CCGAGATCACTCAGGAGCCAGACAGGAAACTTATTCTATTCCTGTTTTCTCTTTCTGCCCACCACATCCTU^TCTC 

TCAAAACGGTCAGGTCTACCTTAACATCTCTTGATTTGAGCCACTCCCACTGTCATCAGCTTTCACCTGGATTAT 

CGTGACAGCCTCCTACTGCTTCTCTATCATGTGGCCAGAGCTATCTTCCTAAAATGCATTGCATAGTTGATCAAG 

TCACTCTCTGGCCTAAAACCTTCCTTGGCTCCCTGCTGCCCTCAGGATAAAGTCTGGACCCCTCAGCATGGCTTG 

TGAGACTCATGGTGTCCTTGTCCCTGCTCACCTCTCTGGTCTCATCACTTGCCTTCTTGCATTCTGGGTCCCAGC 

CTCCTGTATCCAGAGATGCAGTGGCTCTCCATTGCCACTCTGATTCCTCCTTTCTTTTGGTCACAGAGAAAGGGT 

ACTTTCTCTGTCAAATCTCAACTTAGACTTGACTTCCTCCAAGGAGCTTTGGCTATACTCTCTCCTCCCGACCCC 

CACCCTGGCATACTACACAGATCACTCTGGGCTCACTTGCCTGCCTAATGGTCATCTCCCCAGTAGACTGTAAGC 

TCCTTGAGGGCAAGGATTGTGTTGGAATTTTTGTATTAACAGTGCCTGGCTTGGTGCCTGGCACCTAGAAAGCAC 
TCAATAAATGTTTGTTTAATGAA 
FIGURE 168 

MWFMYLLSWLSLFIQVAFITLAVAAGLYYLAELIEEYTVATSRIIKYMIWFSTAVLIGLYVFE 
RFPTSMIGVGLFTNLVYFGLLQTFPFIMLTSPNFILSCGLVVVNHYLAFQFFAEEYYPFSEVL 
AYFTFCLWIIPFAFFVSLSAGENVLPSTMQPGDDVVSNYFTKGKRGKRLGILVVFSFIKEAIL 
PSRQKIY 

Signal peptide: 

amino acids 1-25 

Transmembrane domain: 

amino acids 126-146 

Casein kinase II phosphorylation site. 

amino acids 145-148 

N-myristoylation sites. 

amino acids 73-78, 82-87 

Amidation sites. 

amino acids 168-171, 171-174 



Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 91-101 
FIGURE 169 

CAAAGCCCTACCCTCACCATTCACCAGGTCCTGTGGGAAGAGCAGCGTGGAGGTGGGCTGAGG 

TTAGAAGGTGCAGAGCGTGGAAGAAGATTGTGAGCTGAGTATTGGACATCTGTTCTTGAATAG 

TCCCTGGGCCTGCCATAGGAAAGGAAGTTCTCCAGGGTTACAGTTCTTATCCGCGTGAATACA 

CATGGCTCTGTTACGAAAAATTAATCAGGTGCTGCTGTTCCTTCTGATCGTGACCCTCTGTGT 

GATTCTGTATAAGAAAGTTCATAAGGGGACTGTGCCCAAGAATGACGCAGATGATGAATCCGA 

GACTCCTGAAGAACTGGAAGAAGAGATTCCTGTGGTGATTTGTGCTGCAGCAGGGAGGATGGG 

TGCCACTATGGCTGCCATCAATAGCATCTACAGCAACACTGACGCCAACATCTTGTTCTATGT 

AGTGGGACTCCGGAATACTCTGACTCGAATACGAAAATGGATTGAACATTCCAAACTGAGAGA 

AATAAACTTTAAAATCGTGGAATTCAACCCGATGGTCCTCAAAGGGAAGATCAGACCAGACTC 

ATCGAGGCCTGAATTGCTCCAGCCTCTGAACTTTGTTCGATTTTATCTCCCTCTACTTATCCA 

CCAACACGAGAAAGTCATCTATTTGGACGATGATGTAATTGTACAAGGTGATATCCAAGAACT 

GTATGACACCACCTTGGCCCTGGGCCACGCGGCGGCTTTCTCAGATGACTGCGATTTGCCCTC 

TGCTCAGGACATAAACAGACTCGTGGGACTTCAGAACACATATATGGGCTATCTGGACTACCG 

GAAGAAGGCCATCAAGGACCTTGGCATCAGCCCCAGCACCTGCTCTTTCAATCCTGGTGTGAT 

TGTTGCCAACATGACAGAATGGAAGCACCAGGGCATCACCAAGCAATTGGAGAAATGGATGCA 

AAAGAATGTGGAGGAAAACCTCTATAGCAGCTCCCTGGGAGGAGGGGTGGCCACCTCCCCAAT 

GCTGATTGTGTTTCATGGGAAATATTCCACAATTAACCCCCTGTGGCACATAAGGCACCTGGG 

CTGGAATCCAGATGCCAGATATTCGGAGCATTTTCTGCAGGAAGCTAAATTACTCCACTGGAA 

TGGAAGACATAAACCTTGGGACTTCCCTAGTGTTCACAACGACTTATGGGAAAGCTGGTTTGT 

TCCTGACCCTGCAGGGATATTTAAACTCAATCACCATAGCEgBsTATAACTCTACCCTTAAAAT 

ATTCCCTGTATAGAAATGTGGAATTGTCCCTTTGTAGCCAACTATAACATTGTTCTTTATGAA 

TATTACCTTTGATACATATGATCCACAATATAAAAACCAAAAACTACTGTGTGCAAATTATAC 

CTTGGACCATATAGGCATTGATTAACTTCTTTAAGTACATGTGATAACTATGGAAATCAAGAT 

TATGTGACTGAAAAACATAAAGGAAGAGACCCATCTAGATAACAGCAATCAACCTGCTTAATT 

CTGAATGACAATTATATCCACAAATTTTTAAAACTTCTACATGTATTTTTCACATGAAGATCT 

CCTTAACAGGTTGCCAACCTTTTCTTTTATAAAACTATTACATTTAAAATATGGACGTCTGAA 
AAATAAAATATTCATCATTTTTAAAA 
FIGURE 170 



MALLRKINQVLLFLLIVTLCVILYKKVHKGTVPKNDADDESETPEELEEEIPVVICAAAGRMG 
ATMAAINSIYSNTDANILFYVVGLRNTLTRIRKWIEHSKLREINFKIVEFNPMVLKGKIRPDS 
SRPELLQPLNFVRFYLPLLIHQHEKVIYLDDDVIVQGDIQELYDTTLALGHAAAFSDDCDLPS 
AQDINRLVGLQNTYMGYLDYRKKAIKDLGISPSTCSFNPGVIVANMTEWKHQGITKQLEKWMQ 
KNVEENLYSSSLGGGVATSPMLIVFHGKYSTINPLWHIRHLGWNPDARYSEHFLQEAKLLHWN 
GRHKPWDFPSVHNDLWESWFVPDPAGIFKLNHHS 

Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 234-238 

Tyrosine kinase phosphorylation site. 

amino acids 253-261 

N-myristoylation sites. 

amino acids 63-69, 86-92, 198-204, 218-224, 229-235, 265-271, 
266-272 
FIGURE 171 

GCCAGAGGCTGCAGCTGGAGCCCAGAGCCCAAGATGGAGCCCCAGCTGGGGCCTGAGGCTGCC 
GCCCTCCGCCCTGGCTGGCTGGCCCTGCTGCTGTGGGTCTCAGCCCTGAGCTGTTCTTTCTCC 
TTGCCAGCTTCTTCCCTTTCTTCTCTGGTGCCCCAAGTCAGAACCAGCTACAATTTTGGAAGG 
ACTTTCCTCGGTCTTGATAAATGCAATGCCTGCATCGGGACATCTATTTGCAAGAAGTTCTTT 
AAAGAAGAAATAAGATCTGACAACTGGCTGGCTTCCCACCTTGGACTGCCTCCCGATTCCTTG 
CTTTCTTATCCTGCAAATTACTCAGATGATTCCAAAATCTGGCGCCCTGTGGAGATCTTTAGA 
CTGGTCAGCAAATATCAAAACGAGATCTCAGACAGGAGAATCTGTGCCTCTGCATCAGCCCCA 
AAGACCTGCAGCATTGAGCGTGTCCTGCGGAAAACAGAGAGGTTCCAGAAATGGCTGCAGGCC 
AAGCGCCTCACGCCGGACCTGGTGCAGGACTGTCACCAGGGCCAGAGAGAACTAAAGTTCCTG 
TGTATGCTGAGATAACACCAGTGAAAAAGCCTGGCATGGAGCCCAGCACTGAGAACTTCCAGA 
AAGTGTTAGCCTTCTCCCAACTGTGTTATACCAACCACATTTTCAAATAGTAATCATTAAAGA 
GGCTTCTGCATCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 172 



MEPQLGPEAAALRPGWLALLLWVSALSCSFSLPASSLSSLVPQVRTSYNFGRTFLGLDKCNAC 
IGTSICKKFFKEEIRSDNWLASHLGLPPDSLLSYPANYSDDSKIWRPVEIFRLVSKYQNEISD 
RRICASASAPKTCSIERVLRKTERFQKWLQAKRLTPDLVQDCHQGQRELKFLCMLR 

Signal peptide: 

amino acids 1-28 

N-glycosylation site. 

amino acids 100-103 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 158-161 

N-myristoylation sites. 

amino acids 56-61, 65-70 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 18-28 

Prenyl group binding site (CAAX box). 

amino acids 179-182 

Leucine zipper pattern. 

amino acids 5-26 
FIGURE 173 

GCTGGACTGCTCGCTGGCCGGCAGCGCACCGTTTTGAAGGTCCTAGCCCACCTGGGCTGGCTC 

ACGCGCACGACTAGCCGCTCCCATACAGCACGCCCGGACTCTGTCGTCGCTTAAGGCCACTCC 

TATTCTACGGCTGACCCCTGGTGGTCACGTGGATCTGTTCGCCACGCAAGTCTGGGTCCTTCG 

GCGATTGACCGGGGTCCTTGCTGTTCGGGAGCCTCTCCTAAGCTGCCTGTTCGGGCGAGAGTT 

TGGAGGGGCGGGTTTGGGGTCGGTGTCTGATTGGGGCTCGCACCGCAGCACGCTGGAGTCCCG 

CTTAGGTACCAGTTAGCGTCAGGGGAGCTGGGTCAGGCGGTCGCCGGGACACCCCGTGTGTGG 

CAGGCGGCGAAGCGCTCTGGAGAATCCCGGACAGCCCTGCTCCCTGCAGCCAGGTGTAGTTTC 

GGGAGCCACTGGGGCCAAAGTGAGAGTCCAGCGGTCTTCCAGCGCTTGGGCCACGGCGGCGGC 

CCTGGGAGCAGAGGTGGAGCGACCCCATTACGCTAAAGATGAAAGGCTGGGGTTGGCTGGCCC 

TGCTTCTGGGGGCCCTGCTGGGAACCGCCTGGGCTCGGAGGAGCCAGGATCTCCACTGTGGAG 

CATGCAGGGCTCTGGTGGATGAACTAGAATGGGAAATTGCCCAGGTGGACCCCAAGAAGACCA 

TTCAGATGGGATCTTTCCGGATCAATCCAGATGGCAGCCAGTCAGTGGTGGAGGTGCCTTATG 

CCCGCTCAGAGGCCCACCTCACAGAGCTGCTGGAGGAGATATGTGACCGGATGAAGGAGTATG 

GGGAACAGATTGATCCTTCCACCCATCGCAAGAACTACGTACGTGTAGTGGGCCGGAATGGAG 

AATCCAGTGAACTGGACCTACAAGGCATCCGAATCGACTCAGATATTAGCGGCACCCTCAAGT 

TTGCGTGTGAGAGCATTGTGGAGGAATACGAGGATGAACTCATTGAATTCTTTTCCCGAGAGG 

CTGACAATGTTAAAGACAAACTTTGCAGTAAGCGAACAGATCTTTGTGACCATGCCCTGCACA 

TATCGCATGATGAGCTATGAACCACTGGAGCAGCCCACACTGGCTTGATGGATCACCCCCAGG

AGGGGAAAATGGTGGCAATGCCTTTTATATATTATGTTTTTACTGAAATTAACTGAAAAAATA 
TGAAACCAAAAGT 
FIGURE 174

MKGWGWLALLLGALLGTAWARRSQDLHCGACRALVDELEWEIAQVDPKKTIQMGSFRINPDGS 
QSVVEVPYARSEAHLTELLEEICDRMKEYGEQIDPSTHRKNYVRVVGRNGESSELDLQGIRID
SDISGTLKFACESIVEEYEDELIEFFSREADNVKDKLCSKRTDLCDHALHISHDEL 

Signal peptide: 

amino acids 1-20 

N-myristoylation sites. 

amino acids 12-18, 16-22, 29-35 



Endoplasmic reticulum targeting sequence. 

amino acids 179-184 
FIGURE 175

CGCAGCGCGGCAGTCCTGATGGCCCGGCATGGGTTACCGCTGCTGCCCCTGCTGTCGCTCCTG 

GTCGGCGCGTGGCTCAAGCTAGGAAATGGACAGGCTACTAGCATGGTCCAACTGCAGGGTGGG 

AGATTCCTGATGGGAACAAATTCTCCAGACAGCAGAGATGGTGAAGGGCCTGTGCGGGAGGCG 

ACAGTGAAACCCTTTGCCATCGACATATTTCCTGTCACCAACAAAGATTTCAGGGATTTTGTC 

AGGGAGAAAAAGTATCGGACAGAAGCTGAGATGTTTGGATGGAGCTTTGTCTTTGAGGACTTT 

GTCTCTGATGAGCTGAGAAACAAAGCCACCCAGCCAATGAAGTCTGTACTCTGGTGGCTTCCA 

GTGGAAAAGGCATTTTGGAGGCAGCCTGCAGGTCCTGGCTCTGGCATCCGAGAGAGACTGGAG 

CACCCAGTGTTACACGTGAGCTGGAATGACGCCCGTGCCTACTGTGCTTGGCGGGGAAAACGA 

CTGCCCACGGAGGAAGAGTGGGAGTTTGCCGCCCGAGGGGGCTTGAAGGGTCAAGTTTACCCA 

TGGGGGAACTGGTTCCAGCCAAACCGCACCAACCTGTGGCAGGGAAAGTTCCCCAAGGGAGAC 

AAAGCTGAGGATGGCTTCCATGGAGTCTCCCCAGTGAATGCTTTCCCCGCCCAGAACAACTAC

GGGCTCTATGACCTCCTGGGGAACGTGTGGGAGTGGACAGCATCACCGTACCAGGCTGCTGAG 

CAGGACATGCGCGTCCTCCGGGGGGCATCCTGGATCGACACAGCTGATGGCTCTGCCAATCAC 

CGGGCCCGGGTCACCACCAGGATGGGCAACACTCCAGATTCAGCCTCAGACAACCTCGGTTTC 

CGCTGTGCTGCAGACGCAGGCCGGCCGCCAGGGGAGCTGaSaaGCAGCCGGGTGGTGACAAGGA 

GAAAAGCCTTCTAGGGTCACTGTCATTCCCTGGCCATGTTGCAAACAGCGCAATTCCAAGCTC 

GAGAGCTTCAGCCTCAGGAAAGAACTTCCCCTTCCCTGTCTCCCATCCCTCTGTGGCAGGCGC 

CTCTCACCAGGGCAGGAGAGGACTCAGCCTCCTGTGTTTTGGAGAAGGGGCCCAATGTGTGTT 

GACGATGGCTGGGGGCCAGGTGTTTCTGTTAGAGGCCAAGTATTATTGACACAGGATTGCAAA 

CACACAAACAGTTGGAACAGAGCACTCTGAAAGGCCATTTTTTAAGCATTTTAAAATCTATTC 

TCTCCCCCTTTCTCCCTGGATGATTCAGGAAGCTGACATTGTTTCCTCAAGGCAGAATTTTCC 

TGGTTCTGTTTTCTCAGCCAGTTGCTGTGGAAGGAGAATGCTTTCTTTGTGGCCTCATCTGTG 

GTTTCGTGTCCCTCTGAAGGAAACTAGTTTCCACTGTGTAACAGGCAGACATGTAACTATTTA 

AAGCACAGTTCAGTCCTAAAAGGGTCTGGGAGAACCAGATGATGTACTAGGTGAAGCATTGCA 

TTGTGGGAATCACAAAGCAAATAGTACTCCAGAAAGACAAATATCAGAAGCTTCCTATTCTTT 

TTTTTTTTTTTTTTTTTTTTTTGAGACAGGGTCTTTCTCTGTTGCCCAGGCTAGAGTGCACTG 

GTGATCACGGCTCACTCTAGCCTTGAATTCCTGGGCCCAAGCAATTCTCCCACCTCAGCCTCC 

TGAGTAGCTGGGACTACAAGTGTGCACCACCATGCCTGGCTAATTTTTTGAATTTTTGTAGTG 

ATGGGATCTCGCTCTGTTGCCCAGGGTGGTCTCGAACTCCTGGCCTCAAGCGATCCTCCCACC 

TCGACCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACCTCGCCTGGGCCCCCTTCTCCATA 

TGCCTCCAAAAACATGTCCCTGGAGAGTAGCCTGCTCCCACACTGTCACTGGATGTCATGGGG 

CCAATAAAATCTCCTGCAATTGTGTATCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAA 
FIGURE 176

MARHGLPLLPLLSLLVGAWLKLGNGQATSMVQLQGGRFLMGTNSPDSRDGEGPVREATVKPFA 
IDIFPVTNKDFRDFVREKKYRTEAEMFGWSFVFEDFVSDELRNKATQPMKSVLWWLPVEKAFW
RQPAGPGSGIRERLEHPVLHVSWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGNWFQ 
PNRTNLWQGKFPKGDKAEDGFHGVSPVNAFPAQNNYGLYDLLGNVWEWTASPYQAAEQDMRVL 
RGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPGEL 

Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 191-195 

N-myristoylation sites.

amino acids 23-29, 25-31, 175-181 



Amidation site.

amino acids 159-163 
FIGURE 177



GCCTTCTCGCGCCTGACCATGCACCCCTGCATCTTCCTGCTGGGCCACAGGCGAGCGCTTTAT 

TTCTGGAGCTGAGGGCTAAAACTTTTTTGACTTTTCTTCTCCTCAACATCTGAATCATGCCAT

GTGCCCAGAGGAGCTGGCTTGCAAACCTTTCCGTGGTGGCTCAGCTCCTTAACTTTGGGGCGC 

TTTGCTATGGGAGACAGCCTCAGCCAGGCCCGGTTCGCTTCCCGGACAGGAGGCAAGAGCATT 

TTATCAAGGGCCTGCCAGAATACCACGTGGTGGGTCCAGTCCGAGTAGATGCCAGTGGGCATT 

TTTTGTCATATGGCTTGCACTATCCCATCACGAGCAGCAGGAGGAAGAGAGATTTGGATGGCT 

CAGAGGACTGGGTGTACTACAGAATTTCTCACGAGGAGAAGGACCTGTTTTTTAACTTGACGG 

TCAATCAAGGATTTCTTTCCAATAGCTACATCATGGAGAAGAGATATGGGAACCTCTCCCATG 

TTAAGATGATGGCTTCCTCTGCCCCCCTCTGCCATCTCAGTGGCACGGTTCTACAGCAGGGCA 

CCAGAGTTGGGACGGCAGCCCTCAGTGCCTGCCATGGACTGACTGGATTTTTCCAACTACCAC 

ATGGAGACTTTTTCATTGAACCCGTGAAGAAGCATCCACTGGTTGAGGGAGGGTACCACCCGC 

ACATCGTTTACAGGAGGCAGAAAGTTCCAGAAACCAAGGAGCCAACCTGTGGATTAAAGGGTA 

TTGTGACTCACATGTCCTCCTGGGTTGAAGAATCTGTTTTGTTCTTTTGG^^TTTTATTAAA 

ACATGACCTATTCTTACTCAAGTCTCTTATCTCCTCTGTATTCTTTTTTTTTTAATATCTTCA 

TGACATTCAAATCTCTTCTGTATTCTCTTGCCAGAAAGTGTACATTCTTTTTGCTTGTATAAA 
CCCTTTCACTTGTC 
FIGURE 178

MPCAQRSWLANLSVVAQLLNFGALCYGRQPQPGPVRFPDRRQEHFIKGLPEYHVVGPVRVDAS
GHFLSYGLHYPITSSRRKRDLDGSEDWVYYRISHEEKDLFFNLTVNQGFLSNSYIMEKRYGNL 
SHVKMMASSAPLCHLSGTVLQQGTRVGTAALSACHGLTGFFQLPHGDFFIEPVKKHPLVEGGY 
HPHIVYRRQKVPETKEPTCGLKGIVTHMSSWVEESVLFFW 

Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 11-15, 105-109, 125-129 



N-myristoylation site. 

amino acids 149-155 



FIGURE 179



CAGATTTAAAAAGAAAACCTTTACTGAATCAGCTGAGTGTTAATAATACGAATTTCCTTTTCT 

TGCCAATTCTGATCTGAACAGAAAATCCAAGAACAGGGATATGTGTGGATTACAGTTTTCTCT

GCCTTGCCTACGACTGTTTCTGGTTGTTACCTGTTATCTTTTATTATTACTCCACAAAGAAAT 

ACTTGGATGTTCGTCTGTTTGTCAGCTCTGCACTGGGAGACAAATTAACTGCCGTAACTTAGG 

CCTTTCGAGTATTCCTAAGAATTTTCCTGAAAGTACAGTTTTTCTGTATCTGACTGGGAATAA 

TATATCTTATATAAATGAAAGTGAATTAACAGGACTTCATTCTCTTGTAGCATTGTATTTGGA 

TAATTCTAACATTCTGTATGTATATCCAAAAGCCTTTGTTCAATTGAGGCATCTATATTTTCT 

ATTTCTAAATAATAATTTCATCAAACGCTTAGATCCTGGAATATTTAAGGGACTTTTAAATCT 

TCGTAATTTATATTTACAGTATAATCAGGTATCTTTTGTTCCGAGAGGAGTATTTAATGATCT 

AGTTTCAGTTCAGTACTTAAATCTACAAAGGAATCGCCTCACTGTCCTTGGGAGTGGTACCTT 

TGTTGGTATGGTTGCTCTTCGGATACTTGATTTATCAAACAATAACATTTTGAGGATATCAGA 

ATCAGGCTTTCAACATCTTGAAAACCTTGCTTGTTTGTATTTAGGAAGTAATAATTTAACAAA 

AGTACCATCAAATGCCTTTGAAGTACTTAAAAGTCTTAGAAGACTTTCTTTGTCTCATAATCC 

TATTGAAGCAATACAGCCCTTTGCATTTAAAGGACTTGCCAATCTGGAATACCTCCTCCTGAA 

AAATTCAAGAATTAGGAATGTTACTAGGGATGGGTTTAGTGGAATTAATAATCTTAAACATTT 

GATCTTAAGTCATAATGATTTAGAGAATTTAAATTCTGACACATTCAGTTTGTTAAAGAATTT 

AATTTACCTTAAGTTAGATAGAAACAGAATAATTAGCATTGATAATGATACATTTGAAAATAT 

GGGAGCATCTTTGAAGATCCTTAATCTGTCATTTAATAATCTTACAGCCTTGCATCCAAGGGT 

CCTTAAGCCGTTGTCTTCATTGATTCATCTTCAGGCAAATTCTAATCCTTGGGAATGTAACTG 

CAAACTTTTGGGCCTTCGAGACTGGCTAGCATCTTCAGCCATTACTCTAAACATCTATTGTCA 

GAATCCCCCATCCATGCGTGGCAGAGCATTACGTTATATTAACATTACAAATTGTGTTACATC 

TTCAATAAATGTATCCAGAGCTTGGGCTGTTGTAAAATCTCCTCATATTCATCACAAGACTAC 

TGCGCTAATGATGGCCTGGCATAAAGTAACCACAAATGGCAGTCCTCTGGAAAATACTGAGAC 

TGAGAACATTACTTTCTGGGAACGAATTCCTACTTCACCTGCTGGTAGATTTTTTCAAGAGAA 

TGCCTTTGGTAATCCATTAGAGACTACAGCAGTGTTACCTGTGCAAATACAACTTACTACTTC 

TGTTACCTTGAACTTGGAAAAAAACAGTGCTCTACCGAATGATGCTGCTTCAATGTCAGGGAA 

AACATCTCTAATTTGTACACAAGAAGTTGAGAAGTTGAATGAGGCTTTTGACATTTTGCTAGC 

TTTTTTCATCTTAGCTTGTGTTTTAATCATTTTTTTGATCTACAAAGTTGTTCAGTTTAAACA 

AAAACTAAAGGCATCAGAAAACTCAAGGGAAAATAGACTTGAATACTACAGCTTTTATCAGTC 

AGCAAGGTATAATGTAACTGCCTCAATTTGTAACACTTCCCCAAATTCTCTAGAAAGTCCTGG 

CTTGGAGCAGATTCGACTTCATAAACAAATTGTTCCTGAAAATGAGGCACAGGTCATTCTTTT 

TGAACATTCTGCTTTATAACTCAACTAAATATTGTCTATAAGAAACTTCAGTGCCATGGACAT 

GATTTAAACTGAAACCTCCTTATATAATTATATACTTTAGTTGGAAATATAATGAATTATATG 

AGGTTAGCATTATTAAAATATGTTTTTTNTTAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 180

MCGLQFSLPCLRLFLVVTCYLLLLLHKEILGCSSVCQLCTGRQINCRNLGLSSIPKNFPESTV
FLYLTGNNISYINESELTGLHSLVALYLDNSNILYVYPKAFVQLRHLYFLFLNNNFIKRLDPG
IFKGLLNLRNLYLQYNQVSFVPRGVFNDLVSVQYLNLQRNRLTVLGSGTFVGMVALRILDLSN 
NNILRISESGFQHLENLACLYLGSNNLTKVPSNAFEVLKSLRRLSLSHNPIEAIQPFAFKGLA 
NLEYLLLKNSRIRNVTRDGFSGINNLKHLILSHNDLENLNSDTFSLLKNLIYLKLDRNRIISI 
DNDTFENMGASLKILNLSFNNLTALHPRVLKPLSSLIHLQANSNPWECNCKLLGLRDWLASSA 
ITLNIYCQNPPSMRGRALRYINITNCVTSSINVSRAWAVVKSPHIHHKTTALMMAWHKVTTNG 
SPLENTETENITFWERIPTSPAGRFFQENAFGNPLETTAVLPVQIQLTTSVTLNLEKNSALPN 
DAASMSGKTSLICTQEVEKLNEAFDILLAFFILACVLIIFLIYKVVQFKQKLKASENSRENRL
EYYSFYQSARYNVTASICNTSPNSLESPGLEQIRLHKQIVPENEAQVILFEHSAL 

Signal peptide: 

amino acids 1-41 

Transmembrane domain: 

amino acids 530-547 

N-glycosylation sites. 

amino acids 71-75, 76-80, 215-219, 266-270, 317-321, 331-335, 
336-340, 400-404, 410-414, 451-455, 579-583 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 231-235

N-myristoylation sites.

amino acids 3-9, 69-75, 126-132, 174-180

ATP/GTP-binding site motif A (P-loop).

amino acids 506-514 
FIGURE 181

GGCCTGGCGCGGCGCTCCGGTAAGGCGTGTGTGCGGCAGGGCGGGGACAGAACCGTCCTCTCG 

GGCTCTGGGCGTGTCCGAGACCGCGCTCCCCGCCGAAATCAAGCTCCGAGTCATCCGTGTGGG 

GCATTCGTCCCCCCTGGCACAGTTGGCCTCTTTCCAGAAGCCCGTTTTGTTTGTTTTACGTCT 

AAATTCGCGTCGGTTCTTATTTCTCTCCCTGGCAAGGTCTGAAGACGGGTAGGAGAATAACCT 

GTGTCAGCGTGTTATGATGCCGTCCCGTACCAACCTGGCTACTGGAATCCCCAGTAGTAAAGT

GAAATATTCAAGGCTCTCCAGCACAGACGATGGCTACATTGACCTTCAGTTTAAGAAAACCCC 

TCCTAAGATCCCTTATAAGGCCATCGCACTTGCCACTGTGCTGTTTTTGATTGGCGCCTTTCT 

CATTATTATAGGCTCCCTCCTGCTGTCAGGCTACATCAGCAAAGGGGGGGCAGACCGGGCCGT 

TCCAGTGCTGATCATTGGCATTCTGGTGTTCCTACCCGGATTTTACCACCTGCGCATCGCTTA 

CTATGCATCCAAAGGCTACCGTGGTTACTCCTATGATGACATTCCAGACTTTGATGACTAGCA 

CCCACCCCATAGCTGAGGAGGAGTCACAGTGGAACTGTCCCAGCTTTAAGATATCTAGCAGAA 

ACTATAGCTGAGGACTAAGGAATTCTGCAGCTTGCAGATGTTTAAGAAAATAATGGCCAGATT 

TTTTGGGTCCTTCCCAAAGATGTTAAGTGAACCTACAGTTAGCTAATTAGGACAAGCTCTATT 

TTTCATCCCTGGGCCCTGACAAGTTTTTCCACAGGAATATGTATCATGGAAGAATAGAGGTTA 

TTCTGTAATGGAAAAGTGTTGCCTGCCACCACCCTCTGTAGAGCTGAGCATTTCTTTTAAATA 

GTCTTCATTGCCAATTTGTTCTTGTAGCAAATGGAACAATGTGGTATGGCTAATTTCTTATTA 

TTAAGTAGTTTATTTTAAAAATATCTGAGTATATTATCCTGTACACTTATCCCTACCTTCATG 

TTCCAGTGGAAGACCTTAGTAAAATCAAAGATCAGTGAGTTCATCTGTAATATTTTTTTTACT 

TGCTTTCTTACTGACAGCAACCAGGAATTTTTTTATCCTGCAGAGCAAGTTTTCAAAATGTAA 

ATACTTCCTCTGTTTAACAGTCCTTGGACCATTCTGATCCAGTTCACCAGTAGGTTGGACAGC 

ATATAATTTGCATCATTTTGTCCCTTGTAAATCAAGATGTTCTGCAGATTATTCCTTTAACGG 

CCGGACTTTTGGCTGTTTCCTAATGAAACATGTAGTGGTTATTATTTAGAGTTTATAGCCGTA 

TTGCTAGCACCTTGTAGTATGTCATCATTCTGCTCATGATTCCAAGGATCAGCCTGGATGCCT 

AGAGGACTAGATCACCTTAGTTTGATTCTATTTTTTAGCTTGCAAAAAGTGACTTATATTCCA 

AAGAAATTAAAATGTTGAAATCCAAATCCTAGAAATAAAATGAGTTTNNTTCCAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 182

MMPSRTNLATGIPSSKVKYSRLSSTDDGYIDLQFKKTPPKIPYKAIALATVLFLIGAFLIIIG 
SLLLSGYISKGGADRAVPVLIIGILVFLPGFYHLRIAYYASKGYRGYSYDDIPDFDD 

Transmembrane domains: 

amino acids 45-66, 79-95 



N-myristoylation sites. 

amino acids 11-17, 75-81 
FIGURE 183



^^^^^^ ^^^^'^^^'^'^'^^^^^^'^^^^^^CA^TGAGCCAAGATTAAGTCACTGCACTCCAGCCTGGGTGACAGA 
^™^^SJ^'^'^'^^'^'^^^^'^^T^GTACAACTCTGGATGGGCATGGTGGCTTA^^ 

fr^^SS*=^^^'^^^^°'^^^TT*^CCAGTCTCATAACTCGGTCTCAAAATAAATAA^^ 

aaaatttaaaaaataaagtccaactcagcggttttcagcatatttacagagttgSaat^^^ 

^^I^^^'^^^^^^^'^^^C^^^CCAAAAGAAACCTAACCCATTGACTATCTCTCCA^^ 

^^^^^^^^^'''^^'^''^'''^"■^^•^■^^tctctatagatttgcctattttggacact^^ 

^^S^^^^'^^^^^^'^'^'^'^TGTCCGGCTTCTTTGATTAATAGAATGTTTTCAAGGCTCMC^^^ 



:CTAG 
\TCAT 

ATCAGCACTTCATTCCTTTCTATGGCTGAATA^TA^^^^^^ 

ggacatttgggttgtttccaccttctggctattataaatattgctgctataaaSttcact^^ 



^^'^^^'^^^^'^^^'^'^^^^■^^c^'^^^tgtagtccatgagttgaaataaacacttgacccatagtaagtgc 
cagatcatcttcatttcacagcaaccagtaatttcacagatgaggaaatgaaggctccSagg^^ 



TGI 



r^S^r^J^^^^^''^''^^<'^°^^^T^^^C'=GGGACGCCTG5?Hc^^^^ 



•GCTGGTAC 



SS^^oI^'^*^^°*^*=^^^T'^^T^CCCGAGATGCACTCAATAAGCTCATTAATGGAGGAAACACTCCG 

tgccccatcctccgccagcagacgtcagattccaaccctcccatcaagctgaagtgtStactg^ 
cagaacccggcagatgggaaacgcatcatattctgattcctacctggaaSttSgttSSgg^ 
^I^^S^^''''^''^''''^^^^^^^^^™'^^<^t^gtc^^tcagtggactctggttSS^^^ 
gagttccactgaggggagcactggagcagccctttggcagaggctgaggagggaStggaSSS 

CACCTGGCTCCATGGCATAAGGAAAGGGAGATGCTGGCCTCTGTGCTCCTSScrraTCCTGS^ 

gptgacttagtagcaaccgacagagtggcaagggatttggtcttSgcagtaSStc^^ 
FIGURE 184

QCCRKIKDTVQKLASDHKDIHSSVSRVGKAIDRNFDSEICGVVSDAVWDAREQQQQILQMAIV 
EHLYQQGMLSVAEELCQESTLNVDLDFKQPFLELNRILEALHEQDLGPALEWAVSHRQRLLEL 
NSSLEFKLHRLHFIRLLAGGPAKQLEALSYARHFQPFARLHQREIQVMMGSLVYLRLGLEKSP 
YCHLLDSSHWAEICETFTRDACSLLGLSVESPLSVSFASGCVALPVLMNIKAVIEQRQCTGVW 
NHKDELPIEIELGMKCWYHSVFACPILRQQTSDSNPPIKLICGHVISRDALNKLINGGKLKCP
YCPMEQNPADGKRIIF

Transmembrane domain: 

amino acids 222-241 

N-glycosylation site. 

amino acids 129-133 

Tyrosine kinase phosphorylation site. 

amino acids 151-159, 184-193 

Amidation site. 

amino acids 327-331 

Prokaryotic membrane lipoprotein lipid attachment site.

amino acids 222-233 
FIGURE 185

GAGCGACGCTGTCTCTAGTCGCTGATCCCAAATGCACCGGCTCATCTTTGTCTACACTCTAAT 

CTGCGCAAACTTTTGCAGCTGTCGGGACACTTCTGCAACCCCGCAGAGCGCATCCATCAAAGC 

TTTGCGCAACGCCAACCTCAGGCGAGATGACTTGTACCGAAGAGATGAGACCATCCAGGTGAA 

AGGAAACGGCTACGTGCAGAGTCCTAGATTCCCGAACAGCTACCCCAGGAACCTGCTCCTGAC 

ATGGCGGCTTCACTCTCAGGAGAATACACGGATACAGCTAGTGTTTGACAATCAGTTTGGATT 

AGAGGAAGCAGAAAATGATATCTGTAGGTATGATTTTGTGGAAGTTGAAGATATATCCGAAAC 

CAGTACCATTATTAGAGGACGATGGTGTGGACACAAGGAAGTTCCTCCAAGGATAAAATCAAG 

AACGAACCAAATTAAAATCACATTCAAGTCCGATGACTACTTTGTGGCTAAACCTGGATTCAA 

GATTTATTATTCTTTGCTGGAAGATTTCCAACCCGCAGCAGCTTCAGAGACCAACTGGGAATC 

TGTCACAAGCTCTATTTCAGGGGTATCCTATAACTCTCCATCAGTAACGGATCCCACTCTGAT 

TGCGGATGCTCTGGACAAAAAAATTGCAGAATTTGATACAGTGGAAGATCTGCTCAAGTACTT 

CAATCCAGAGTCATGGCAAGAAGATCTTGAGAATATGTATCTGGACACCCCTCGGTATCGAGG 

CAGGTCATACCATGACCGGAAGTCAAAAGTTGACCTGGATAGGCTCAATGATGATGCCAAGCG 

TTACAGTTGCACTCCCAGGAATTACTCGGTCAATATAAGAGAAGAGCTGAAGTTGGCCAATGT 

GGTCTTCTTTCCACGTTGCCTCCTCGTGCAGCGCTGTGGAGGAAATTGTGGCTGTGGAACTGT 

CAACTGGAGGTCCTGCACATGCAATTCAGGGAAAACCGTGAAAAAGTATCATGAGGTATTACA 

GTTTGAGCCTGGCCACATCAAGAGGAGGGGTAGAGCTAAGACCATGGCTCTAGTTGACATCCA 

GTTGGATCACCATGAACGATGCGATTGTATCTGCAGCTCAAGACCACCTCGATAAGAGAATGT 

GCACATCCTTACATTAAGCCTGAGAGAA 
FIGURE 186

MHRLIFVYTLICANFCSCRDTSATPQSASIKALRNANLRRDDLYRRDETIQVKGNGYVQSPRF 
PNSYPRNLLLTWRLHSQENTRIQLVFDNQFGLEEAENDICRYDFVEVEDISETSTIIRGRWCG 
HKEVPPRIKSRTNQIKITFKSDDYFVAKPGFKIYYSLLEDFQPAAASETNWESVTSSISGVSY 
NSPSVTDPTLIADALDKKIAEFDTVEDLLKYFNPESWQEDLENMYLDTPRYRGRSYHDRKSKV 
DLDRLNDDAKRYSCTPRNYSVNIREELKLANVVFFPRCLLVQRCGGNCGCGTVNWRSCTCNSG
KTVKKYHEVLQFEPGHIKRRGRAKTMALVDIQLDHHERCDCICSSRPPR 

Signal peptide: 

amino acids 1-18 

N-glycosylation site. 

amino acids 270-274 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 262-266 

Tyrosine kinase phosphorylation site. 

amino acids 256-265 

N-myristoylation sites. 

amino acids 94-100, 186-192, 297-303, 298-304 



TonB-dependent receptor proteins signature 1. 

amino acids 1-56 
FIGURE 187



CATGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTG 

TGTCTCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGT 

CCTACAATGGACTCCACCAGAGGGTCTTCAAGGAGTTAAAGTTACTTACACTGTGCAGTATTT 

CATATATGGGCAAAAGAAATGGCTGAATAAATCAGAATGCAGAAATATCAATAGAACCTACTG 

TGATCTTTGTGCTGAAACTTCTGACTACGAACACCAGTATTATGCCAAAGTTAAGGCCATTTG 

GGGAACAAAGTGTTCCAAATGGGCTGAAAGTGGACGGTTCTATCCTTTTTTAGAAACACAAAT 

TGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 

AGAGAAGTGGAAGAGAAATCCAGAAGACCTTCCTGTTTCCATGCAACAAATATACTCCAATCT 

GAAGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAA 

CCACACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTT 

CGTCCCAGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGA 

TCAATCATCAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGT 

GTTTCTTTTTTCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAAAGAGAAACA 

CCCAGCAAATTTGATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGA 

AAAAATCGTGATTAACTTTATCACCCTCAATATCTCGGATGATTCTAAAATTTCTCATCAGGA 

TATGAGTTTACTGGGAAAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAA 

CCTGAGGCCCCCTCAGGAGGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGA 

AATTTTTTGTGACTCTGAAGAAAACACGGAAGGTACTTCTCTCACCCAGCAAGAGTCCCTCAG 

CAGAACAATACCCCCGGATAAAACAGTCATTGAATATGAATATGATGTCAGAACCACTGACAT 

TTGTGCGGGGCCTGAAGAGCAGGAGCTCAGTTTGCAGGAGGAGGTGTCCACACAAGGAACATT 

ATTGGAGTCGCAGGCAGCGTTGGCAGTCTTGGGCCCGCAAACGTTACAGTACTCATACACCCC 

TCAGCTCCAAGACTTAGACCCCCTGGCGCAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGA 

AGAGCCATCGACGACCCTGGTCGACTGGGATCCCCAAACTGGCAGGCTGTGTATTCCTTCGCT 

GTCCAGCTTCGACCAGGATTCAGAGGGCTGCGAGCCTTCTGAGGGGGATGGGCTCGGAGAGGA 

GGGTCTTCTATCTAGACTCTATGAGGAGCCGGCTCCAGACAGGCCACCAGGAGAAAATGAAAC 

CTATCTCATGCAATTCATGGAGGAATGGGGGTTATATGTGCAGATGGAAAACTGATGCCAACA 

CTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAGTCACCCCTTTGATCCCAGCCATAAA 

GTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGTCTGTGAGAA 
FIGURE 188

MPLPPLLLLLLAAPWGRAVPCVSGGLPKPANITFLSINMKNVLQWTPPEGLQGVKVTYTVQYF 
IYGQKKWLNKSECRNINRTYCDLCAETSDYEHQYYAKVKAIWGTKCSKWAESGRFYPFLETQI
GPPEVALTTDEKSISVVLTAPEKWKRNPEDLPVSMQQIYSNLKYNVSVLNTKSNRTWSQCVTN 
HTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLKDQSSEFKAKIIFWYVLPISITV 
FLFSVMGYSIYRYIHVGKEKHPANLILIYGNEFDKRFFVPAEKIVINFITLNISDDSKISHQD 
MSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFCDSEENTEGTSLTQQESLS 
RTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQAALAVLGPQTLQYSYTP 
QLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDSEGCEPSEGDGLGEE 
GLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Signal sequence:

amino acids 1-18

Transmembrane domain:

amino acids 240-260

N-glycosylation sites.

amino acids 31-34, 72-75, 80-83, 171-174, 180-183, 189-192,
304-307, 523-526

Tyrosine kinase phosphorylation site.

amino acids 385-392, 518-526

N-myristoylation sites.

amino acids 53-58, 106-111, 368-373, 492-497 



Tissue factor 

amino acids 1-278 
FIGURE 189

ATGTGCTGCTGGCCGCTGCTCCTGCTGTGGGGGCTGCTCCCCGGGACGGCGGCGGGGGGCTCG 

GGCCGAACCTATCCGCACCGGACCCTCCTGGACTCGGAGGGCAAGTACTGGCTGGGCTGGAGC 

CAGCGGGGCAGCCAGATCGCCTTCCGCCTCCAGGTGCGCACTGCAGGCTACGTGGGCTTCGGC 

TTCTCGCCCACCGGGGCCATGGCGTCCGCCGACATCGTCGTGGGCGGGGTGGCCCACGGGCGG 

CCCTACCTCCAGGATTATTTTACAAATGCAAATAGAGAGTTGAAAAAAGATGCTCAGCAAGAT 

TACCATCTAGAATATGCCATGGAAAATAGCACACACACAATAATTGAATTTACCAGAGAGCTG 

CATACATGTGACATAAATGACAAGAGTATAACGGATAGCACTGTGAGAGTGATCTGGGCCTAC 

CACCATGAAGATGCAGGAGAAGCTGGTCCCAAGTACCATGACTCCAATAGGGGCACCAAGAGT 

TTGCGGTTATTGAATCCTGAGAAAACTAGTGTGCTATCTACAGCCTTACCATACTTTGATCTG 

GTAAATCAGGACGTCCCCATCCCAAACAAAGATACAACATATTGGTGCCAAATGTTTAAGATT 

CCTGTGTTCCAAGAAAAGCATCATGTAATAAAGGTTGAGCCAGTGATACAGAGAGGCCATGAG 

AGTCTGGTGCACCACATCCTGCTCTATCAGTGCAGCAACAACTTTAACGACAGCGTTCTGGAG 

TCCGGCCACGAGTGCTATCACCCCAACATGCCCGATGCATTCCTCACCTGTGAAACTGTGATT 

TTTGCCTGGGCTATTGGTGGAGAGGGCTTTTCTTATCCACCTCATGTTGGATTATCCCTTGGC 

ACTCCATTAGATCCGCATTATGTGCTCCTAGAAGTCCATTATGATAATCCCACTTATGAGGAA 

GGCTTAATAGATAATTCTGGACTGAGGTTATTTTACACAATGGATATAAGGAAATATGATGCT 

GGGGTGATTGAGGCTGGCCTCTGGGTGAGCCTCTTCCATACCATCCCTCCAGGGATGCCTGAG 

TTCCAGTCTGAGGGTCACTGCACTTTGGAGTGCCTGGAAGAGGCTCTGGAAGCCGAAAAGCCA 

AGTGGAATTCATGTGTTTGCTGTTCTTCTCCATGCTCACCTGGCTGGCAGAGGCATCAGGCTG 

CGTCATTTTCGAAAAGGGAAGGAAATGAAATTACTTGCCTATGATGATGATTTTGACTTCAAT 

TTCCAGGAGTTTCAGTATCTAAAGGAAGAACAAACAATCTTACCAGGAGATAACCTAATTACT 

GAGTGTCGCTACAACACGAAAGATAGAGCTGAGATGACTTGGGGAGGACTAAGCACCAGGAGT 

GAAATGTGTCTCTCATACCTTCTTTATTACCCAAGAATTAATCTTACTCGATGTGCAAGTATT 

CCAGACATTATGGAACAACTTCAGTTCATTGGGGTTAAGGAGATCTACAGACCAGTCACGACC 

TGGCCTTTCATTATCAAAAGTCCCAAGCAATATAAAAACCTTTCTTTCATGGATGCTATGAAT 

AAGTTTAAATGGACTAAAAAGGAAGGTCTCTCCTTCAACAAGCTGGTCCTCAGCCTGCCAGTG 

AATGTGAGATGTTCCAAGACAGACAATGCTGAGTGGTCGATTCAAGGAATGACAGCATTACCT 

CCAGATATAGAAAGACCCTATAAAGCAGAACCTTTGGTGTGTGGCACGTCTTCTTCCTCTTCC 

CTGCACAGAGATTTCTCCATCAACTTGCTTGTTTGCCTTCTGCTACTCAGCTGCACGCTGAGC 

ACCAAGAGCTTGT^TCAAAATTCTGTTGGACTTGACAATGTTTTCTATGATCTGAACCTGTC 

ATTTGAAGTACAGGTTAAAGACTGTGTCCACTTTGGGCATGAAGAGTGTGGAGACTTTTCTTC 

CCCATTTTCCCTCCCTCCTTTTTCCTTTCCATGTTACATGAGAGACATCAATCAGGTTCTCTT 

CTCTTTCTTAGAAATACCTGATGTTATATATACATGGTCAATAAAATAAAACTGGCCTGACTT 

AAGATAACCATTTTAAAAAATTGGGCTGTCATGTGGGAATAAAAGAATTCTTTCTTTCCTAAA 
AAAAAAAA 
FIGURE 190

MCCWPLLLLWGLLPGTAAGGSGRTYPHRTLLDSEGKYWLGWSQRGSQIAFRLQVRTAGYVGFG 
FSPTGAMASADIVVGGVAHGRPYLQDYFTNANRELKKDAQQDYHLEYAMENSTHTIIEFTREL 
HTCDINDKSITDSTVRVIWAYHHEDAGEAGPKYHDSNRGTKSLRLLNPEKTSVLSTALPYFDL 
VNQDVPIPNKDTTYWCQMFKIPVFQEKHHVIKVEPVIQRGHESLVHHILLYQCSNNFNDSVLE 
SGHECYHPNMPDAFLTCETVIFAWAIGGEGFSYPPHVGLSLGTPLDPHYVLLEVHYDNPTYEE 
GLIDNSGLRLFYTMDIRKYDAGVIEAGLWVSLFHTIPPGMPEFQSEGHCTLECLEEALEAEKP 
SGIHVFAVLLHAHLAGRGIRLRHFRKGKEMKLLAYDDDFDFNFQEFQYLKEEQTILPGDNLIT 
ECRYNTKDRAEMTWGGLSTRSEMCLSYLLYYPRINLTRCASIPDIMEQLQFIGVKEIYRPVTT 
WPFIIKSPKQYKNLSFMDAMNKFKWTKKEGLSFNKLVLSLPVNVRCSKTDNAEWSIQGMTALP 
PDIERPYKAEPLVCGTSSSSSLHRDFSINLLVCLLLLSCTLSTKSL 

Signal peptide: 

amino acids 1-18 

Transmembrane domains:

amino acids 56-73, 378-393, 583-602 

N-glycosylation sites.

amino acids 114-118, 247-251, 476-480, 517-521

N-myristoylation sites.

amino acids 11-17, 15-21, 20-26, 45-51, 68-74, 79-85, 290-296, 
316-322, 337-343, 342-348, 456-462, 534-540, 582-588 



Copper type II, ascorbate-dependent monooxygenases proteins.

amino acids 271-321, 422-474 



WO 01/40466    PCT/US00/32678

FIGURE 191

GCTTCAGCTGAAGAAAGAGAGGAATGAAGCGCCTTCTGCTTCTGTTTTTGTTCTTTATAACAT 

TTTCTTCTGCATTTCCCTTAGTCCGGATGACGGAAAATGAAGAAAATATGCAACTGGCTCAGG 

CATATCTCAACCAGTTCTACTCTCTTGAAATAGAAGGGAATCATCTTGTTCAAAGCAAGAATA 

GGAGTCTCATAGATGACAAAATTCGGGAAATGCAAGCATTTTTTGGATTGACAGTGACTGGAA 

AACTGGACTCAAACACCCTTGAGATCATGAAGACACCCAGGTGTGGGGTGCCTGATGTGGGCC 

AGTATGGCTACACCCTCCCTGGGTGGAGAAAATACAACCTCACCTACAGAATAATAAACTATA 

CTCCGGATATGGCACGAGCTGCTGTGGATGAGGCTATCCAAGAAGGTTTAGAAGTGTGGAGCA 

AAGTCACTCCACTAAAATTCACCAAGATTTCAAAGGGGATTGCAGACATCATGATTGCCTTTA 

GGACTCGAGTCCATGGTCGGTGTCCTCGCTATTTTGATGGTCCCTTGGGAGTGCTTGGCCATG 

CCTTTCCTCCTGGTCCGGGTCTGGGTGGTGACACTCATTTTGATGAGGATGAAAACTGGACCA 

AGGATGGAGCAGGATTCAACTTGTTTCTTGTGGCTGCTCATGAATTTGGTCATGCACTGGGGC 

TCTCTCACTCCAATGATCAAACAGCCTTGATGTTCCCAAATTATGTCTCCCTGGATCCCAGAA 

AATACCCACTTTCTCAGGATGATATCAATGGAATCCAGTCCATCTATGGAGGTCTGCCTAAGG 

TACCTGCTAAGCCAAAGGAACCCACTATACCCCATGCCTGTGACCCTGACTTGACTTTTGACG 

CTATCACAACTTTCCGCAGAGAAGTAATGTTCTTTAAAGGCAGGCACCTATGGAGGATCTATT 

ATGATATCACGGATGTTGAGTTTGAATTAATTGCTTCATTCTGGCCATCTCTGCCAGCTGATC

TGCAAGCTGCATACGAGAACCCCAGAGATAAGATTCTGGTTTTTAAAGATGAAAACTTCTGGA 

TGATCAGAGGATATGCTGTCTTGCCAGATTATCCCAAATCCATCCATACATTAGGTTTTCCAG 

GACGTGTGAAGAAAATAGATGCAGCCGTCTGTGATAAGACCACAAGAAAAACCTACTTCTTTG 

TGGGCATTTGGTGCTGGAGGTTTGATGAAATGACCCAAACCATGGACAAAGGATTCCCGCAGA 

GAGTGGTAAAACACTTTCCTGGAATCAGTATCCGTGTTGATGCTGCTTTCCAGTACAAAGGAT 

TCTTCTTTTTCAGCCGTGGATCAAAGCAATTTGAATACAACATTAAGACAAAGAATATTACCC 

GAATCATGAGAACTAATACTTGGTTTCAATGCAAAGAACCAAAGAACTCCTCATTTGGTTTTG 

ATATCAACAAGGAAAAAGCACATTCAGGAGGCATAAAGATATTGTATCATAAGAGTTTAAGCT 

TGTTTATTTTTGGTATTGTTCATTTGCTGAAAAACACTTCTATTTATCAATAAATTCATAGAC 

CTAAAATAAACCTCAACAGGTCTTTTAATATAAATTCTGCTTCAAAATAGAATAAAACCATTC 
TTTAACAAC 
FIGURE 192



MKRLLLLFLFFITFSSAFPLVRMTENEENMQLAQAYLNQFYSLEIEGNHLVQSKNRSLIDDKI
REMQAFFGLTVTGKLDSNTLEIMKTPRCGVPDVGQYGYTLPGWRKYNLTYRIINYTPDMARAA
VDEAIQEGLEVWSKVTPLKFTKISKGIADIMIAFRTRVHGRCPRYFDGPLGVLGHAFPPGPGL
GGDTHFDEDENWTKDGAGFNLFLVAAHEFGHALGLSHSNDQTALMFPNYVSLDPRKYPLSQDD
INGIQSIYGGLPKVPAKPKEPTIPHACDPDLTFDAITTFRREVMFFKGRHLWRIYYDITDVEF
ELIASFWPSLPADLQAAYENPRDKILVFKDENFWMIRGYAVLPDYPKSIHTLGFPGRVKKIDA
AVCDKTTRKTYFFVGIWCWRFDEMTQTMDKGFPQRVVKHFPGISIRVDAAFQYKGFFFFSRGS
KQFEYNIKTKNITRIMRTNTWFQCKEPKNSSFGFDINKEKAHSGGIKILYHKSLSLFIFGIVH
LLKNTSIYQ

Signal peptide:

amino acids 1-17

N-glycosylation sites.

amino acids 55-59, 110-114, 200-204, 452-456, 470-474, 508-512

N-myristoylation sites.

amino acids 71-77, 205-211, 223-229

Hemopexin domain signature.

amino acids 171-202, 207-238, 318-334

Neutral zinc metallopeptidases, zinc-binding region signature.

amino acids 213-223

Matrixins cysteine switch.

amino acids 89-97, 207-238
FIGURE 193



CACAATCAGGTCCCATTCTATAGATGGGGAAACTGAGGCTTGAGGTCACATAGGCGTCGTTCA 

AGGCTGGTATACCTGCACCCTCTCCCATGTGAACAACATGGTTCTGGGTAATGGGGGCTGTCA 

TCCAGTCTCCTCCCTGCCCCTGCTGGTGCACTTCCTGCCTCTGCTGGTGCACTTTCTGCCCCT 

ACTGGTATATTTGCTGCCTCTGCTGGGGCGCTTCCTGCCTCGGCTGGTGTATCTCCTGCCCCT 

GCTGGTGCACTTTCTGCCCCCGCTGATGCACTTCCTGCCTCTGCTGGTGCACTTCCTGGCTCT 

GCTGGCACACTTCCTGCCTCTGCTGGTGCACTTCCTGGCTCTGCTGGCGCACTTTCCTGCCCC 

TGCTGGTGTATTTCCTGCCCCTGCTGGTGTACTTCCTTCCCCTGCTGGTGCACTTCCTGCCTC 

TGCTGGCGCACTTCTTGCCTCTCCAGGCCCTACCT^CCTCTCCCTCTTATATATGGAAGTCT 

TCCCAGTTCACTGACACTGGTAACAGGGACTCTGCTCTTGGTGTTGCTGTCTGCCCTGGGGAT 

GGGCATCTGTGTCTTCCTTTACTACTGCTGGCTCAGGACCCAGAGCTTTGAAGCATGTCCAGA 

TGCAGGTCCGGGCACCAGAGTCTAAGGAGCCCCTACACCCACCAGGATTTTCCAATAAAGAGA 
TGTTCACCA 
FIGURE 194



MVLGNGGCHPVSSLPLLVHFLPLLVHFLPLLVYLLPLLGRFLPRLVYLLPLLVHFLPPLMHFL 
PLLVHFLALLAHFLPLLVHFLALLAHFPAPAGVFPAPAGVLPSPAGALPASAGALLASPGPT 

Signal peptide: 

amino acids 1-39 

N-myristoylation sites. 

amino acids 4-10, 109-115, 116-122 

Leucine zipper pattern. 

amino acids 14-36, 16-38, 17-39, 21-43, 24-46, 28-50, 31-53,
35-57, 38-60, 42-64, 45-67, 49-71, 52-74, 56-78, 59-81, 63-85,
65-87, 66-88
FIGURE 195

GGCAAGGCGGCGGCGGCGGCGGCGGCAGCCGCGGTGGCGGCGTGGGGAACATCTCGGCAGCCA 

CCGCGCTTCTCCCGCTGGAGCGGGCGTCCAGCTTGGCTGCCCTCGGTCCTTCCCTGCCACGTT 

TCGGGTCGCCCTGCACCCCCCACCCAGGCTCGCTTCTCTTCGAAGCGGGAAGGGCGCCTTGCA 

GGATCCTGCCGCCCCTCCAACCGGATCCTGGGTCTAGAGCTCCCCAGAGCGAGGCGCTCGCCA 

GGACTCCTGCCCCGCCAACCCTGACCGCCGGGGGGTGCCCCCGGGACGTAGCGCCGCGGAGAG 

GAAGCGGCAAAGGGGACCATGCGGCGCCTGACTCGTCGGCTGGTTCTGCCAGTCTTCGGGGTG

CTCTGGATCACGGTGCTGCTGTTCTTCTGGGTAACCAAGAGGAAGTTGGAGGTGCCGACGGGA 

CCTGAAGTGCAGACCCCTAAGCCTTCGGACGCTGACTGGGACGACCTGTGGGACCAGTTTGAT 

GAGCGGCGGTATCTGAATGCCAAAAAGTGGCGCGTTGGTGACGACCCCTATAAGCTGTATGCT 

TTCAACCAGCGGGAGAGTGAGCGGATCTCCAGCAATCGGGCCATCCCGGACACTCGCCATCTG 

AGATGCACACTGCTGGTGTATTGCACGGACCTTCCACCCACTAGCATCATCATCACCTTCCAC 

AACGAGGCCCGCTCCACGCTGCTCAGGACCATCCGCAGTGTATTAAACCGCACCCCTACGCAT 

CTGATCCGGGAAATCATATTAGTGGATGACTTCAGCAATGACCCTGATGACTGTAAACAGCTC 

ATCAAGTTGCCCAAGGTGAAATGCTTGCGCAATAATGAACGGCAAGGTCTGGTCCGGTCCCGG 

ATTCGGGGCGCTGACATCGCCCAGGGCACCACTCTGACTTTCCTCGACAGCCACTGTGAGGTG 

AACAGGGACTGGCTCCAGCCTCTGTTGCACAGGGTCAAAGAGGACTACACGCGGGTGGTGTGC 

CCTGTGATCGATATCATTAACCTGGACACCTTCACCTACATCGAGTCTGCCTCGGAGCTCAGA 

GGGGGGTTTGACTGGAGCCTCCACTTCCAGTGGGAGCAGCTCTCCCCAGAGCAGAAGGCTCGG 

CGCCTGGACCCCACGGAGCCCATCAGGACTCCTATCATAGCTGGAGGGCTCTTCGTGATCGAC 

AAAGCTTGGTTTGATTACCTGGGGAAATATGATATGGACATGGACATCTGGGGTGGGGAGAAC 

TTTGAAATCTCCTTCCGAGTGTGGATGTGCGGGGGCAGCCTAGAGATCGTCCCCTGCAGCCGA 

GTGGGGCACGTCTTCCGGAAGAAGCACCCCTACGTTTTCCCTGATGGAAATGCCAACACGTAT 

ATAAAGAACACCAAGCGGACAGCTGAAGTGTGGATGGATGAATACAAGCAATACTATTACGCT 

GCCCGGCCATTCGCCCTGGAGAGGCCCTTCGGGAATGTTGAGAGCAGATTGGACCTGAGGAAG 

AATCTGCGCTGCCAGAGCTTCAAGTGGTACCTGGAGAATATCTACCCTGAACTCAGCATCCCC 

AAGGAGTCCTCCATCCAGAAGGGCAATATCCGACAGAGACAGAAGTGCCTGGAATCTCAAAGG 

CAGAACAACCAAGAAACCCCAAACCTAAAGTTGAGCCCCTGTGCCAAGGTCAAAGGCGAAGAT 

GCAAAGTCCCAGGTATGGGCCTTCACATACACCCAGCAGATCCTCCAGGAGGAGCTGTGCCTG 

TCAGTCATCACCTTGTTCCCTGGCGCCCCAGTGGTTCTTGTCCTTTGCAAGAATGGAGATGAC 

CGACAGCAATGGACCAAAACTGGTTCCCACATCGAGCACATAGCATCCCACCTCTGCCTCGAT 

ACAGATATGTTCGGTGATGGCACCGAGAACGGCAAGGAAATCGTCGTCAACCCATGTGAGTCC 

TCACTCATGAGCCAGCACTGGGACATGGTGAGCTCTT^GGACCCCTGCCAGAAGCAGCAAGG 

GCCATGGGGTGGTGCTTCCCTGGACCAGAACAGACTGGAAACTGGGCAGCAAGCAGCCTGCAA 

CCACCTCAGACATCCTGGACTGGGAGGTGGAGGCAGAGCCCCCCAGGACAGGAGCAACTGTCT 

CAGGGAGGACAGAGGAAAACATCACAAGCCAATGGGCTCAAAGACAAATCCCACATGTTCTCA 

AGGCCGTTAAGTTCCAGTCCTGGCCAGTCATTCCCTGATTGGTATCTGGAGACAGAAACCTAA 

TGGGAAGTGTTTATTGTTCCTTTTCCTACAAAGGAAGCAGTCTCTGGAGGCCAGAAAGAAAAG 

CCTTCTTTTTCACTAGGCCAGGACTACATTGAGAGATGAAGAATGGAGGTTGTTTCCAAAAGA 

AATAAAGAGAAACTTAGAAGTTGTCTCTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAA 
FIGURE 196

MRRLTRRLVLPVFGVLWITVLLFFWVTKRKLEVPTGPEVQTPKPSDADWDDLWDQFDERRYLN 
AKKWRVGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARST 
LLRTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRGADI 
AQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIESASELRGGFDWS
LHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFR 
VWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARPFAL 
ERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKESSIQKGNIRQRQKCLESQRQNNQET 
PNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVITLFPGAPVVLVLCKNGDDRQQWTK
TGSHIEHIASHLCLDTDMFGDGTENGKEIVVNPCESSLMSQHWDMVSS

Transmembrane domain: 

amino acids 475-493 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 2-6 

Tyrosine kinase phosphorylation sites.

amino acids 68-75, 401-409 

N-myristoylation sites. 

amino acids 178-184, 186-192, 192-198, 346-352, 383-389, 526-532 
FIGURE 197



GCAGCTCACCCTTCGCAGCCGCGATGGGGGAAGACGACGCCGCGCTTCGGGCTGGCAGCAGGGGGCTCTCCGACC

CGTGGGCAGACTCAGTGGGAGTGCGACCCCGCACCACGGAGCGCCACATCGCCGTACACAAGCGGCTTGTGCTGG 

CCTTCGCTGTGTCCCTCGTGGCATTGCTCGCGGTCACAATGCTCGCTGTGCTGCTCAGCCTGCGCTTCGACGAGT 

GCGGGGCGAGTGCCACGCCAGGCGCCGACGGTGGCCCCTCAGGCTTTCCGGAGCGCGGCGGCAACGGGAGCCTCC 

CTGGATCGGCCCGGCGCAACCACCACGCAGGCGGGGACTCCTGGCAGCCCGAGGCGGGTGGGGTGGCCAGTCCGG 

GGACCACGTCGGCCCAGCCGCCGTCGGAGGAGGAGCGGGAGCCGTGGGAGCCGTGGACGCAGCTGCGCCTGTCGG 

GCCACCTGAAGCCGCTGCACTACAATCTGATGCTCACCGCCTTCATGGAGAACTTCACCTTCTCCGGGGAGGTCA 

ACGTGGAGATCGCGTGCCGGAACGCCACCCGCTACGTAGTGCTGCACGCTTCCCGAGTGGCGGTGGAGAAAGTGC 

AGCTGGCCGAGGACCGGGCGTTCGGGGCTGTCCCTGTAGCCGGTTTTTTCCTCTACCCGCAAACCCAGGTCTTAG 

TGGTGGTGCTGAATAGGACACTGGACGCGCAGAGGAATTACAATCTGAAGATTATCTACAACGCGCTCATCGAGA 

ATGAGCTCCTGGGCTTCTTCCGCAGCTCCTATGTGCTCCACGGGGAGAGAAGATTCCTTGGTGTTACTCAGTTTT 

CGCCTACACATGCCAGAAAGGCATTTCCTTGTTTTGATGAGCCAATCTACAAGGCTACTTTCAAAATCAGCATCA 

AGCATCAAGCAACCTATTTATCTTTATCTAATATGCCAGTGGAAACTTCCGTGTTTGAGGAAGATGGATGGGTTA 

CGGATCACTTTTCACAGACCCCTCTCATGTCCACATATTATTTAGCCTGGGCAATTTGCAACTTCACATACAGAG 

AAACTACCACCAAGAGTGGGGTTGTAGTACGATTATATGCAAGACCTGATGCTATCAGAAGAGGATCCGGGGACT 

ATGCTCTCCATATAACAAAGAGATTAATAGAATTTTATGAAGACTACTTTAAAGTGCCCTATTCCTTGCCAAAAC 

TAGATCTTTTAGCTGTGCCTAAGCATCCGTATGCTGCTATGGAGAACTGGGGACTAAGTATTTTTGTGGAACAAA 

GAATACTGCTGGATCCCAGTGTTTCATCTATTTCTTATTTGCTGGATGTCACCATGGTCATTGTTCATGAGATAT 

GTCACCAGTGGTTTGGTGACCTTGTGACGCCTGTGTGGTGGGAAGACGTGTGGCTGAAGGAAGGGTTTGCTCACT 

ACTTTGAATTTGTTGGTACAGACTACCTCTATCCTGGCTGGAACATGGAAAAGCAGAGGTTTCTGACCGATGTTC 

TGCATGAAGTGATGCTGCTGGACGGTTTGGCCAGTTCCCATCCAGTATCACAGGAAGTGCTGCAGGCAACAGATA 

TTGACAGGGTGTTTGACTGGATCGCATATAAAAAGGGTGCTGCTTTAATAAGAATGCTGGCTAATTTTATGGGCC 

ATTCAGTTTTCCAGAGGGGTTTGCAAGATTATTTAACCATTCATAAGTATGGTAATGCAGCCAGAAATGATCTCT 

GGAATACATTATCGGAGGCTTTAAAAAGAAATGGGAAATATGTAAATATACAAGAAGTAATGGATCAGTGGACAC 

TCCAGATGGGTTATCCTGTTATCACCATCTTGGGAAACACAACAGCAGAAAATAGAATAATAATTACCCAACAGC 

ATTTTATCTATGATATCAGTGCTAAAACTAAAGCACTTAAACTTCAGAATAACAGTTACCTGTGGCAGATTCCAT 

TAACTATTGTGGTAGGAAATAGAAGCCATGTGTCTTCAGAAGCAATTATTTGGGTGTCTAACAAATCAGAGCACC 

ACAGAATAACTTATTTGGACAAAGGAAGCTGGCTGCTGGGGAACATCAATCAAACTGGCTATTTTAGAGTCAACT 

ATGACCTAAGGAACTGGAGATTATTAATTGATCAATTAATCCGGAATCATGAGGTTCTTTCTGTCAGTAACCGAG 

CGGGCTTGATCGATGATGCCTTCAGCCTAGCCAGGGCTGGCTATTTGCCTCAGAATATTCCTCTGGAGATTATCA 

GATACCTGTCTGAGGAGAAGGATTTTCTTCCTTGGCATGCTGCCAGCCGAGCTCTTTATCCTCTAGATAAATTAC 

TGGACCGCATGGAAAACTACAACATTTTCAATGAATATATTTTAAAGCAAGTTGCAACAACATATATCAAGCTTG 

GGTGGCCGAAAAATAATTTTAATGGATCTCTTGTTCAAGCATCCTACCAACATGAAGAACTACGTAGAGAAGTTA 

TAATGCTGGCCTGCAGTTTTGGCAACAAGCACTGTCACCAACAGGCATCAACACTTATTTCAGATTGGATTTCCA 

GCAACAGGAACAGAATACCACTAAATGTTAGAGACATCGTATACTGTACAGGAGTGTCACTACTGGATGAGGATG 

TCTGGGAATTCATATGGATGAAATTCCATTCCACCACAGCAGTTTCTGAGAAGAAAATATTATTGGAAGCCTTAA 

CTTGCAGTGATGACAGGAATTTATTAAACAGGCTTCTAAATCTGTCACTGAATTCTGAGGTGGTGCTGGATCAAG 

ATGCAATTGATGTCATAATCCATGTAGCTCGAAATCCACATGGTCGAGACCTTGCCTGGAAGTTTTTCAGGGATA 

AATGGAAGATATTAAATACCAGGTATGGAGAAGCATTGTTTATGTATTCCAAACTCATCAGTGGTGTCACAGAAT 

TTCTTAATACTGAAGGTGAACTCAAAGAGCTCAAGAACTTCATGAAAAACTATGATGGGGTAGCTGCTGCTTCTT 

TCTCACGAGCTGTGGAAACTGTCGAAGCCAATGTGCGCTGGAAAATGCTTTACCAAGACGAGCTTTTCCAATGGT 

TAGGAAAAGCTCTAAGACACTAATATATGTATCTTATAAACAAACAATTCAACTCAGAAGTTTATGAGAAGACAC 

GCTTTTTGTGGAATGAGGAAAATGTACTACCTAGAAAATGGCCAGATTTTCAGTGTTAACGTGTGGGAGGAATTT 

TTTTTTTTAGTTTTTATTTTTTGGTTTTGGGGGATATTTTTTATTTGTTTCATTCATTCTGTTCTGTTTCTCTAC 

TGGGTGTTCCTCTCTAAAGAAACTCTTGCAAGTGAAACTAGCCATGATTGCTTCAGCTGTACATTCCTTGCTGTA 

CAGGACCAAATATGATAGTGATGCATGTTGATGTTACAGTCAATTTGGAAAAACATATTCAGAATATCTGTGCAT 

GGATATATTGTCCTGCCTGTGTTCCAGCATGCTTATTTCAAACGTCCAGTGTTGTGTGTGAATATGTGTTACACC 

TAGGATGGGCATTATGCAAAAGCACAAAGATTATATATGACAATCAGTATTGCAATGAAAGAAAAACTAAAAACA 

GAAATGATATTCTCAATTTTGGGCAATGTGAGAGGTAAAATAGCCCTTGACATGATGAACATCACTTATTTCAGC 

ACTTGGATTGTCTGGCAATGATTACTGTGTTGCTAACTCATTTTCTTTGAGTTAAAGCTGTGTATACATTTTAAA 

AGGCATATAGATAGTGTATGCATATGTATATGTACATAGGGAAGCCCCATATGTATATAGTATGTTGTACACTGC 

ACATGTACAAAGAATGTCTTCAGATCAAAGAAAATTTATCTCTTTTTATAAACTTAAGGACAGTTGCAAAAGGCT 

TCAAGGAATTTATCTCAACATTATTCTTTCTATGTCCTAACTAAATTTCTCAACTGTTATGAATTTTTCATCTAC 

TTCTTGAACAGTGGTCTATTCTGCTACATGAAGATGAATACAAACAAAATTTTTGTATAAACTCCCAAAAAAAAA 
AAAAAAAAAA 
FIGURE 198

MGEDDAALRAGSRGLSDPWADSVGVRPRTTERHIAVHKRLVLAFAVSLVALLAVTMLAVLLSL 

RFDECGASATPGADGGPSGFPERGGNGSLPGSARRNHHAGGDSWQPEAGGVASPGTTSAQPPS 

EEEREPWEPWTQLRLSGHLKPLHYNLMLTAFMENFTFSGEVNVEIACRNATRYWLHASRVAV 

EKVQLAEDRAFGAVPVAGFFLYPQTQVLVWLNRTLDAQRNYNLKIIYNALIENELLGFFRSS 

YVLHGERRFLGVTQFSPTHARKAFPCFDEPIYKATFKISIKHQATYLSLSNMPVETSVFEEDG 

WVTDHFSQTPLMSTYYLAWAICNFTYRETTTKSGVWRLYARPbAIRRGSGDYALHITKRLIE 

FYEDYFKVPYSLPKLDLLAVPKHPYAAMENWGLSIFVEQRILLDPSVSSISYLLDVTMVIVHE 

ICHQWFGDLVTPVWWEDVWLKEGFAHYFEFVGTDYLYPGWNMEKQRFLTDVLHEVMLLDGLAS 

SHPVSQEVLQATDIDRVFDWIAYKKGAALIRMLANFMGHSVFQRGLQDYLTIHKYGNAARNDL 

WNTLSEALKRNGKYVNIQEVMDQWTLQMGYPVITILGNTTAENRIIITQQHFIYDISAKTKAL 

KLQNNSYLWQIPLTIVVGNRSHVSSEAIIWVSNKSEHHRITYLDKGSWLLGNINQTGYFRVNY 

DLRNWRLLIDQLIRNHEVLSVSNRAGLIDDAFSLARAGYLPQNIPLEIIRYLSEEKDFLPWHA 

ASRALYPLDKLLDRMENYNIFNEYILKQVATTYIKLGWPKNNFNGSLVQASYQHEELRREVIM

LACSFGNKHCHQQASTLISDWISSNRNRIPLNVRDIVYCTGVSLLDEDVWEFIWMKFHSTTAV 

SEKKILLEALTCSDDRNLLNRLLNLSLNSEVVLDQDAIDVIIHVARNPHGRDLAWKFFRDKWK

ILNTRYGEALFMYSKLISGVTEFLNTEGELKELKNFMKNYDGVAAASFSRAVETVEANVRWKM 
LYQDELFQWLGKALRH 

Transmembrane domain: 
amino acids 44-63 
N-glycosylation sites.

amino acids 89-93, 160-164, 175-179, 222-226, 338-342, 605-609,
634-638, 649-653, 663-667, 684-688, 800-804, 906-910
cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 362-366 

Tyrosine kinase phosphorylation site. 

amino acids 520-528 
N-myristoylation sites. 

amino acids 78-84, 87-93, 90-96, 118-124, 501-507, 604-610,
825-831, 987-993

Neutral zinc metallopeptidases, zinc-binding region signature.

amino acids 437-447
FIGURE 199

GCGCCCGGCGCAGCTCGGCCAGAGCGACCGCGGGGCTGAGCGCGCGTCCGCCCAGGGGGCTCCGGAAGCTGCCCC 

GGCCCGCGGCCTCCTCCCTCGCTCCCGCTTCCCCTTTCTCGCTCACCGCCGCCCTCCTTCCCCAGCTCCCTCGCC 

GTCCGCCCGCCCCACAGCCAGCGGCTCCGCGCCCCCTGCAGCCACGATGCCCGCGGCCCGGCCGCCCGCCGCGGG 

ACTCCGCGGGATCTCGCTGTTCCTCGCTCTGCTCCTGGGGAGCCCGGCGGCAGCGCTGGAGCGAGATGCTCTTCC 

CGAGGGAGATGCTAGCCCTTTGGGTCCTTACCTCCTGCCCTCAGGAGCCCCGGAGAGAGGCAGTCCTGGCAAAGA 

GCACCCTGAAGAGAGAGTGGTAACAGCGCCCCCCAGTTCCTCACAGTCGGCGGAAGTGCTGGGCGAGCTGGTGCT 

GGATGGGACCGCACCCTCTGCACATCACGACATCCCAGCCCTGTCACCGCTGCTTCCAGAGGAGGCCCGCCCCAA 

GCACGCCTTGCCCCCCAAGAAGAAACTGCCTTCGCTCAAGCAGGTGAACTCTGCCAGGAAGCAGCTGAGGCCCAA 

GGCCACCTCCGCAGCCACTGTCCAAAGGGCAGGGTCCCAGCCAGCGTCCCAGGGCCTAGATCTCCTCTCCTCCTC 

CACGGAGAAGCCTGGCCCACCGGGGGACCCGGACCCCATCGTGGCCTCCGAGGAGGCATCAGAAGTGCCCCTTTG 

GCTGGATCGAAAGGAGAGTGCGGTCCCTACAACACCCGCACCCCTGCAAATCTCCCCCTTCACTTCGCAGCCCTA 

TGTGGCCCACACACTCCCCCAGAGGCCAGAACCCGGGGAGCCTGGGCCTGACATGGCCCAGGAGGCCCCCCAGGA 

GGACACCAGCCCCATGGCCCTGATGGACAAAGGTGAGAATGAGCTGACTGGGTCAGCCTCAGAGGAGAGCCAGGA 

GACCACTACCTCCACCATTATCACCACCACGGTCATCACCACCGAGCAAGCACCAGCTCTCTGCAGTGTGAGCTT 

CTCCAATCCTGAGGGGTACATTGACTCCAGCGACTACCCACTGCTGCCCCTCAACAACTTTCTGGAGTGCACATA 

CAACGTGACAGTCTACACTGGCTATGGGGTGGAGCTCCAGGTGAAGAGTGTGAACCTGTCCGATGGGGAACTGCT 

CTCCATCCGCGGGGTGGACGGCCCTACCCTGACCGTCCTGGCCAACCAGACACTCCTGGTGGAGGGGCAGGTAAT 

CCGAAGCCCCACCAACACCATCTCCGTCTACTTCCGGACCTTCCAGGACGACGGCCTTGGGACCTTCCAGCTTCA 

CTACCAGGCCTTCATGCTGAGCTGCAACTTTCCCCGCCGGCCTGACTCTGGGGATGTCACGGTGATGGACCTGCA 

CTCAGGTGGGGTGGCCCACTTTCACTGCCACCTGGGCTATGAGCTCCAGGGCGCTAAGATGCTGACATGCATCAA 

TGCCTCCAAGCCGCACTGGAGCAGCCAGGAGCCCATCTGCTCAGCTCCTTGTGGAGGGGCAGTGCACAATGCCAC 

CATCGGCCGCGTCCTCTCCCCAAGTTACCCTGAAAACACAAATGGGAGCCAATTCTGCATCTGGACGATTGAAGC 

TCCAGAGGGCCAGAAGCTGCACCTGCACTTTGAGAGGCTGTTGCTGCATGACAAGGACAGGATGACGGTTCACAG 

CGGGCAGACCAACAAGTCAGCTCTTCTCTACGACTCCCTTCAAACCGAGAGTGTCCCTTTTGAGGGCCTGCTGAG 

CGAAGGCAACACCATCCGCATCGAGTTCACGTCCGACCAGGGCCGGGCGGCCTCCACCTTCAACATCCGATTTGA 

AGCGTTTGAGAAAGGCCACTGCTATGAGCCCTACATCCAGAATGGGAACTTCACTACATCCGACCCGACCTATAA 

CATTGGGACTATAGTGGAGTTCACCTGCGACCCCGGCCACTCCCTGGAGCAGGGCCCGGCCATCATCGAATGCAT 

CAATGTGCGGGACCCATACTGGAATGACACAGAGCCCCTGTGCAGAGCCATGTGTGGTGGGGAGCTCTCTGCTGT 

GGCTGGGGTGGTATTGTCCCCAAACTGGCCCGAGCCCTACGTGGAAGGTGAAGATTGTATCTGGAAGATCCACGT 

GGGAGAAGAGAAACGGATCTTCTTAGATATCCAGTTCCTGAATCTGAGCAACAGTGACATCTTGACCATCTACGA 

TGGCGACGAGGTCATGCCCCACATCTTGGGGCAGTACCTTGGGAACAGTGGCCCCCAGAAACTGTACTCCTCCAC 

GCCAGACTTAACCATCCAGTTCCATTCGGACCCTGCTGGCCTCATCTTTGGAAAGGGCCAGGGATTTATCATGAA 

CTACATAGAGGTATCAAGGAATGACTCCTGCTCGGATTTACCCGAGATCCAGAATGGCTGGAAAACCACTTCTCA 

CACGGAGTTGGTGCGGGGAGCCAGAATCACCTACCAGTGTGACCCCGGCTATGACATCGTGGGGAGTGACACCCT 

CACCTGCCAGTGGGACCTCAGCTGGAGCAGCGACCCCCCATTTTGTGAGAAAATTATGTACTGCACCGACCCCGG 

AGAGGTGGATCACTCGACCCGCTTAATTTCGGATCCTGTGCTGCTGGTGGGGACCACCATCCAATACACGTGCAA 

CCCCGGTTTTGTGCTTGAAGGGAGTTCTCTTCTGACCTGCTACAGCCGTGAAACAGGGACTCCCATCTGGACGTC 

TCGCCTGCCCCACTGCGTTTCGGAGGAGTCCCTGGCATGTGACAACCCAGGGCTGCCTGAAAATGGATACCAAAT 

CCTGTACAAGCGACTCTACCTGCCAGGAGAGTCCCTCACCTTCATGTGCTACGAAGGCTTTGAGCTCATGGGTGA 

AGTGACCATCCGCTGCATCCTGGGACAGCCATCCCACTGGAACGGGCCCCTGCCCGTGTGTAAAGTTAATCAAGA 

CAGTTTTGAACATGCTTTAGAAGCAGAAGCGGCAGCAGAGACGTCGCTGGAAGGGGGGAACATGGCCCTGGCTAT 

CTTCATCCCGGTCCTCATCATCTCCTTACTGCTGGGAGGAGCCTACATTTACATCACAAGATGTCGCTACTATTC 

CAACCTCCGCCTGCCTCTGATGTACTCCCACCCCTACAGCCAGATCACCGTGGAAACCGAGTTTGACAACCCCAT 

TTACGAGACAGGGGAAACCAGAGAGTATGAGGTTTCTATCTAAAGAGAGCTACACTTGAGAAGGGGACTTGTGAA 

CTCAACCACAATCTCCTCGAGACATTCATCCAGAGACCATGTGGCACTTGATTGAAACCCCAGAATGTCGACTGT 

CTTTTGTTTAGACTCTTTATCAAAGGTTTACTGTTTTCTTCCCTGTATTTATTATATTTAAAAGTGAAAAAAAAA 

AAAAAAAAAAA
FIGURE 200



MPAARPPAAGLRGISLFLALLLGSPAAALERDALPEGDASPLGPYLLPSGAPERGSPGKEHPE



ERVVTAPPSSSQSAEVLGELVLDGTAPSAHHDIPALSPLLPEEARPKHALPPKKKLPSLKQVN

SARKQLRPKATSAATVQRAGSQPASQGLDLLSSSTEKPGPPGDPDPIVASEEASEVPLWLDRK 

ESAVPTTPAPLQISPFTSQPYVAHTLPQRPEPGEPGPDMAQEAPQEDTSPMALMDKGENELTG 

SASEESQETTTSTIITTTVITTEQAPALCSVSFSNPEGYIDSSDYPLLPLNNFLECTYNVTVY 

TGYGVELQVKSVNLSDGELLSIRGVDGPTLTVLANQTLLVEGQVIRSPTNTISVYFRTFQDDG 

LGTFQLHYQAFMLSCNFPRRPDSGDVTVMDLHSGGVAHFHCHLGYELQGAKMLTCINASKPHW 

SSQEPICSAPCGGAVHNATIGRVLSPSYPENTNGSQFCIWTIEAPEGQKLHLHFERLLLHDKD 

RMTVHSGQTNKSALLYDSLQTESVPFEGLLSEGNTIRIEFTSDQARAASTFNIRFEAFEKGHC 

YEPYIQNGNFTTSDPTYNIGTIVEFTCDPGHSLEQGPAIIECINVRDPYWNDTEPLCRAMCGG 

ELSAVAGVVLSPNWPEPYVEGEDCIWKIHVGEEKRIFLDIQFLNLSNSDILTIYDGDEVMPHI

LGQYLGNSGPQKLYSSTPDLTIQFHSDPAGLIFGKGQGFIMNYIEVSRNDSCSDLPEIQNGWK 

TTSHTELVRGARITYQCDPGYDIVGSDTLTCQWDLSWSSDPPFCEKIMYCTDPGEVDHSTRLI 

SDPVLLVGTTIQYTCNPGFVLEGSSLLTCYSRETGTPIWTSRLPHCVSEESLACDNPGLPENG 

YQILYKRLYLPGESLTFMCYEGFELMGEVTIRCILGQPSHWNGPLPVCKVNQDSFEHALEAEA

AAETSLEGGNMALAIFIPVLIISLLLGGAYIYITRCRYYSNLRLPLMYSHPYSQITVETEFDN

PIYETGETREYEVSI

Signal peptide:
amino acids 1-28 
Transmembrane domain: 
amino acids 893-915 
N-glycosylation sites. 

amino acids 311-315, 328-332, 350-354, 435-439, 458-462, 474-478,
514-518, 576-580, 618-622, 674-678, 742-746

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 188-192
N-myristoylation sites.

amino acids 23-29, 87-93, 146-152, 454-460, 475-481, 575-581,
629-635, 695-701, 723-729, 766-772, 877-883, 953-959
Prokaryotic membrane lipoprotein lipid attachment site.

amino acids 383-394
FIGURE 201

GATGGCTACGGCAGGGGGTGGCTCTGGGGCTGACCCGGGAAGTCGGGGTCTCCTTCGCCTTCT

GTCTTTCTGCGTCCTACTAGCAGGTTTGTGCAGGGGAAACTCAGTGGAGAGGAAGATATATAT 

CCCCTTAAATAAAACAGCTCCCTGTGTTCGCCTGCTCAACGCCACTCATCAGATTGGCTGCCA 

GTCTTCAATTAGTGGAGACACAGGGGTTATCCACGTAGTAGAGAAAGAGGAGGACCTACAGTG 

GGTATTGACTGATGGCCCCAACCCCCCTTACATGGTTCTGCTGGAGAGCAAGCATTTTACCAG 

GGATTTAATGGAGAAGCTGAAAGGGAGAACCAGCCGAATTGCTGGTCTTGCAGTGTCCTTGAC 

CAAGCCCAGTCCTGCCTCAGGCTTCTCTCCTAGTGTACAGTGCCCAAATGATGGGTTTGGTGT 

TTACTCCAATTCCTATGGGCCAGAGTTTGCTCACTGCAGAGAAATACAGTGGAATTCGCTGGG 

CAATGGTTTGGCTTATGAAGACTTTAGTTTCCCCATCTTTCTTCTTGAAGATGAAAATGAAAC 

CAAAGTCATCAAGCAGTGCTATCAAGATCACAACCTGAGTCAGAATGGCTCAGCACCAACCTT 

CCCACTATGTGCCATGCAGCTCTTTTCACACATGCATGCTGTCATCAGCACTGCCACCTGCAT 

GCGGCGCAGCTCCATCCAAAGCACCTTCAGCATCAACCCAGAAATCGTCTGTGACCCCCTGTC 

TGATTACAATGTGTGGAGCATGCTAAAGCCTATAAATACAACTGGGACATTAAAGCCTGACGA 

CAGGGTTGTGGTTGCTGCCACCCGGCTGGATAGTCGTTCCTTTTTCTGGAATGTGGCCCCAGG 

GGCTGAAAGCGCAGTGGCTTCCTTTGTCACCCAGCTGGCTGCTGCTGAAGCTTTGCAAAAGGC 

ACCTGATGTGACCACCCTGCCCCGCAATGTCATGTTTGTCTTCTTTCAAGGGGAAACTTTTGA 

CTACATTGGCAGCTCGAGGATGGTCTACGATATGGAGAAGGGCAAGTTTCCCGTGCAGTTAGA 

GAATGTTGACTCATTTGTGGAGCTGGGACAGGTGGCCTTAAGAACTTCATTAGAGCTTTGGAT 

GCACACAGATCCTGTTTCTCAGAAAAATGAGTCTGTACGGAACCAGGTGGAGGATCTCCTGGC 

CACATTGGAGAAGAGTGGTGCTGGTGTCCCTGCTGTCATCCTCAGGAGGCCAAATCAGTCCCA 

GCCTCTCCCACCATCTTCCCTGCAGCGATTTCTTCGAGCTCGAAACATCTCTGGCGTTGTTCT 

GGCTGACCACTCTGGTGCCTTCCATAACAAATATTACCAGAGTATTTACGACACTGCTGAGAA 

CATTAATGTGAGCTATCCCGAATGGCTGAGCCCTGAAGAGGACCTGAACTTTGTAACAGACAC 

TGCCAAGGCCCTGGCAGATGTGGCCACGGTGCTGGGACGTGCTCTGTATGAGCTTGCAGGAGG 

AACCAACTTCAGCGACACAGTTCAGGCTGATCCCCAAACGGTTACCCGCCTGCTCTATGGGTT 

CCTGATTAAAGCCAACAACTCATGGTTCCAGTCTATCCTCAGGCAGGACCTAAGGTCCTACTT 

GGGTGACGGGCCTCTTCAACATTACATCGCTGTCTCCAGCCCCACCAACACCACTTATGTTGT 

ACAGTATGCCTTGGCAAATTTGACTGGCACAGTGGTCAACCTCACCCGAGAGCAGTGCCAGGA 

TCCAAGTAAAGTCCCAAGTGAAAACAAGGATCTGTATGAGTACTCATGGGTCCAGGGCCCTTT 

GCATTCTAATGAGACGGACCGACTCCCCCGGTGTGTGCGTTCTACTGCACGATTAGCCAGGGC 

CTTGTCTCCTGCCTTTGAACTGAGTCAGTGGAGCTCTACTGAATACTCTACATGGACTGAGAG 

CCGCTGGAAAGATATCCGTGCCCGGATATTTCTCATCGCCAGCAAAGAGCTTGAGTTGATCAC 

CCTGACAGTGGGCTTCGGCATCCTCATCTTCTCCCTCATCGTCACCTACTGCATCAATGCCAA 

AGCTGATGTCCTTTTCATTGCTCCCCGGGAGCCAGGAGCTGTGTCATACTGAGGAGGACCCCA 

GCTTTTCTTGCCAGNTCAGCAGTTCACTTCCTAGAGCATCTGTCCCACTGGGACACAACCACT 

AATTTGTCACTGGAACCTCCCTGGGCCTGTCTCAGATTGGGATTAACATAAAAGAGTGGAACT 

ATCCAAAAGAGACAGGGAGAAATAAATAAATTGCCTCCCTTCCTCCGCTCCCCTTTCCCATCA 

CCCCTTCCCCATTTCCTCTTCCTTCTCTACTCATGCCAGATTTTGGGATTACAAATAGAAGCT 

TCTTGCTCCTGTTTAACTCCCTAGTTACCCACCCTAATTTGCCCTTCAGGACCCTTCTACTTT 

TTCCTTCCTGCCCTGTACCTCTCTCTGCTCCTCACCCCCACCCCTGTACCCAGCCACCTTCCT 

GACTGGGAAGGACATAAAAGGTTTAATGTCAGGGTCAAACTACATTGAGCCCCTGAGGACAGG 

GGCATCTCTGGGCTGAGCCTACTGTCTCCTTCCCACTGTCCTTTCTCCAGGCCCTCAGATGGC 

ACATTAGGGTGGGCGTGCTGCGGGTGGGTATCCCACCTCCAGCCCACAGTGCTCAGTTGTACT 

TTTTATTAAGCTGTAATATCTATTTTTGTTTTTGTCTTTTTCCTTTATTCTTTTTGTAAATAT 

ATATATAATGAGTTTCATTAAAATAGATTATCCC 



FIGURE 202

MATAGGGSGADPGSRGLLRLLSFCVLLAGLCRGNSVERKIYIPLNKTAPCVRLLNATHQIGCQ 
SSISGDTGVIHVVEKEEDLQWVLTDGPNPPYMVLLESKHFTRDLMEKLKGRTSRIAGLAVSLT 
KPSPASGFSPSVQCPNDGFGVYSNSYGPEFAHCREIQWNSLGNGLAYEDFSFPIFLLEDENET 
KVIKQCYQDHNLSQNGSAPTFPLCAMQLFSHMHAVISTATCMRRSSIQSTFSINPEIVCDPLS 
DYNVWSMLKPINTTGTLKPDDRVVVAATRLDSRSFFWNVAPGAESAVASFVTQLAAAEALQKA
PDVTTLPRNVMFVFFQGETFDYIGSSRMVYDMEKGKFPVQLENVDSFVELGQVALRTSLELWM 
HTDPVSQKNESVRNQVEDLLATLEKSGAGVPAVILRRPNQSQPLPPSSLQRFLRARNISGVVL 
ADHSGAFHNKYYQSIYDTAENINVSYPEWLSPEEDLNFVTDTAKALADVATVLGRALYELAGG 
TNFSDTVQADPQTVTRLLYGFLIKANNSWFQSILRQDLRSYLGDGPLQHYIAVSSPTNTTYVV 
QYALANLTGTVVNLTREQCQDPSKVPSENKDLYEYSWVQGPLHSNETDRLPRCVRSTARLARA
LSPAFELSQWSSTEYSTWTESRWKDIRARIFLIASKELELITLTVGFGILIFSLIVTYCINAK 
ADVLFIAPREPGAVSY 

Signal peptide:

amino acids 1-33 

Transmembrane domain:

amino acids 671-692 

N-glycosylation sites. 

amino acids 45-49, 55-59, 187-191, 200-204, 204-208, 264-268, 

387-391, 417-421, 435-439, 464-468, 506-510, 530-534, 562-566, 
573-577, 580-584, 612-616 

Glycosaminoglycan attachment site. 

amino acids 404-408 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 232-236 



N-myristoylation sites.

amino acids 5-11, 6-12, 9-15, 29-35, 61-67, 120-126, 146-152,
168-174, 205-211, 294-300, 438-444, 446-452, 504-510, 576-582
FIGURE 203



GCTAGACCGAGCCCTGGGAGGCTACGGGCTCCCCCGGAAACCCTGCCAGGGGAGCCGGGTTTT 

GAGCTCAGGCGCCTCTAGCGGCGGCCCCCAGAAATCTGACTCGCGAGGCCAGAGTTGCAGGGA 

CTGAATAGCAAACTGAGGCTGAGTAGGGAACAGACCATGAGGTCAGTGCAGATCTTCCTCTCC 

CAATGCCGTTTGCTCCTTCTACTAGTTCCGACAATGCTCCTTAAGTCTCTTGGCGAAGATGTA 

ATTTTTCACCCTGAAGGGGAGTTTGACTCGTATGAAGTCACCATTCCTGAGAAGCTGAGCTTC 

CGGGGAGAGGTGCAGGGTGTGGTCAGTCCCGTGTCCTACCTACTGCAGTTAAAAGGCAAGAAG 

CACGTCCTCCATTTGTGGCCCAAGAGACTTCTGTTGCCCCGACATCTGCGCGTTTTCTCCTTC 

ACAGAACATGGGGAACTGCTGGAGGATCATCCTTACATACCAAAGGACTGCAACTACATGGGC 

TCCGTGAAAGAGTCTCTGGACTCTAAAGCTACTATAAGCACATGCATGGGGGGTCTCCGAGGT 

GTATTTAACATTGATGCCAAACATTACCAAATTGAGCCCCTCAAGGCCTCTCCCAGTTTTGAA 

CATGTCGTCTATCTCCTGAAGAAAGAGCAGTTTGGGAATCAGGTTTGTGGCTTAAGTGATGAT 

GAAATAGAATGGCAGATGGCCCCTTATGAGAATAAGGCGAGGCTAAGGGACTTTCCTGGATCC 

TATAAACACCCAAAGTACTTGGAATTGATCCTACTCTTTGATCAAAGTAGGTATAGGTTTGTG 

AACAACAATCTTTCTCAAGTCATACATGATGCCATTCTTTTGACTGGGATTATGGACACCTAC 

TTTCAAGATGTTCGTATGAGGATACACTTAAAGGCTCTTGAAGTATGGACAGATTTTAACAAA 

ATACGCGTTGGATATCCAGAGTTAGCTGAAGTTTTAGGCAGATTTGTAATATATAAAAAAAGT 

GTATTAAATGCTCGCCTGTCATCAGATTGGGCACATTTATATCTTCAAAGAAAATATAATGAT

GCTCTTGCATGGTCGTTTGGAAAAGTGTGTTCTCTAGAATATGCTGGATCAGTGAGTACTTTA 

CTAGATACAAATATCCTTGCCCCTGCTACCTGGTCTGCTCATGAGCTGGGTCATGCTGTAGGA 

ATGTCACATGATGAACAATACTGCCAATGTAGGGGTAGGCTTAATTGCATCATGGGCTCAGGA 

CGCACTGGGTTTAGCAATTGCAGTTATATCTCTTTTTTTAAACATATCTCTTCGGGAGCAACA 

TGTCTAAATAATATCCCAGGACTAGGTTATGTGCTTAAGAGATGTGGAAACAAAATTGTGGAG 

GACAATGAGGAATGTGACTGTGGTTCCACAGAGGAGTGTCAGAAAGATCGGTGTTGCCAATCA 

AATTGTAAGTTGCAACCAGGTGCCAACTGTAGCATTGGACTTTGCTGTCATGATTGTCGGTTT 

CGTCCATCTGGATACGTGTGTAGGCAGGAAGGAAATGAATGTGACCTTGCAGAGTACTGCGAC 

GGGAATTCAAGTTCCTGCCCAAATGACGTTTATAAGCAGGATGGAACCCCTTGCAAGTATGAA 

GGCCGTTGTTTCAGGAAGGGGTGCAGATCCAGATATATGCAGTGCCAAAGCATTTTTGGACCT 

GATGCCATGGAGGCTCCTAGTGAGTGCTATGATGCAGTTAACTTAATAGGTGATCAATTTGGT 

AACTGTGAGATTACAGGAATTCGAAATTTTAAAAAGTGTGAAAGTGCAAATTCAATATGTGGC 

AGGCTACAGTGTATAAATGTTGAAACCATCCCTGATTTGCCAGAGCATACGACTATAATTTCT 

ACTCATTTACAGGCAGAAAATCTCATGTGCTGGGGCACAGGCTATCATCTATCCATGAAACCC 

ATGGGAATACCTGACCTAGGTATGATAAATGATGGCACCTCCTGTGGAGAAGGCCGGGTATGT 

TTTAAAAAAAATTGCGTCAATAGCTCAGTCCTGCAGTTTGACTGTTTGCCTGAGAAATGCAAT 

ACCCGGGGTGTTTGCAACAACAGAAAAAACTGCCACTGCATGTATGGGTGGGCACCTCCATTC 

TGTGAGGAAGTGGGGTATGGAGGAAGCATTGACAGTGGGCCTCCAGGACTGCTCAGAGGGGCG 

ATTCCCTCGTCAATTTGGGTTGTGTCCATCATAATGTTTCGCCTTATTTTATTAATCCTTTCA 

GTGGTTTTTGTGTTTTTCCGGCAAGTGATAGGAAACCACTTAAAACCCAAACAGGAAAAAATG 

CCACTATCCAAAGCAAAAACTGAACAGGAAGAATCTAAAACAAAAACTGTACAGGAAGAATCT 

AAAACAAAAACTGGACAGGAAGAATCTGAAGCAAAAACTGGACAGGAAGAATCTAAAGCAAAA 

ACTGGACAGGAAGAATCTAAAGCAAACATTGAAAGTAAACGACCCAAAGCAAAGAGTGTCAAG 

AAACAAAAAAAGTAACCGGGCAATCCATACTCATTCAGTAACACAGGCTCATTTATTTAACCA 

GCTAATCATTTATCCAAAGGCTTTCCATTCTTCTCCCAATATTTTTTTACTTTAATTTTTCCC 

ACAAGTTTTGATCAGCAAATAAACAGCATTCTTGTTTTGGAAACAAAAA 



FIGURE 204

MRSVQIFLSQCRLLLLLVPTMLLKSLGEDVIFHPEGEFDSYEVTIPEKLSFRGEVQGVVSPVS 
YLLQLKGKKHVLHLWPKRLLLPRHLRVFSFTEHGELLEDHPYIPKDCNYMGSVKESLDSKATI 
STCMGGLRGVFNIDAKHYQIEPLKASPSFEHVVYLLKKEQFGNQVCGLSDDEIEWQMAPYENK 
ARLRDFPGSYKHPKYLELILLFDQSRYRFVNNNLSQVIHDAILLTGIMDTYFQDVRMRIHLKA 
LEVWTDFNKIRVGYPELAEVLGRFVIYKKSVLNARLSSDWAHLYLQRKYNDALAWSFGKVCSL 
EYAGSVSTLLDTNILAPATWSAHELGHAVGMSHDEQYCQCRGRLNCIMGSGRTGFSNCSYISF 
FKHISSGATCLNNIPGLGYVLKRCGNKIVEDNEECDCGSTEECQKDRCCQSNCKLQPGANCSI 
GLCCHDCRFRPSGYVCRQEGNECDLAEYCDGNSSSCPNDVYKQDGTPCKYEGRCFRKGCRSRY 
MQCQSIFGPDAMEAPSECYDAVNLIGDQFGNCEITGIRNFKKCESANSICGRLQCINVETIPD 
LPEHTTIISTHLQAENLMCWGTGYHLSMKPMGIPDLGMINDGTSCGEGRVCFKKNCVNSSVLQ 
FDCLPEKCNTRGVCNNRKNCHCMYGWAPPFCEEVGYGGSIDSGPPGLLRGAIPSSIWVVSIIM
FRLILLILSVVFVFFRQVIGNHLKPKQEKMPLSKAKTEQEESKTKTVQEESKTKTGQEESEAK
TGQEESKAKTGQEESKANIESKRPKAKSVKKQKK 

Signal peptide: 

amino acids 1-27
Transmembrane domain:
amino acids 684-705
N-glycosylation sites.

amino acids 222-226, 372-376, 438-442, 473-477, 625-629
N-myristoylation sites. 

amino acids 131-137, 168-174, 235-241, 319-325, 364-370, 436-442, 

472-478, 609-615, 642-648, 668-674, 676-680, 680-686, 749-755, 

758-764, 767-773 

Amidation site. 

amino acids 69-73 

Disintegrins proteins 

amino acids 429-479 

EGF-like domain proteins 

amino acids 650-662 

Neutral zinc metallopeptidases, zinc-binding region proteins

amino acids 335-345
FIGURE 205

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGGGAAGGTTGAATGGGGTAGAAGGCCTG 

TTGTGGAGGGAAACCACCCATCCTCCTGCCTCCCACCACCACCATCATCCTGGCTGGACGGAG 

AGGGTGACGGGGGCTGGGAAGGGGCAGCTCATGTTCAGGTTTCCAGGAGGGGCTACCTGTTGA 

CTGTCTTTGCAGGAAGAAGAAAACACCTGAGTGACCAGATGTCCCAGCTCCAGGTGCCTTGCC 

AGATGGCCAGAACCACACCTCTTGAAGAGTGACAGTGCTGTGGAGCATGGTTTCTGCACACCT 

GGAATGACTGGAACCCCAAAGACTCAAGAAGGAGCTAAAGATCTTGAAGTAGACATGAATAAA 

ACAGAAGGCTGTGGACCACCTGTCGAGATGGAGAAGTCCTTCTGAGGCTATCCAAACACGGAC 

CAGGCCATGAGACCCCGATGACCATCCCTGAATTTTTTCGAGAGTCAGTCAACCGATTTGGAA 

CTTATCCAGCCCTCCCATCCAAGAATGGCAAAAAGTGGGAAATTCTGAATTTCAACCAGTACT 

ATGAGGCTTGTCGGAAGGCTGCAAAATCCTTGATCAAGCTGGGTTTGGAGCGTTTCCACGGAG 

TTGGTATCCTGGGGTTTAACTCTGCAGAGTGGTTTATCACTGCTGTTGGTGCCATCCTAGCCG 

GGGGTCTTTGTGTTGGTATTTATGCCACCAACTCTGCCGAGGCTTGTCAATATGTCATCACTC 

ATGCCAAAGTGAACATCTTGCTGGTTGAGAATGATCAACAGTTACAGAAAATCCTTTCGATTC 

CACAGAGCAGCCTAGAGCCCCTAAAAGCGATCATCCAGTACAGACTGCCAATGAAGAAGAACA 

ACAACTTGTACTCTTGGGATGATTTCATGGAACTTGGCAGAAGTATCCCTGACACCCAACTGG 

AGCAGGTCATCGAGAGCCAGAAGGCGAATCAATGCGCAGTGCTCATCTACACTTCAGGGACCA 

CAGGCATACCCAAGGGAGTGATGCTCAGTCATGACAACATCACGTGGATTGCAGGAGCAGTGA 

CAAAGGACTTTAAACTGACAGACAAGCATGAGACGGTGGTTAGCTACCTCCCACTCAGCCATA 

TTGCAGCACAGATGATGGACATCTGGGTACCCATAAAGATTGGGGCGCTCACATACTTTGCTC 

AAGCAGATGCTCTCAAGGGCACCTTGGTAAGTACTCTAAAGGAGGTAAAACCTACTGTCTTCA 

TTGGAGTGCCTCAAATTTGGGAGAAGATACATGAGATGGTGAAGAAAAATAGTGCCAAGTCCA 

TGGGCTTGAAGAAGAAGGCATTCGTGTGGGCAAGAAACATTGGCTTCAAGGTCAACTCAAAAA 

AGATGTTGGGGAAATATAATACTCCCGTGAGCTACCGCATGGCTAAGACTCTCGTGTTCAGCA 

AAGTCAAGACATCCCTTGGCTTGGATCACTGTCACTCTTTTATCAGTGGGACTGCGCCCCTCA 

AGCAAGAGACTGCCGAGTTCTTTCTAAGCTTGGACATACCTATAGGCGAGTTGTATGGGTTGA 

GTGAGAGCTCGGGACCCCACACGATATCCAACCAGAATAACTACAGGCTTCTAAGCTGTGGCA 

AGATCTTGACTGGGTGTAAGAATATGCTGTTCCAGCAGAACAAGGATGGCATTGGGGAGATCT 

GCCTCTGGGGTAGGCACATCTTCATGGGCTATCTGGAAAGTGAGACTGAAACTACAGAGGCCA 

TCGATGATGAAGGCTGGCTACACTCTGGGGATCTGGGCCAGCTGGACGGTCTGGGTTTCCTCT 

ATGTCACCGGCCACATCAAAGAAATCCTTATCACTGCTGGTGGTGAAAATGTGCCCCCCATTC 

CTGTTGAGACCTTGGTTAAGAAGAAGATCCCCATCATCAGTAACGCCATGTTAGTAGGAGATA 

AACTGAAGTTTCTGAGCATGTTGCTGACGCTGAAGTGTGAGATGAATCAGATGAGCGGAGAAC 

CTCTGGACAAGCTGAACTTCGAGGCCATCAACTTCTGTCGGGGTCTGGGCAGCCAGGCATCCA 

CCGTGACTGAGATTGTGAAGCAGCAAGACCCCCTGGTCTACAAGGCCATCCAGCAAGGCATCA 

ATGCTGTGAACCAGGAAGCCATGAACAATGCACAGAGGATTGAAAAGTGGGTCATCTTGGAGA 

AGGACTTTTCCATCTATGGTGGAGAGCTAGGTCCAATGATGAAACTTAAGAGACATTTTGTAG 

CCCAGAAATACAAAAAACAAATTGATCACATGTACCACTGACTGCTTTGATGGAGCTGCTCTC 

AGCTGTTCTGATGCCTTCAGCAGGAAGACCTCATTGCAATAAGTGAAATGCTGCTCTAGGTAG 

AAGCTCTCCCTGCTGTTTTTAAGAAGCCACATTCCTCATTGGTCAGTTTCTTGATTGTTCGTC 

TGTTGGAGAGGTGCTCCCTAGAAGAACCTGCCATACGTTTCAAAGCAATAAAATCACTGTATA 

TCTTTCTAAGGACCTTCAAGTCATGACTCGAGGGAAGCCTATTGGGAAGTCTACTAAAAACTG 

CCTGATTTACAAGAAAGACCTGAACTTGTGGGCTCCCATTTGATTTTTTTCTCCTCAGGGGAC 

TCAGACATTAGAAAGAAAAAGCCTCACAGATTTGAAGAACTGGACCCCCAAATCAACTCACCT 

GCCTGGAAGCAACTGGGAAACCCTTCCAATAAGTCCTGATAATAAAGCACTTCAGGGTCCCAA 
AAAAAAAAAA 
FIGURE 206



MTIPEFFRESVNRFGTYPALPSKNGKKWEILNFNQYYEACRKAAKSLIKLGLERFHGVGILGF 
NSAEWFITAVGAILAGGLCVGIYATNSAEACQYVITHAKVNILLVENDQQLQKILSIPQSSLE 
PLKAIIQYRLPMKKNNNLYSWDDFMELGRSIPDTQLEQVIESQKANQCAVLIYTSGTTGIPKG 
VMLSHDNITWIAGAVTKDFKLTDKHETVVSYLPLSHIAAQMMDIWVPIKIGALTYFAQADALK
GTLVSTLKEVKPTVFIGVPQIWEKIHEMVKKNSAKSMGLKKKAFVWARNIGFKVNSKKMLGKY 
NTPVSYRMAKTLVFSKVKTSLGLDHCHSFISGTAPLNQETAEFFLSLDIPIGELYGLSESSGP 
HTISNQNNYRLLSCGKILTGCKNMLFQQNKDGIGEICLWGRHIFMGYLESETETTEAIDDEGW 
LHSGDLGQLDGLGFLYVTGHIKEILITAGGENVPPIPVETLVKKKIPIISNAMLVGDKLKFLS 
MLLTLKCEMNQMSGEPLDKLNFEAINFCRGLGSQASTVTEIVKQQDPLVYKAIQQGINAVNQE 
AMNNAQRIEKWVILEKDFSIYGGELGPMMKLKRHFVAQKYKKQIDHMYH 

Signal peptide: 
amino acids 1-22 
Transmembrane domain: 
amino acids 65-86 
N-glycosylation site.
amino acids 196-200 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 282-286 

Tyrosine kinase phosphorylation sites. 

amino acids 547-555, 608-616 
N-myristoylation sites. 

amino acids 15-21, 74-80, 80-86, 84-90, 185-191, 189-195, 
253-259, 337-343, 371-377, 448-454, 536-542 
Amidation site.
amino acids 24-28

Putative AMP-binding domain signature.

amino acids 177-189 

Putative AMP-binding domain proteins. 

amino acids 173-190 
FIGURE 207

CCCACGCGTCCGCCCACGCGTCCGCGGACGCGTGGGGCCAGATCGCGGCCGGCGCCAGCGCCA 

CCGTCCGGTCCACCCGCCAGCCCGCACAGCCGCGCCGCCGCCGAGCGTTTCGTGAGCGGCGCT 

CCGAGGATCAGGAATGGGGCTTCGGGCGCTGGGCGCGCTCCGAACCCGGCGCACGTAAGAGCC 

TGGGAGCGCCCGAGCCGCCCGGCTGCCCGGAGCCCCATCGCCTAGGACCGGGAGATGCTGGAA 

ATGCAACCGCCTGTTCCCCGAGGAGCCGCTGCCCCCGGGACCCCCTGGCACTGTGCGCACCCT 

GGTCAGCAGCCCCCGGAGAAGACGGCGCCCCCAACGCCCGACCCGCGTGGCCGTGGCAGCGCC 

ACGCGAGCCCTCTAGGCGACCGCAGGGCCACAGCAGCTCAGCCGCCGGTGCCCCCTCGGAAAC 

CATGACCCCCGGCGCGGGCCCATGGAGCCATGGCCTATAGGGTCCTGGGCCGCGCGGGGCCAC 

CTCAGCCGCGGAGGGCGCGCAGGCTGCTCTTCGCCTTCACGCTCTCGCTCTCCTGCACTTACC 

TGTGTTACAGCTTCCTGTGCTGCTGCGACGACCTGGGTCGGAGCCGCCTCCTCGGCGCGCCTC 

GCTGCCTCCGCGGCCCCAGCGCGGGCGGCCAGAAACTTCTCCAGAAGTCCCGCCCCTGTGATC 

CCTCCGGGCCGACGCCCAGCGAGCCCAGCGCTCCCAGCGCGCCCGCCGCCGCCGTGCCCGCCC 

CTCGCCTCTCCGGTTCCAACCACTCCGGCTCACCCAAGCTGGGTACCAAGCGGTTGCCCCAAG 

CCCTCATTGTGGGCGTGAAGAAGGGGGGCACCCGGGCCGTGCTGGAGTTTATCCGAGTACACC 

CGGACGTGCGGGCCTTGGGCACGGAACCCCACTTCTTTGACAGGAACTACGGCCGCGGGCTGG 

ATTGGTACAGGAGCCTGATGCCCAGGACCCTCGAGAGCCAGATCACGCTGGAGAAGACGCCCA 

GCTACTTTGTCACTCAAGAGGCTCCTCGACGCATCTTCAACATGTCCCGAGACACCAAGCTGA 

TCGTGGTTGTGCGGAACCCTGTGACCCGTGCCATCTCTGATTACACGCAGACACTCTCCAAGA 

AGCCCGACATCCCGACCTTTGAGGGCCTCTCCTTCCGCAACCGCACCCTGGGCCTGGTGGACG 

TGTCATGGAACGCCATCCGCATCGGCATGTACGTGCTGCACCTGGAGAGCTGGCTGCAGTACT 

TCCCGCTAGCTCAGATTCACTTCGTCAGTGGCGAGCGACTCATCACTGACCCGGCCGGCGAGA 

TGGGGCGAGTCCAGGACTTCCTGGGCATTAAGAGATTCATCACGGACAAGCACTTCTATTTCA 

ACAAGACCAAAGGATTCCCTTGCTTGAAAAAAACAGAATCGAGCCTCCTGCCTCGATGCTTGG 

GCAAATCAAAAGGGAGAACTCATGTACAGATTGATCCTGAAGTGATAGACCAGCTCCGAGAAT 

TTTATAGACCGTATAATATCAAATTTTATGAAACCGTTGGGCAGGACTTCAGGTGGGAATAAG 

CCCACGAAAGGAAAGGGCTCTCAAGGGCTCTTCTGCTCATCTCTTCCGTGAGATTTGCTCCCA 

GACCCTCTGATCTCCCTCCAACAAACCCTGGCTCCAGCCCCCTTTCCCAACTTGAGTTGCATC 

ATCTTGGAACCAGGAAGCCCAGCTAAAGCCAAGAGACCAGAGAGTCCCTGCCACTAGTTTTCA 

TCAGTCTGTTCAAGCAAAGTTGATCTGCTCCTGGCACGTCCAGTAAATTCCAGAATCATTCTC 

CTTTCTGCCCATAAAGGGCCTTGGAGAATTGCTTTAAGAAGAGTGAATGTTCCAATGATGATA 

GATATTATAAGCGATGATGGTTCTGTTGCTATGAACACAGCAGTCGGTCCCTGTCATTGTCCA 

CCCAGGAGTGGCCTTGTTAATTCCAAGTGGCATGTATCTTCCCTCTGAGCTTCATTTCTTCAA 

GATGCTCTGGGTGGTGGGATGGGAGACCATCCTCAGCCCTCCTCAGACCTTATCAATTCATTG 

AGAGATTGCAAAGCTGAAAGCACCTCCGGCCACTCCTGGGAGACAGACCCTTTGGTGATGAAA 

TAAACCAGTGACTTCAGAGCCTATGGTCTCAACTGTGCTTGAAAAACACTGTCTCTGAAAACA 

ACTTTGTGATTCTCCCTGCTCCCTGTGGACAAAAGCACATAATTCTGCTGTTACGGGTACTTT 

GCTCATACGAGCTTTCATGTTCAGCATGCAATGGAATCATGCTTGTCCATGTGAAATAAATAT 

GGCTCTCTCGTGTCCTTAATGCTGGGCTTTTCTCTGTAAGCTGGTTCTGCAGCACAATTCATT 

AATTAAACTTCTCCCAGTGCAAGAAGGCAGCTGGTGCTGGGGGTGGTCTGGGGGGTCAGGGAG 

GAGGGCAAGGACTACATGGGGCAGAGGCAAGGCGGTGGTGGAGATGAGGAAAGAAGTTCTTCT 

TGGCAGAAGCTGGGGCAGAAAGATCACATGAGATCTGTGGGGACACCCTCTATCTGAAACATA 

AGTCTGTGTTCATTCTCTGCTTAGAAATTTTAGATCTGAAGTGCTACACTGAAGGTCCGAAGG 

TTGATGGGGCATCAGATATCTTTTTGGTTGGCCAGCATGATATTTTGAAATAACTGTCAACAG 

TTAGAAACTGGGAGCATTCATATGTAAAAAATATGGATTTTCAGCTTCTTCTTAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAA 
FIGURE 208

MAYRVLGRAGPPQPRRARRLLFAFTLSLSCTYLCYSFLCCCDDLGRSRLLGAPRCLRGPSAGG 
QKLLQKSRPCDPSGPTPSEPSAPSAPAAAVPAPRLSGSNHSGSPKLGTKRLPQALIVGVKKGG 
TRAVLEFIRVHPDVRALGTEPHFFDRNYGRGLDWYRSLMPRTLESQITLEKTPSYFVTQEAPR 
RIFNMSRDTKLIVVVRNPVTRAISDYTQTLSKKPDIPTFEGLSFRNRTLGLVDVSWNAIRIGM
YVLHLESWLQYFPLAQIHFVSGERLITDPAGEMGRVQDFLGIKRFITDKHFYFNKTKGFPCLK 
KTESSLLPRCLGKSKGRTHVQIDPEVIDQLREFYRPYNIKFYETVGQDFRWE 

Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 102-106, 193-197, 235-239, 306-310

Tyrosine kinase phosphorylation site. 

amino acids 296-305 

N-myristoylation sites. 

amino acids 51-57, 100-106, 121-127, 125-131 

Prokaryotic membrane lipoprotein lipid attachment site.

amino acids 20-31
FIGURE 209

CTTTCCTTATCTGTGTGTACTCTTATCTCACTGTTCTATTTTTTCTCCTCATTTATATTAACT 

CTTTCTTACCTTTTTTTCTGAACTTCTAGGCCTTCTCTTTCCAGAACTGGTGGAAGACAAATG 

AAACGGCCAAGATGGTAAGAAACAAGCCGCATTTCTCCTTGGGGAGACTGATAATTTAAAAGG 

TTTGTTGTGTCAGAAACATTCCCAGCTTCATCACCAACCCTTTCCTTCCACCTCTGCCCACTG 

GAGACCACTTACATCCCGAAGCGGACGCGGCAGCTGAAGTCAGGAAACCATGCATCACATTAG 

CAGGAGCCAACTGCAGACTTTAAACTCCGTTCAACATGTGGATGCGGCAGAGAAATGACCTGT 

CCAGACAAGCCGGGGCAGCTCATAAACTGGTTCATCTGCTCCCTGTGCGTCCCGCGGGTGCGT 

AAGCTCTGGAGCAGCCGGCGTCCAAGGACCCGGAGAAACCTTCTGCTGGGCACTGCGTGTGCC 

ATCTACTTGGGCTTCCTGGTGAGCCAGGTGGGGAGGGCCTCTCTCCAGCATGGACAGGCGGCT 

GAGAAGGGGCCACATCGCAGCCGCGACACCGCCGAGCCATCCTTCCCTGAGATACCCCTGGAT 

GGTACCCTGGCCCCTCCAGAGTCCCAGGGCAATGGGTCCACTCTGCAGCCCAATGTGGTGTAC 

ATTACCCTACGCTCCAAGCGCAGCAAGCCGGCCAATATCCGTGGCACCGTGAAGCCCAAGCGC 

AGGAAAAAGCATGCAGTGGCATCGGCTGCCCCAGGGCAGGAGGCTTTGGTCGGACCATCCCTT 

CAGCCGCAGGAAGCGGCAAGGGAAGCTGATGCTGTAGCACCTGGGTACGCTCAGGGAGCAAAC 

CTGGTTAAGATTGGAGAGCGACCCTGGAGGTTGGTGCGGGGTCCGGGAGTGCGAGCCGGGGGC 

CCAGACTTCCTGCAGCCCAGCTCCAGGGAGAGCAACATTAGGATCTACAGCGAGAGCGCCCCC 

TCCTGGCTGAGCAAAGATGACATCCGAAGAATGCGACTCTTGGCGGACAGCGCAGTGGCAGGG 

CTCCGGCCTGTGTCCTCTAGGAGCGGAGCCCGTTTGCTGGTGCTGGAGGGGGGCGCACCTGGC 

GCTGTGCTCCGCTGTGGCCCTAGCCCCTGTGGGCTTCTCAAGCAGCCCTTGGACATGAGTGAG 

GTGTTTGCCTTCCACCTAGACAGGATCCTGGGGCTCAACAGGACCCTGCCGTCTGTGAGCAGG 

AAAGCAGAGTTCATCCAAGATGGCCGCCCATGCCCCATCATTCTTTGGGATGCATCTTTATCT 

TCAGCAAGTAATGACACCCATTCTTCTGTTAAGCTCACCTGGGGAACTTATCAGCAGTTGCTG 

AAACAGAAATGCTGGCAGAATGGCCGAGTACCCAAGCCTGAATCAGGTTGTACTGAAATACAT 

CATCATGAGTGGTGCAAGATGGCACTCTTTGATTTTTTGTTACAGATTTATAATCGCTTAGAT 

ACAAATTGCTGTGGATTCAGACCTCGCAAGGAAGATGCCTGTGTACAGAATGGATTGAGGCCA 

AAATGTGATGACCAAGGTTCTGCGGCTCTAGCACACATTATCCAGCGAAAGCATGACCCAAGG 

CATTTGGTTTTTATAGACAACAAGGGTTTCTTTGACAGGAGTGAAGATAACTTAAACTTCAAA 

TTGTTAGAAGGCATCAAAGAGTTTCCAGCTTCTGCAGTTTCTGTTTTGAAGAGCCAGCACTTA 

CGGCAGAAACTTCTTCAGTCTCTGTTTCTTGATAAAGTGTATTGGGAAAGTCAAGGAGGTAGA 

CAAGGAATTGAAAAGCTTATCGATGTAATAGAACACAGAGCCAAAATTCTTATCACCTATATC 

AATGCACACGGGGTCAAAGTATTACCTATGAATGAATGACAAAAGAATCTTCTGGCTAGGGTG

TTAGATATATTTATGCATTTTTGGTTTTGTTTTTAAATCAAGCACATCAACCTCAAGCCCGTT 

TAGCAATGAGGCAGTGTAGATGAATACGTAAAATAAATGACTTTAACCAAGTAGCTATAAAGG 

GACTTAGCACTGTATGCATACTTAAAAAGGTTTTGAAAAACAAACTACTTGAGAAATATTTGT 

TTATATTTTTCTCTAACATCATGCTATGTGTCAGTCTGAACATCTGACAACAGAAATTTCAGT 

TATTATTCTAGCTAAGTTTTGAAAACATTTGTCATGCTGTTTAATAGAAAACTGCAAACCAGA 

GATACTGACTCCATTAATAAACCATATTTTGTGCCGTTTTGACTGTTCTGACCAAATACTAAT 

GGGAACAATTCTTGACGTTTTTCTGTTGCTGATTGTTAACATAGAGCAGTCTCTACACTACCC 

TGAGGCAACTCTACATTGGAACACTGAGGCTTACAGCCTGCAAGAGCATCAGAGCTGACCATA 

CATTTAAACAGAAATGCTGGTTTATTTGCAAAATCACCAGTATATTTTCTATTGTGTCTATAA 

AAAATCAGTCATTTAAGTACAAGAATCATATTTTCCATTCCTTTTTAGAAATTTATTTTGTTG 

TCCCTATGGAAATCATTCACATCTGACAATTTATATGTTAAAGAGTTTTACTCTCTCTATTTT 

GGTCCAATTTGTATCTAGTGGCTGAGAAATTAAATAATTCTAAAGTATGAAGTTACCTATCTG 

AAAATGTACTTACAGAGTATCATTTTAAAATGGATGTCTCTTTAAAAATTTTGTTACTTTTAC 

CAACAATGTAATATAATTTATGTATATTTTATTAATAATAGTGAATTCCTTAAAATTTGTTCT 

ATGTACTTATATTTAATTTGATTTAATGGTTACTGCCCAGATATTGAGAAATGGTTCAAATAT 
TGAGTGTGTTTCAATAA 
FIGURE 210

MTCPDKPGQLINWFICSLCVPRVRKLWSSRRPRTRRNLLLGTACAIYLGFLVSQVGRASLQHG 

QAAEKGPHRSRDTAEPSFPEIPLDGTLAPPESQGNGSTLQPNVVYITLRSKRSKPANIRGTVK

PKRRKKHAVASAAPGQEALVGPSLQPQEAAREADAVAPGYAQGANLVKIGERPWRLVRGPGVR 

AGGPDFLQPSSRESNIRIYSESAPSWLSKDDIRRMRLLADSAVAGLRPVSSRSGARLLVLEGG 

APGAVLRCGPSPCGLLKQPLDMSEVFAFHLDRILGLNRTLPSVSRKAEFIQDGRPCPIILWDA 

SLSSASNDTHSSVKLTWGTYQQLLKQKCWQNGRVPKPESGCTEIHHHEWSKMALFDFLLQIYN 

RLDTNCCGFRPRKEDACVQNGLRPKCDDQGSAALAHIIQRKHDPRHLVFIDNKGFFDRSEDNL 

NFKLLEGIKEFPASAVSVLKSQHLRQKLLQSLFLDKVYWESQGGRQGIEKLIDVIEHRAKILI 
TYINAHGVKVLPMNE 

Transmembrane domain: 

amino acids 40-56 

N-glycosylation sites.

amino acids 98-102, 289-293, 322-326 

N-myristoylation sites. 

amino acids 8-14, 41-47, 97-103, 187-193, 251-257, 252-258, 
287-293, 484-490 
FIGURE 211

GTGGGGTGGTGAGCGCAGCGCCGAGGATGAGGAGGTGCAACAGCGGCTCCGGGCCGCCGCCGTCGCTGCTGCTGC 

TGCTGCTGTGGCTGCTCGCGGTTCCCGGCGCTAACGCGGCCCCGCGGTCGGCGCTCTATTCGCCTTCCGACCCGC 

TGACGCTGCTGCAGGCGGACACGGTGCGCGGCGCGGTGCTGGGCTCCCGCAGCGCCTGGGCCGTGGAGTTCTTCG 

CCTCCTGGTGCGGCCACTGCATCGCCTTCGCCCCGACGTGGAAGGCGCTGGCCGAAGACGTCAAAGCCTGGAGGC 

CGGCCCTGTATCTCGCCGCCCTGGACTGTGCTGAGGAGACCAACAGTGCAGTCTGCAGAGACTTCAACATCCCTG 

GCTTCCCGACTGTGAGGTTCTTCAAGGCCTTTACCAAGAACGGCTCGGGAGCAGTATTTCCAGTGGCTGGTGCTG 

ACGTGCAGACGCTGCGGGAGAGGCTCATTGACGCCCTGGAGTCCCATCATGACACGTGGCCCCCAGCCTGTCCCC 

CACTGGAGCCTGCCAAGCTGGAGGAGATTGATGGATTCTTTGCGAGAAATAACGAAGAGTACCTGGCTCTGATCT 

TTGAAAAGGGAGGCTCCTACCTGGGTAGAGAGGTGGCTCTGGACCTGTCCCAGCACAAAGGCGTGGCGGTGCGCA 

GGGTGCTGAACACAGAGGCCAATGTGGTGAGAAAGTTTGGTGTCACCGACTTCCCCTCTTGCTACCTGCTGTTCC 

GGAATGGCTCTGTCTCCCGAGTCCCCGTGCTCATGGAATCCAGGTCCTTCTATACCGCTTACCTGCAGAGACTCT 

CTGGGCTCACCAGGGAGGCTGCCCAGACCACAGTTGCACCAACCACTGCTAACAAGATAGCTCCCACTGTTTGGA 

AATTGGCAGATCGCTCCAAGATCTACATGGCTGACCTGGAATCTGCACTGCACTACATCCTGCGGATAGAAGTGG 

GCAGGTTCCCGGTCCTGGAAGGGCAGCGCCTGGTGGCCCTGAAAAAGTTTGTGGCAGTGCTGGCCAAGTATTTCC 

CTGGCCGGCCCTTAGTCCAGAACTTCCTGCACTCCGTGAATGAATGGCTCAAGAGGCAGAAGAGAAATAAAATTC 

CCTACAGTTTCTTTAAAACTGCCCTGGACGACAGGAAAGAGGGTGCCGTTCTTGCCAAGAAGGTGAACTGGATTG 

GCTGCCAGGGGAGTGAGCCGCATTTCCGGGGCTTTCCCTGCTCCCTGTGGGTCCTCTTCCACTTCTTGACTGTGC 

AGGCAGCTCGGCAAAATGTAGACCACTCACAGGAAGCAGCCAAGGCCAAGGAGGTCCTCCCAGCCATCCGAGGCT 

ACGTGCACTACTTCTTCGGCTGCCGAGACTGCGCTAGCCACTTCGAGCAGATGGCTGCTGCCTCCATGCACCGGG 

TGGGGAGTCCCAACGCCGCTGTCCTCTGGCTCTGGTCTAGCCACAACAGGGTCAATGCTCGCCTTGCAGGTGCCC 

CCAGCGAGGACCCCCAGTTCCCCAAGGTGCAGTGGCCACCCCGTGAACTTTGTTCTGCCTGCCACAATGAACGCC 

TGGATGTGCCCGTGTGGGACGTGGAAGCCACCCTCAACTTCCTCAAGGCCCACTTCTCCCCAAGCAACATCATCC 

TGGACTTCCCTGCAGCTGGGTCAGCTGCCCGGAGGGATGTGCAGAATGTGGCAGCCGCCCCAGAGCTGGCGATGG

GAGCCCTGGAGCTGGAAAGCCGGAATTCAACTCTGGACCCTGGGAAGCCTGAGATGATGAAGTCCCCCACAAACA 

CCACCCCACATGTGCCGGCTGAGGGACCTGAGGCAAGTCGACCCCCGAAGCTGCACCCTGGCCTCAGAGCTGCAC 

CAGGCCAGGAGCCTCCTGAGCACATGGCAGAGCTTCAGAGGAATGAGCAGGAGCAGCCGCTTGGGCAGTGGCACT 

TGAGCAAGCGAGACACAGGGGCTGCATTGCTGGCTGAGTCCAGGGCTGAGAAGAACCGCCTCTGGGGCCCTTTGG 

AGGTCAGGCGCGTGGGCCGCAGCTCCAAGCAGCTGGTCGACATCCCTGAGGGCCAGCTGGAGGCCCGAGCTGGAC 

GGGGCCGAGGCCAGTGGCTGCAGGTGCTGGGAGGGGGCTTCTCTTACCTGGACATCAGCCTCTGTGTGGGGCTCT 

ATTCCCTGTCCTTCATGGGCCTGCTGGCCATGTACACCTACTTCCAGGCCAAGATAAGGGCCCTGAAGGGCCATG 

CTGGCCACCCTGCAGCCTGAACCACCTGGGGAGGAGGCGGGAGAGGGAGCTGCCATCTCTAGGCACCTCAAGCCC

™I5SI^*^**^'^^^^^''^'^^'^'''*^^^^^'^^<^'^^'5GACAGAGAGCTCCTTTGACACAAAAGACAGGAGCAGGGTCCAGG 
TTCCCCTGCTGTGCAGGGAGGGCAGCCCCGGGCAGTGGGCATAGGGCAGCTCAGTCCCTGGCCTCTTAGCACCAC 
ATTCCTGTTTTTCAGCTTATTTGAAGTCCTGCCTCATTCTCACTGGAGCCTCAGTCTCTCCTGCTTGGTCTTGGC 
CCTCAACTGGGGCAAGTGAAGCCAGAGGAGGGTCCCCCAGCTGGGTGGGCTGGAATGGAACTCCTCACTAGCTGC 
TGGGGCTCCGCCCACCCTGCTCCCTTCCGGACAATGAAGAAGCCTTTGCACCCTGGGAGGAAGGACCACCCCGGG 
CCCTCTATGCCTGGCCAGCCTCCAGCTCCTCAGACCTCCTGGGTGGGGTTTGGCTTCAGGGTGGGGTTTGGAAGC 
TTCTGGAAGTCGTGCTGGTCTCCCAGGTGAGGCAAGCCATGGTTGCTGGGCTGTAGGGTGAGTGGCTTGCTTGGT 
GGGACCTGACGAGTTGGTGGCATGGGAAGGATGTGGGTCTCTAGTGCCTTGCCCTGGCTTAGCTGCAGGAGAAGA 
TGGCTGCTTTCACTTCCCCCCATTGAGCTCTGCTCCCTCTGAGCCTGGTCTTTTGTCCTTTTTTATTTTGGTCTC 
CAAGATGAATGCTCATCTTTGGAGGGTGCCAGGTAGAAGCTAGGGAGGGGAGTGTCTTCTCTCTCCAGGTTTCAC 
CpcCAGTGTGCAGAAGTTAGAAGGGTCTGGCGGGGGCAGTGCCTTACACATGCTTGATTCCCACGCTACCCCCT 
GCCTTGGGAGGTGTGTGGAATAAATTATTTTTGTTAAGGCA 
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FIGURE 212 

MRRCNSGSGPPPSLLLLLLWLLAVPGANAAPRSALYSPSDPLTLLQADTVRGAVLGSRSAWAV 
EFFASWCGHCIAFAPTWKALAEDVKAWRPALYLAALDCAEETNSAVCRDFNIPGFPTVRFFKA 
FTKNGSGAVFPVAGADVQTLRERLIDALESHHDTWPPACPPLEPAKLEEIDGFFARNNEEYLA 
LI FEKGGS YLGREVALDLSQHKGVAVRRVLNTEANWRKFGVTDFPSCYLLFRNGSVSRVPVL 
MESRSFYTAYLQRLSGLTRE/yVQTTVAPTTANKIAPTVWKLADRSKIYMADLESALHYILRIE 
VGRFPVLEGQRLVALKKFVAVLAKYFPGRPLVQNFLHSVNEWLKRQKRNKIPYSFFKTALDDR 
KEGAVIiAKKVNWIGCQGSEPHFRGFPCSLWVLFHFLTVQAARQNVDHSQEAAKAKEVLPAIRG 
YVHYFFGCRDCASHFEQMAAASMHRVGSPNAAVLWLWSSHNRVNARLAGAPSEDPQFPKVQWP
PRELCSACHNERLDVPVWDVEATLNFLKAHFSPSNIILDFPAAGSAARRDVQNVAAAPELAMG 
ALELESRNSTLDPGKPEMMKSPTNTTPHVPAEGPEASRPPKLHPGLRAAPGQEPPEHMAELQR 
NEQEQPLGQWHLSKRDTGAALLAESRAEKNRLWGPLEVRRVGRSSKQLVDIPEGQLEARAGRG 
RGQWLQVLGGGFSYLDISLCVGLYSLSFMGLLAMYTYFQAKIRALKGHAGHPAA 

Signal peptide:

amino acids 1-29 

Transmembrane domain: 

amino acids 705-728 

N-glycosylation sites. 

amino acids 130-134, 243-247, 575-579 

Glycosaminoglycan attachment site.

amino acids 6-10 

cAMP- and cGMP-dependent protein kinase phosphorylation site.
amino acids 644-648 
N-myristoylation sites. 

amino acids 52-58, 56-62, 196-202, 381-387, 392-398, 448-454, 
468-474, 684-690, 702-708 

Cytochrome c family heme-binding site signature.
amino acids 509-515 
Thioredoxin family proteins 

amino acids 62-78 
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FIGURE 213 

GCACGAGGCCGACTTCCAGACCATCTACAACTGCACGGCCTGGAACAGCTTCGGCTCCGACAC 

TGAGATCATCCGGCTCAAGGAGCAAGGTTCGGAAATGAAGTCGGGAGCCGGGCTGGAAGCAGA 

GTCTGTGCCGATGGCCGTCATCATTGGGGTGGCCGTAGGAGCTGGTGTGGCCTTCCTCGTCCT 

TATGGCAACCATCGTGGCGTTCTGCTGTGCCCGTTCCCAGAGAAATCTCAAAGGTGTTGTGTC 

AGCCAAAAATGATATCCGAGTGGAAATTGTCCACAAGGAACCAGCCTCTGGTCGGGAGGGTGA 

GGAGCACTCCACCATCAAGCAGCTGATGATGGACCGGGGTGAATTCCAGCAAGACTCAGTCCT 

GAAACAGCTGGAGGTCCTCAAAGAAGAGGAGAAAGAGTTTCAGAACCTGAAGGACCCCACCAA 

TGGCTACTACAGCGTCAACACCTTCAAAGAGCACCACTCAACCCCGACCATCTCCCTCTCCAG 

CTGCCAGCCCGACCTGCGTCCTGCGGGTAAGCAGCGTGTGCCCACAGGCATGTCCTTCACCAA 

CATCTACAGCACCCTGAGCGGCCAGGGCCGCCTCTACGACTACGGGCAGCGGTTTGTGCTGGG 

CATGGGCAGCTCGTCCATCGAGCTTTGTGAGCGGGAGTTCCAGAGAGGCTCCCTCAGCGACAG 

CAGCTCCTTCCTGGACACGCAGTGTGACAGCAGCGTCAGCAGCAGCGGCAAGCAGGATGGCTA 

TGTGCAGTTCGACAAGGCCAGCAAGGCTTCTGCTTCCTCCTCCCACCACTCCCAGTCCTCGTC 

CCAGAACTCTGACCCCAGTCGACCCCTGCAGCGGCGGATGCAGACTCACGTCTAAGGATCACA

CACCGCGGGTGGGGACGGGCCAGGGAAGAGGTCAGGGCACGTTCTGGTTGTCCAGGGACGAGG 

GGTACTTTGCAGAGGACACCAGAATTGGCCACTTCCAGGACAGCCTCCCAGCGCCTCTGCCAC 

TGCCTTCCTTCGAAGCTCTGATCAAGCACAAATCTGGGTCCCCAGGTGCTGTGTGCCAGAGGT 

GGGCGGGTGGGGAGACAGACAGAGGCTGCGGCTGAGTGCGCTGTGCTTAGTGCTGGACACCCG 

TGTCCCCGGCCCTTTCCTGGAGGCCCCTCTACCACCTGCTCTGCCCACAGGCACAAGTGGCAG 

CTATAACTCTGCTTTCATGAAACTGCGGTCCACTCTCTGGTCTCTCTGTGGGCTCTACCCCTC 

ACTGACCACAAGCTCTACCTACCCCTGTGCCTGTGCTCCCATACAGCCCTGGGGAGAAGGGGA 

TGACGTCTTCCCAGCACTGAGCTGCCCCAGAAACCCCGGCTCCCCACTGCTGCTCATAGCCCA 

TACCCTGGAGGCTGACAAGCCAGAAATGGCCTTGGCTAAAGGAGCCTCTCTCTCACCAGGCTG 

GCCGGGAGCCCACCCCCAATTTGTTTGGTGTTTTGTGTCCATACTCTTGCAGTTCTGTCCTTG 

GACTTGATGCCGCTGAACTCTGCGGTGGGACCGGTCCCGTCAGAGCCTGGTGTACTGGGGGGA 

GGGAGGGAGGAGGGAGCCTGTGCTGACGGAGCACCTCGCCGGGTGTGCCCCTCCTGGGCTGTG 

TGACCCCAGCCTCCCCACCCACCTCCTGCTTTGTGTACTCCTCCCCTCCCCCTCAGCACAATC 

GGAGTTCATATAAGAAGTGCGGGAGCTTCTCTGGTCAGGGTTCTCTGAACACTTATGGAGAGA 

GTGCTTCCTGGGAAGTGTGGCGTTTGAAGGGGCTGGAGGGCAGGTCTTTAAGATGGCGAGACT 

GCCCTTCTCAGCTGATAAACACAAGAACGGCGATCCTGTCTTCAGTAAGGCTCCACGAGAAGA 

GAGGAAGTATATCTACACCTCAACCCTCCTAGTCACCACCTGAAATAAATGTTAGGGAAAAAAAA 
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FIGURE 214 

MAVIIGVAVGAGVAFLVLMATIVAFCCARSQRNLKGVVSAKNDIRVEIVHKEPASGREGEEHS 
TIKQLMMDRGEFQQDSVLKQLEVLKEEEKEFQNLKDPTNGYYSVNTFKEHHSTPTISLSSCQP 
DLRPAGKQRVPTGMSFTNIYSTLSGQGRLYDYGQRFVLGMGSSSIELCEREFQRGSLSDSSSF 
LDTQCDSSVSSSGKQDGYVQFDKASKASASSSHHSQSSSQNSDPSRPLQRRMQTHV 

Signal peptide: 

amino acids 1-28 

Glycosaminoglycan attachment site. 

amino acids 150-154 

N-myristoylation sites. 

amino acids 6-12, 10-16, 36-42, 139-145, 165-171 



Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 114-125 
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FIGURE 215 



CAGCCTTCCTCCCCCAGCCTGAGTGACTACTCTATTCCTTGGTCCCTGCTATTGTCGGGGACG 

ATTGCATGGGCTACGCCAGGAAAGTAGGCTGGGTGACCGCAGGCCTGGTGATTGGGGCTGGCG

CCTGCTATTGCATTTATAGACTGACTAGGGGAAGAAAACAGAACAAGGAAAAAATGGCTGAGG 

GTGGATCTGGGGATGTGGATGATGCTGGGGACTGTTCTGGGGCCAGGTATAATGACTGGTCTG 

ATGATGATGATGACAGCAATGAGAGCAAGAGTATAGTATGGTACCCACCTTGGGCTCGGATTG 

GGACTGAAGCTGGAACCAGAGCTAGGGCCAGGGCAAGGGCCAGGGCTACCCGGGCACGTCGGG 

CTGTCCAGAAACGGGCTTCCCCCAATTCAGATGATACCGTTTTGTCCCCTCAAGAGCTACAAA 

AGGTTCTTTGCTTGGTTGAGATGTCTGAAAAGCCTTATATTCTTGAAGCAGCTTTAATTGCTC 

TGGGTAACAATGCTGCTTATGCATTTAACAGAGATATTATTCGTGATCTGGGTGGTCTCCCAA 

TTGTCGCAAAGATTCTCAATACTCGGGATCCCATAGTTAAGGAAAAGGCTTTAATTGTCCTGA 

ATAACTTGAGTGTGAATGCTGAAAATCAGCGCAGGCTTAAAGTATACATGAATCAAGTGTGTG 

ATGACACAATCACTTCTCGCTTGAACTCATCTGTGCAGCTTGCTGGACTGAGATTGCTTACAA 

ATATGACTGTTACTAATGAGTATCAGCACATGCTTGCTAATTCCATTTCTGACTTTTTTCGTT 

TATTTTCAGCGGGAAATGAAGAAACCAAACTTCAGGTTCTGAAACTCCTTTTGAATTTGGCTG 

AAAATCCAGCCATGACTAGGGAACTGCTCAGGGCCCAAGTACCATCTTCACTGGGCTCCCTCT 

TTAATAAGAAGGAGAACAAAGAAGTTATTCTTAAACTTCTGGTCATATTTGAGAACATAAATG 

ATAATTTCAAATGGGAAGAAAATGAACCTACTCAGAATCAATTCGGTGAAGGTTCACTTTTTT 

TCTTTTTAAAAGAATTTCAAGTGTGTGCTGATAAGGTTCTGGGAATAGAAAGTCACCATGATT 

TTTTGGTGAAAGTAAAAGTTGGAAAATTCATGGCCAAACTTGCTGAACATATGTTCCCAAAGA 

GCCAGGAATAACACCTTGATTTTGTAATTTAGAAGCAACACACATTGTAAACTATTCATTTTC 

TCCACCTTGTTTATATGGTAAAGGAATCCTTTCAGCTGCCAGTTTTGAATAATGAATATCATA 

TTGTATCATCAATGCTGATATTTAACTGAGTTGGTCTTTAGGTTTAAGATGGATAAATGAATA 

TCACTACTTGTTCTGAAAACATGTTTGTTGCTTTTTATCTCGCTGCCTAGATTGAAATATTTT 

GCTATTTCTTCTGCATAAGTGACAGTGAACCAATTCATCATGAGTAAGCTCCCTTCTGTCATT 

TTCATTGATTTAATTTGTGTATCATCAATAAAATTGTATGTTAATGCTGGAAAGA 
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FIGURE 216 

MGYARKVGWVTAGLVIGAGACYCIYRLTRGRKQNKEKMAEGGSGDVDDAGDCSGARYNDWSDD 
DDDSNESKSIVWYPPWARIGTEAGTRARARARARATRARRAVQKRASPNSDDTVLSPQELQKV 
LCLVEMSEKPYILEAALIALGNNAAYAFNRDIIRDLGGLPIVAKILNTRDPIVKEKALIVLNN 
LSVNAENQRRLKVYMNQVCDDTITSRLNSSVQLAGLRLLTNMTVTNEYQHMLANSISDFFRLF
SAGNEETKLQVLKLLLNLAENPAMTRELLRAQVPSSLGSLFNKKENKEVILKLLVIFENINDN 
FKWEENEPTQNQFGEGSLFFFLKEFQVCADKVLGIESHHDFLVKVKVGKFMAKLAEHMFPKSQE 

Signal peptide: 

amino acids 1-20 

N-glycosylation sites. 

amino acids 68-72, 189-193, 217-221, 230-234 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 107-111 

N-myristoylation sites.

amino acids 13-19, 17-23, 19-25, 54-60, 83-89, 147-153, 255-261, 
290-296 



Amidation site.

amino acids 29-33 
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FIGURE 217 



GAGACACAAAGGCAGGCGGGATGCGGGAGCAGGCAAAGGGAAAGCGAAAGCCGCGCGCCCGGC 

CGGTGACTGGGTGAAGGCGCCGCGCAGCTTTCCCGACGCCGGCTGTACCCGGACCTCCTGGTC 

GAGCCTGGCGCGCCGCAGCCATGGCCATCGCTCAACTGGCCACGGAGTACGTGTTCTCGGATT 

TCTTGCTGAAGGAGCCCACGGAGCCCAAGTTCAAGGGGCTGCGACTGGAGCTGGCTGTGGACA 

AGATGGTCACGTGCATTGCGGTGGGGCTGCCCCTGCTGCTCATCTCGCTGGCCTTCGCGCAGG 

AGATCTCGATTGGTACACAGATAAGCTGTTTCTCTCCAAGTTCTTTCTCCTGGCGTCAGGCTG 

CCTTTGTGGATTCATATTGCTGGGCGGCTGTTCAGCAGAAGAACTCACTGCAGAGCGAGTCTG 

GAAACCTCCCACTGTGGCTGCATAAGTTTTTCCCCTACATCCTGCTGCTCTTTGCGATCCTCC 

TGTACCTGCCCCCGCTGTTCTGGCGTTTCGCAGCTGCTCCTCATATTTGCTCAGACTTGAAGT 

TTATCATGGAAGAACTTGACAAAGTTTACAACCGTGCAATTAAGGCTGCAAAGAGTGCGCGTG 

ACCTTGACATGAGAGATGGAGCCTGCTCAGTTCCAGGTGTTACCGAGAACTTAGGGCAAAGTT 

TGTGGGAGGTATCTGAAAGCCACTTCAAGTACCCAATTGTGGAGCAGTACTTGAAGACAAAGA 

AAAATTCTAATAATTTAATCATCAAGTACATTAGCTGCCGCCTGCTGACACTCATCATTATAC 

TGTTAGCGTGTATCTACCTGGGCTATTACTTCAGCCTCTCCTCACTCTCAGACGAGTTTGTGT 

GCAGCATCAAATCAGGGATCCTGAGAAACGACAGCACCGTGCCCGATCAGTTTCAGTGCAAAC 

TCATTGCCGTGGGCATCTTCCAGTTGCTCAGTGTCATTAACCTTGTGGTTTATGTCCTGCTGG 

CTCCCGTGGTTGTCTACACGCTGTTTGTTCCATTCCGACAGAAGACAGATGTTCTCAAAGTGT 

ACGAAATCCTCCCCACTTTTGATGTTCTGCATTTCAAATCTGAAGGGTACAACGATTTGAGCC 

TCTACAATCTCTTCTTGGAGGAAAATATAAGTGAGGTCAAGTCATACAAGTGTCTTAAGGTAC 

TGGAGAATATTAAGAGCAGTGGTCAGGGGATCGACCCAATGCTACTCCTGACAAACCTTGGCA 

TGATCAAGATGGATGTTGTTGATGGCAAAACTCCCATGTCTGCAGAGATGAGAGAGGAGCAGG 

GGAACCAGACGGCAGAGCTCCAAGGTATGAACATAGACAGTGAAACTAAAGCAAATAATGGAG 

AGAAGAATGCCCGACAGAGACTTCTGGATTCTTCTTGCTGATGATTTTTTTTCCTTGAGCTGT 

AAATCTGTGACTTCTGCGACATGGGATTTAATTTGGCTAAAGCACCCCTGTTGGTTTCACAGC 

TGGTTTGCAATAAATGGTTCTTGGTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 218 



MAIAQLATEYVFSDFLLKEPTEPKFKGLRLELAVDKMVTCIAVGLPLLLISLAFAQEISIGTQ 
ISCFSPSSFSWRQAAFVDSYCWAAVQQKNSLQSESGNLPLWLHKFFPYILLLFAILLYLPPLF 
WRFAAAPHICSDLKFIMEELDKVYNRAIKAAKSARDLDMRDGACSVPGVTENLGQSLWEVSES
HFKYPIVEQYLKTKKNSNNLIIKYISCRLLTLIIILLACIYLGYYFSLSSLSDEFVCSIKSGI 
LRNDSTVPDQFQCKLIAVGIFQLLSVINLVVYVLLAPVVVYTLFVPFRQKTDVLKVYEILPTF 
DVLHFKSEGYNDLSLYNLFLEENISEVKSYKCLKVLENIKSSGQGIDPMLLLTNLGMIKMDVV 
DGKTPMSAEMREEQGNQTAELQGMNIDSETKANNGEKNARQRLLDSSC 

Transmembrane domains: 

amino acids 37-55, 108-126, 216-232, 273-290 

N-glycosylation sites. 

amino acids 255-259, 338-342, 394-398 

Glycosaminoglycan attachment site. 

amino acids 357-361 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 203-207 

N-myristoylation sites.

amino acids 61-67, 174-180, 251-257, 393-399 

Prokaryotic membrane lipoprotein lipid attachment site.

amino acids 218-229 
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FIGURE 219

CTGTGAGTGACACACGCTGAGTGGGGTGAAGGGAAATGCTGGTGAATTTCATTTTGAGGTGTG

GGTTGCTGTTAGTCACTCTGTCTCTTGCCATTGCCAAGCACAAGCAATCTTCCTTCACCAAAA 

GTTGTTACCCAAGGGGAACATTGTCCCAAGCTGTTGACGCTCTCTATATCAAAGCAGCATGGC 

TCAAAGCAACGATTCCAGAAGACCGCATAAAAAATATACGATTATTAAAAAAGAAAACAAAAA 

AGCAGTTTATGAAAAACTGTCAATTTCAAGAACAGCTTCTGTCCTTCTTCATGGAAGACGTTT 

TTGGTCAACTGCAATTGCAAGGCTGCAAGAAAATACGCTTTGTGGAGGACTTTCATAGCCTTA 

GGCAGAAATTGAGCCACTGTATTTCCTGTGCTTCATCAGCTAGAGAGATGAAATCCATTACCA 

GGATGAAAAGAATATTTTATAGGATTGGAAACAAAGGAATCTACAAAGCCATCAGTGAACTGG 

ATATTCTTCTTTCCTGGATTAAAAAATTATTGGAAAGCAGTCAG^AACCAAAGCCAAGTACA 

TTGATTTTACAGTTATTTTGAAATACAATAAGAACTGCTAGAAATATGTTTATAACAGTCTAT 

TTCTTTTAAAAACTTTTTAACATAATACTGACGGCATGTTAGGTGATTCAGAATAGACAAGAA 

GGATTTAGTAAATTAACGTTTTGGATATAAGTTGTCACTAATTTGCACATTTTCTGTGTTTTC 

AAATAATGTTTCCATTCTGAACATGTTTTGTCATTCACAAGTACATTGTGTCAACTTAATTTA 

AAGTATGTAACCTGAATTAACTCGTGTAATATTTGTGTGTGGAGTGGGATGTGGGGGGTGGAG 

GGGGAATGACAGATTTCTGGAATGCAATGTAATGTTACTGAGACTTAAATAGATGTTATGTAT 

ATGATTGTCTGTTTAAGTGTTTGAAAATTGTTAATTATGCCCAGTGTGAACTTAGTACTTAAC 

ACATTTTGATTTTAATTAAATAAATTGGGTTTCCTTCTCAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAA 
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FIGURE 220 



MLVNFILRCGLLLVTLSLAIAKHKQSSFTKSCYPRGTLSQAVDALYIKAAWLKATIPEDRIKN 
IRLLKKKTKKQFMKNCQFQEQLLSFFMEDVFGQLQLQGCKKIRFVEDFHSLRQKLSHCISCAS 
SAREMKSITRMKRIFYRIGNKGIYKAISELDILLSWIKKLLESSQ 

Signal sequence: 

amino acids 1-21 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 68-71 

N-myristoylation site. 

amino acids 148-153 

Interleukin-10 proteins. 

amino acids 58-94, 74-102, 128-170 
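The residue ranges in these feature lists appear to be 1-based and inclusive. As a minimal sketch under that assumption (the helper name `residues` and the constant `FIG220_PREFIX` are illustrative and not part of the disclosure), the following Python pulls out the residues named by a range, using the first two lines of the Figure 220 sequence:

```python
def residues(sequence: str, span: str) -> str:
    """Return the residues covered by a 1-based, inclusive span such as '68-71'."""
    start, end = (int(part) for part in span.split("-"))
    # Python slices are 0-based and end-exclusive, hence start - 1 and end.
    return sequence[start - 1:end]


# First two printed lines of the Figure 220 sequence, joined without whitespace.
FIG220_PREFIX = (
    "MLVNFILRCGLLLVTLSLAIAKHKQSSFTKSCYPRGTLSQAVDALYIKAAWLKATIPEDRIKN"
    "IRLLKKKTKKQFMKNCQFQEQLLSFFMEDVFGQLQLQGCKKIRFVEDFHSLRQKLSHCISCAS"
)

signal_sequence = residues(FIG220_PREFIX, "1-21")   # annotated signal sequence
kinase_site = residues(FIG220_PREFIX, "68-71")      # annotated cAMP/cGMP kinase site

print(signal_sequence)
print(kinase_site)
```

Under this reading, the range 68-71 covers the residues KKKT, which is consistent with a basic-basic-x-Ser/Thr kinase recognition motif.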
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FIGURE 221 

GACCACGGCCCTGCGCCCCAGCCAGGCCTGAGGACATCAGGCGGCCGGCGGCGGTGCCGCTCC 

TGCTGCTGCTGTGTTTTGGGTCTCAGAGGGCCAAGGCAGCAACAGCCTGTGGTCGCCCCAGGA 

TGCTGAACCGAATGGTGGGCGGGCAGGACACGCAGGAGGGCGAGTGGCCCTGGCAAGTCAGCA 

TCCAGCGCAACGGAAGCCACTTCTGCGGGGGCAGCCTCATCGCGGAGCAGTGGGTCCTGACGG 

CTGCGCACTGCTTCCGCAACACCTCTGAGACGTCCCTGTACCAGGTCCTGCTGGGGGCAAGGC 

AGCTAGTGCAGCCGGGACCACACGCTATGTATGCCCGGGTGAGGCAGGTGGAGAGCAACCCCC 

TGTACCAGGGCACGGCCTCCAGCGCTGACGTGGCCCTGGTGGAGCTGGAGGCACCAGTGCCCT 

TCACCAATTACATCCTCCCCGTGTGCCTGCCTGACCCCTCGGTGATCTTTGAGACGGGCATGA 

ACTGCTGGGTCACTGGCTGGGGCAGCCCCAGTGAGGAAGACCTCCTGCCCGAACCGCGGATCC 

TGCAGAAACTCGCTGTGCCCATCATCGACACACCCAAGTGCAACCTGCTCTACAGCAAAGACA 

CCGAGTTTGGCTACCAACCCAAAACCATCAAGAATGACATGCTGTGCGCCGGCTTCGAGGAGG 

GCAAGAAGGATGCCTGCAAGGGCGACTCGGGCGGCCCCCTGGTGTGCCTCGTGGGTCAGTCGT 

GGCTGCAGGCGGGGGTGATCAGCTGGGGTGAGGGCTGTGCCCGCCAGAACCGCCCAGGTGTCT 

ACATCCGTGTCACCGCCCACCACAACTGGATCCATCGGATCATCCCCAAACTGCAGTTCCAGC 

CAGCGAGGTTGGGCGGCCAGAAGTSAGACCCCCGGGGCCAGGAGCCCCTTGAGCAGAGCTCTG 

CACCCAGCCTGCCCGCCCACACCATCCTGCTGGTCCTCCCAGCGCTGCTGTTGCACCTGTGAG 

CCCCACCAGACTCATTTGTAAATAGCGCTCCTTCCTCCCCTCTCAAATACCCTTATTTTATTT 

ATGTTTCTCCCAATAAAAACCCAGCCTGTGTGCCAGCTGAAAAAAAAAAAAAAAAAAA 
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FIGURE 222 

MRRPAAVPLLLLLCFGSQRAKAATACGRPRMLNRMVGGQDTQEGEWPWQVSIQRNGSHFCGGS
LIAEQWVLTAAHCFRNTSETSLYQVLLGARQLVQPGPHAMYARVRQVESNPLYQGTASSADVA 
LVELEAPVPFTNYILPVCLPDPSVIFETGMNCWVTGWGSPSEEDLLPEPRILQKLAVPIIDTP 
KCNLLYSKDTEFGYQPKTIKNDMLCAGFEEGKKDACKGDSGGPLVCLVGQSWLQAGVISWGEG 
CARQNRPGVYIRVTAHHNWIHRIIPKLQFQPARLGGQK

Important features of the protein:
Signal peptide: 

amino acids 1-22 

N-glycosylation sites. 

amino acids 55-58, 79-82

Casein kinase II phosphorylation sites. 

amino acids 121-124, 165-168, 167-170, 248-251 

Tyrosine kinase phosphorylation sites.
amino acids 78-86, 197-203 

N-myristoylation sites. 

amino acids 16-21, 37-42, 56-61, 62-67, 118-123 

Amidation site. 

amino acids 219-222 



Serine proteases, trypsin family, histidine active site. 

amino acids 71-76 
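Site lists of this kind can be reproduced approximately by pattern matching. The sketch below assumes the N-glycosylation sites were called with the common consensus N-{P}-[ST]-{P} (the text does not say which tool or pattern was used) and scans the first two lines of the Figure 222 sequence. The names `N_GLYC`, `find_sites` and `FIG222_PREFIX` are illustrative only.

```python
import re

# Consensus N-{P}-[ST]-{P} written as a regular expression. The lookahead
# wrapper lets overlapping candidate sites be reported as well.
N_GLYC = re.compile(r"(?=(N[^P][ST][^P]))")


def find_sites(pattern, sequence):
    """Return 1-based, inclusive (start, end) pairs for every pattern match."""
    return [
        (m.start() + 1, m.start() + len(m.group(1)))
        for m in pattern.finditer(sequence)
    ]


# First two printed lines of the Figure 222 sequence (residues 1-126).
# Residue 60 is taken as C, following the TGC codon in Figure 221; this
# choice does not affect the scan.
FIG222_PREFIX = (
    "MRRPAAVPLLLLLCFGSQRAKAATACGRPRMLNRMVGGQDTQEGEWPWQVSIQRNGSHFCGGS"
    "LIAEQWVLTAAHCFRNTSETSLYQVLLGARQLVQPGPHAMYARVRQVESNPLYQGTASSADVA"
)

print(find_sites(N_GLYC, FIG222_PREFIX))  # [(55, 58), (79, 82)]
```

The two hits agree with the N-glycosylation sites listed for this figure. The asparagine at residue 112 is skipped because a proline follows it.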




FIGURE 223

CAAGATGTGGACAGCTCTTGTGCTCATTTGGATTTTCTCCTTGTCCTTATCTGAAAGCCATGC

GGCATCCAACGATCCACGCAACTTTGTCCCTAACAAAATGTGGAAGGGATTAGTCAAGAGGAA 

TGCATCTGTGGAAACAGTTGATAATAAAACGTCTGAGGATGTAACCATGGCAGCAGCTTCTCC 

TGTCACATTGACCAAAGGGACTTCGGCAGCCCACCTCAACTCTATGGAAGTCACAACAGAGGA 

CACAAGCAGGACAGATGTGAGTGAACCAGCAACTTCAGGAGTTGCAGCTGATGGTGTGACCTC 

CATTGCTCCCACGGCTGTGGCCTCCAGTACGACTGCGGCCTCCATTACGACTGCGGCCTCCAG 

TATGACTGTGGCCTCCAGTGCTCCCACGACTGCAGCCTCCAGTACAACTGTGGCCTCCATTGC 

TCCCACGACTGCAGCCTCCAGTATGACTGCGGCCTCCAGCACTCCCATGACACTTGCACTCCC 

CGCGCCCACGTCCACTTCCACAGGGCGGACCCCGTCCACTACCGCCACTGGGCATCCATCTCT 

CAGCACAGCCCTCGCACAAGTGCCAAAGAGCAGCGCGTTGCCAAGAACAGCAACCCTGGCCAC 

ATTGGCCACACGTGCTCAGACTGTAGCGACCACAGCAAACACAAGCAGCCCCATGAGCACTCG 

TCCAAGTCCTTCCAAGCACATGCCCAGTGACACCGCGGCAAGCCCTGTACCCCCTATGCGTCC 

CCAAGCACAAGGTCCCATTAGCCAGGTGTCAGTGGACCAGCCTGTGGTTAACACAACAAATAA 

ATCCACACCCATGCCCTCAAACACAACCCCAGAGCCCGCCCCCACCCCCACAGTGGTGACCAC 

CACCAAGGCACAAGCCAGGGAGCCAACTGCCAGCCCAGTGCCAGTACCTCACACCAGCCCAAT 

CCCTGAGATGGAGGCCATGTCCCCCACGACACAGCCAAGCCCCATGCCATATACCCAGAGGGC 

CGCTGGGCCAGGCACATCCCAGGCACCGGAGCAGGTAGAGACTGAAGCCACACCAGGTACTGA 

TTCCACTGGGCCAACACCCAGGAGCTCAGGGGGCACTAAGATGCCAGCCACGGACTCGTGCCA 

GCCCAGCACCCAAGGCCAGTACATGGTGGTCACCACTGAGCCCCTCACCCAGGCCGTGGTAGA 

CAAAACTCTCCTTCTGGTGGTGCTGTTACTCGGGGTGACCCTTTTCATCACAGTCTTGGTTTT 

GTTTGCCCTGCAGGCCTATGAGAGCTACAAGAAGAAGGACTACACCCAGGTGGACTACTTAAT 

CAACGGGATGTATGCGGACTCAGAAATGT^GGGGGGCGGGGGCCTGGCGGGAGGCCTGGCCC 

CTTCCTCGTCCTTTCCTTTTGCCTTTGAGACCAAACCAAGTGCTTCCAAATTCTTTTGGTGCA 

ATTGAGGAGATATGCCAGATGCTTAAACACATTTAATTGCTGTCAGATTAATTCCATGATCAC 

T^GAGTTGCTGCTTTTTTCATATTTATTTTTGTAAATGATTCTGTGCCCAGGAGCAGCTGG 

GGGTTCCACCTCAGGGTGGGGCGGGCAGGACCCCGTCTCCCCAGGTGTCGGAGCCTGACCTGA 
ATTAAAGTACTGACTGCTCGCCA 
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FIGURE 224 

MWTALVLIWIFSLSLSESHAASNDPRNFVPNKMWKGLVKRNASVETVDNKTSEDVTMAAASPV 
TLTKGTSAAHLNSMEVTTEDTSRTDVSEPATSGVAADGVTSIAPTAVASSTTAASITTAASSM 
TVASSAPTTAASSTTVASIAPTTAASSMTAASSTPMTLALPAPTSTSTGRTPSTTATGHPSLS
TALAQVPKSSALPRTATLATLATRAQTVATTANTSSPMSTRPSPSKHMPSDTAASPVPPMRPQ 
AQGPISQVSVDQPVVNTTNKSTPMPSNTTPEPAPTPTVVTTTKAQAREPTASPVPVPHTSPIP 
EMEAMSPTTQPSPMPYTQRAAGPGTSQAPEQVETEATPGTDSTGPTPRSSGGTKMPATDSCQP 
STQGQYMVVTTEPLTQAVVDKTLLLVVLLLGVTLFITVLVLFALQAYESYKKKDYTQVDYLIN
GMYADSEM 

Signal peptide: 

amino acids 1-20 

Transmembrane domain:

amino acids 396-420 

N-glycosylation sites.

amino acids 41-44, 49-52, 222-225, 268-271, 271-274 
Casein kinase II phosphorylation sites. 

amino acids 14-17, 51-54, 80-83, 85-88, 280-283, 434-437

N-myristoylation sites. 

amino acids 68-73, 354-359 

Aldo/keto reductase family putative active site signature. 

amino acids 195-210 
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FIGURE 225 

GGAAGGCGCTCAAGGTGCGCGGCCCGGGGCGCGCTACTGGGGGCGCCCTCCGCGGTGGGCAGC 

GCGCCAGGGATCGGCCTGGGCAGCCGCGGGGCGCGCGAAGGCTGCGCTTTCCCTACGGCCCCC 

CTCGCTTCCTCCGGCACGGCGGCAACGGAGATTTCCTCTCGGGGAAACTACGCGGATCCTTTT 

CGGGGATCCTCGCCCCGCCCCAGTTCTCCGCCCCCTCCCCTTTGCTGGGGCGCCTGGGCTGGC 

CCGCGCAGGGGAGGAGGCTCTGGCAGCCTGGGCAGGGAGGCGGCGGGGGGCCGCGGAGCCGCT 

GGCCATCGATTCTCCCCGCCATGTGACGCCGTCCTTAGCCCTGCGACCCCCAGCGCGTCCCGG 

GCCTGCGCCTCCGCCCCGCCGCGCAGCGCACGATGCTTCTGCCGGGACGCGCACGCCAACCGC

CGACGCCCCAGCCCGTGCAGCATCCCGGCCTCCGCCGGCAGGTAGAGCCGCCGGGGCAGCTCC 

TGCGCCTCTTCTACTGCACTGTCCTGGTCTGCTCCAAAGAGATCTCAGCGCTCACCGACTTCT

CfGGTTACCTAACCAAACTCCTGCAAAACCACACCACCTATGCCTGTGATGGGGACTATTTGA 

ATCTACAGTGCCCTCGGCATTCTACGATAAGTGTCCAATCGGCATTTTATGGGCAAGATTACC 

AAATGTGTAGTTCCCAGAAGCCTGCCTCCCAGAGGGAAGACAGCTTAACCTGTGTGGCAGCCA 

CCACCTTCCAGAAGGTGCTGGACGAATGCCAGAACCAGCGGGCCTGCCACCTCCTGGTCAATA 

GCCGTGTTTTTGGACCTGACCTTTGTCCAGGAAGCAGTAAATACCTCCTGGTCTCCTTTAAAT 

GCCAACCTAATGAATTAAAAAACAAAACCGTGTGTGAAGACCAGGAGCTGAAACTGCACTGCC 

ATGAATCCAAGTTCCTCAACATCTACTCTGCGACCTACGGCAGGAGGACCCAGGAAAGGGACA 

TCTGCTCCTCCAAGGCAGAGCGGCTCCCCCCTTTCGATTGCTTGTCTTACTCAGCTTTGCAAG 

TCCTATCCCGAAGGTGCTATGGGAAGCAGAGATGCAAAATCATCGTCAACAATCACCATTTTG 

GAAGCCCCTGTTTGCCAGGCGTGAAAAAATACCTCACTGTGACCTACGCATGTGTTCCCAAGA 

ACATACTCACAGCGATTGATCCAGCCATTGCTAATCTAAAACCTTCTTTGAAGCAGAAAGATG 

GTGAATATGGTATAAACTTCGACCCAAGCGGATCGAAGGTTCTGAGGAAAGATGGAATTCTTG 

TTAGCAACTCTCTGGCAGCCTTTGCTTACATTAGAGCCCACCCAGAGAGAGCTGCCCTGCTGT 

TCGTGTCCAGTGTCTGCATCGGCCTGGCCCTCACACTGTGCGCCCTGGTCATCAGAGAGTCCT

GTGCCAAGGACTTCCGCGACTTGCAGCTGGGGAGGGAGCAGCTGGTGCCAGGAAGTGACAAGG 

TCGAGGAGGACAGCGAGGATGAAGAAGAGGAGGAGGACCCCTCTGAGTCTGATTTCCCAGGGG 

AACTGTCGGGGTTCTGTAGGACTTCATATCCTATATACAGTTCCATAGAAGCTGCAGAGCTCG 

CAGAAAGGATTGAGCGCAGGGAGCAAATCATTCAGGAAATATGGATGAACAGTGGTTTGGACA 

CCTCGCTCCCAAGAAACATGGGCCAGTTCTACTGAAAACCACATGCATCTTGATGCGATCGCA 

CTTTCTGAAGAAGGAAGGATCCCAAATGCCCCTCCAGTTCTGGTTCACCTGTACCTTCTATGA 

AGGAGAATTCGTCATGTCATTCAACACTCGTGAGGCCAGGAAGCTATTAAAGGGATGTTTCAA 

GCTGTTTCTAGCACATTCCAAAATAAATGAGGAGGGAGGAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAA
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FIGURE 226 

MLLPGRARQPPTPQPVQHPGLRRQVEPPGQLLRLFYCTVLVCSKEISALTDFSGYLTKLLQNH 
TTYACDGDYLNLQCPRHSTISVQSAFYGQDYQMCSSQKPASQREDSLTCVAATTFQKVLDECQ 
NQRACHLLVNSRVFGPDLCPGSSKYLLVSFKCQPNELKNKTVCEDQELKLHCHESKFLNIYSA 
TYGRRTQERDICSSKAERLPPFDCLSYSALQVLSRRCYGKQRCKIIVNNHHFGSPCLPGVKKY 
LTVTYACVPKNILTAIDPAIANLKPSLKQKDGEYGINFDPSGSKVLRKDGILVSNSLAAFAYI 
RAHPERAALLFVSSVCIGLALTLCALVIRESCAKDFRDLQLGREQLVPGSDKVEEDSEDEEEE 
EDPSESDFPGELSGFCRTSYPIYSSIEAAELAERIERREQIIQEIWMNSGLDTSLPRNMGQFY 

Transmembrane domains: 

amino acids 32-49, 322-343 

N-glycosylation sites. 

amino acids 62-66, 165-169 

Tyrosine kinase phosphorylation site.
amino acids 280-287 

N-myristoylation sites.

amino acids 302-308, 333-339, 428-434 



Amidation site. 

amino acids 191-195 




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FIGURE 227 



GGCACGAGGTGGAAGGGCTTTTACAAACAGATTGCTGGCCCCACCCCCCAGAATTTCTCATCA 
GGAGTGGGCAAGACCAATCATTTGCATTTCTGACAAGTTCCCAGGAGCTGCAGCTGCTGGCCC 
TGGAACCACACTTTGAGAACCACTGCTTTAGACCAAACACCAAAGGAAGATGCAGCCACCCTC 
CTTTACATGTCACAACGCTCAGGGTCCATGAGTACCTCAGGCTGTCCAGCTGAGCTCCACCTG 
CAGCAGCCGAGATTCCCGACTCGCTCCACCATTGGGGGCTAGGAGTGAAGCGTGTCACCATGG 
TCAGCTCATGGCCAGCCAGGAAAGCCTCTCTGCTGTGCGTCTGTGCAGTTCTTGTTCTTCCCT 
GGAGGACTCTTGGATCGCCTGTGATCTTGGCCAGGAGACCAGGTGCCTGGGTCCCTTCCTGGA 
AGGGGACAAGTTACACACCCCAGCCCCATTTTCCCACCAACTTCTACATGCCTTGGGAGAACC 
TTCTACATGTTGGCTGCCCCCTTCCCCTATTTCAGCAGTGCCCAGTCCTGCTTATAAACCTGA 
GGCCTGCTCCCCATACCTTCCCTGTGCAAGTGCCAGCCGTTATTCCAGGCAGCCCAATGTTGT 
TGAGGCCAGATGGATTCCTGGAAGCAGCTGGCCCATGGATGTGAGTCATCACAGTATTCTAGA 
AACAGAGAAGAGGTCTTAACCTAATGCGCATAGAGAAATTGTTCTCATTGTAAACATACCCCT 
GTCCTTAGCTGATCTAGGTGGAAGCCCAGCTTCATGTGCTAGGGGGCATGATAATGATAATAA 
AGGAATTGTATCTAGGACTAA 
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FIGURE 228 

MVSSWPARKASLLCVCAVLVLPWRTLGSPVILARRPGAWVPSWKGTSYTPQPHFPTNFYMPWE 
NLLHVGCPLPLFQQCPVLLINLRPAPHTFPVQVPAVIPGSPMLLRPDGFLEAAGPWM 

Signal peptide: 

amino acids 1-27 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 8-12 
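The amino acid sequences in the even-numbered figures correspond to the open reading frames of the nucleotide sequences in the preceding figures. A minimal sketch of that relationship, assuming the standard genetic code and using a 30-nucleotide fragment of Figure 227 that begins at the start codon (the fragment spans the break between two printed lines; `CODON_TABLE` and `translate` are illustrative names):

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# Fragment of Figure 227 starting at the ATG that opens the reading frame.
FIG227_FRAGMENT = "ATGGTCAGCTCATGGCCAGCCAGGAAAGCC"

print(translate(FIG227_FRAGMENT))  # MVSSWPARKA
```

The output matches the first ten residues of the Figure 228 sequence. The function takes a fragment already positioned at the start codon rather than searching for the first ATG, because an earlier out-of-frame ATG occurs upstream in Figure 227.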
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FIGURE 229

GGGAAGGGATGCAAGGAAGCCCTCCGGCGCTGCGCTCCGAGGCGGGAGACAGCGTCCCGCTGA 

AAATGTGTGTCTGACATGCAAGCTCAGTGGGGCAGAGACCCGTGGATTGCTGTGCCCTGCCCT 

CCGGACCTGGATCATGAAGGTGTTGGGAAGAAGCTTCTTCTGGGTGCTGTTTCCCGTCCTTCC

CTGGGCGGTGCAGGCTGTGGAGCACGAGGAGGTGGCGCAGCGTGTGATCAAACTGCACCGCGG 

GCGAGGGGTGGCTGCCATGCAGAGCCGGCAGTGGGTCCGGGACAGCTGCAGGAAGCTCTCAGG 

GCTTCTCCGCCAGAAGAATGCAGTTCTGAACAAACTGAAAACTGCAATTGGAGCAGTGGAGAA 

AGACGTGGGCCTGTCGGATGAAGAGAAACTGTTTCAGGTGCACACGTTTGAAATTTTCCAGAA 

AGAGCTGAATGAAAGTGAAAATTCCGTTTTCCAAGCTGTCTACGGACTGCAGAGAGCCCTGCA 

JSJSS^'^'^^^^^^'^^'^^^^^^^'^^AAGGAGAGCAGCCGGCAGCGCCTGGAGGCCCTGAG 

AGAGGCTGCAATAAAGGAAGAAACAGAATATATGGAACTTCTGGCAGCAGAAAAACATCAAGT 

TGAAGCCCTTAAAAATATGCAACATCAAAACCAAAGTTTATCCATGCTTGACGAGATTCTTGA 

AGATGTAAGAAAGGCAGCGGATCGTCTGGAGGAAGAGATAGAGGAACATGCTTTTGACGACAA 

TAAATCAGTCAAGGGGGTCAATTTTGAGGCAGTTCTGAGGGTGGAGGAAGAAGAGGCCAATTC 

TAAGCAAAATATAACAAAACGAGAAGTGGAGGATGACTTGGGTCTTAGCATGCTGATTGACTC 

CCAGAACAACCAGTATATTTTGACCAAGCCCAGAGATTCAACCATCCCACGTGCAGATCACCA 

CTTTATAAAGGACATTGTTACCATAGGAATGCTGTCCTTGCCTTGTGGCTGGCTATGTACAGC 

CATAGGATTGCCTACAATGTTTGGTTATATTATTTGTGGTGTACTTCTGGGACCTTCAGGACT 

AAATAGTATTAAGTCTATTGTGCAAGTGGAGACATTAGGAGAATTTGGGGTGTTTTTTACTCT 

TTTTCTTGTTGGCTTAGAATTTTCTCCAGAAAAGCTAAGAAAGGTGTGGAAGATTTCCTTACA 

AGGGCCGTGTTACATGACACTGTTAATGATTGCATTTGGCTTGCTGTGGGGGCATCTCTTGCG 

GATCAAACCCACGCAGAGCGTCTTCATTTCCACGTGTCTGTCCTTGTCAAGCACACCCCTCGT 

GTCCAGGTTCCTCATGGGCAGTGCTCGGGGTGACAAAGAAGGCGACATTGACTACAGCACCGT 

GCTCCTCGGCATGCTGGTGACGCAGGACGTGCAGCTCGGGCTCTTCATGGCCGTCATGCCGAC 

TCTCATACAGGCGGGCGCCAGTGCATCTTCTAGCATTGTCGTGGAAGTTCTCCGAATCCTGGT 

TTTGATTGGTCAGATTCTTTTTTCACTAGCGGCGGTTTTTCTTTTATGTCTTGTTATAAAGAA 

GTATCTCATTGGACCCTATTATCGGAAGCTGCACATGGAAAGCAAGGGGAACAAAGAAATCCT 

GATCTTGGGAATATCTGCCTTTATCTTCTTAATGTTAACGGTCACGGAGCTGCTGGACGTCTC 

CATGGAGCTGGGCTGTTTCCTGGCTGGAGCGCTCGTCTCCTCTCAGGGCCCCGTGGTCACCGA 

GGAGATCGCCACCTCCATCGAACCCATCCGCGACTTCCTGGCCATCGTTTTCTTCGCCTCCAT 

AGGGCTCCACGTGTTCCCCACGTTTGTGGCGTACGAGCTCACGGTGCTGGTGTTCCTCACCTT 

GTCAGTGGTGGTGATGAAGTTTCTCCTGGCGGCGCTGGTCCTGTCTCTCATTCTGCCGAGGAG 

CAGCCAGTACATCAAGTGGATCGTCTCTGCGGGGCTTGCCCAGGTCAGCGAGTTTTCCTTTGT 

CGTGGGGAGCCGGGCGCGAAGAGCGGGCGTCATCTCTCGGGAGGTGTACCTCCTTATACTGAG 

TGTGACCACGCTCAGCCTCTTGCTCGCCCCGGTGCTGTGGAGAGCTGCAATCACGAGGTGTGT 

GCCCAGACCGGAGAGACGGTCCAGCCTCTGATGGCTCGGAGATGATGGACCGTGGAAGGGAAG 

CGTCTGTGGGGAGTGAGCGCTTAGATGGCCAGCAGCTGCTCCTTCTGGGAAGCTCGCACCTTG 

GCAACAGAACAGCCCTCTAGCAGAGCGTCAGTGCAGTCGTGTTATCCCGGCTTTTACAGAATA 

TTCTTGTCCTATTTTAGAATTTTCCGGAGTAGTTTATTTGCAGTCTGTTGATTATGTGCAGTA 

GACCCGGGACACTGCGTTTTACCGATCACCTTGAATGTGGTGCCTGGATGTGCCTTTTTTTTT 

TTTCCCTGAAATTATTATTAATTTTCTATTGTGAGTTCATCAGTTCATAGTTTTTTTAGTAAA 

GAAGCAAAATTAAAAGGCTTTTAAAAATGTACAACTTCAGAATTATAATCTGTTAGTCAAATA 

TTTGTTATTAAACATTTCTGTAATATGAAGTTGTAATCCTGGCCGTGAGCTTGGAAGCTTACT 

TTTGATTCTTAAAGCCTATGTTTTCTAAAATGAGACAAATACGGATGTCTATTTGCCTTTTAT 

TGTAACTTTTAAATGAAATAATTTCATGTCAATTTCTATTAGATATATCACTTAAAATATTTG 

GTTTTAAATCACAAGAATATGTATTCTTTAATAAAGATAATTTATGATCATGGTAAAAAAAAAAA 
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FIGURE 230 

MKVLGRSFFWVLFPVLPWAVQAVEHEEVAQRVIKLHRGRGVAAMQSRQWVRDSCRKLSGLLRQ
KNAVLNKLKTAIGAVEKDVGLSDEEKLFQVHTFEIFQKELNESENSVFQAVYGLQRALQGDYK 
DVVNMKESSRQRLEALREAAIKEETEYMELLAAEKHQVEALKNMQHQNQSLSMLDEILEDVRK 
AADRLEEEIEEHAFDDNKSVKGVNFEAVLRVEEEEANSKQNITKREVEDDLGLSMLIDSQNNQ 
YILTKPRDSTIPRADHHFIKDIVTIGMLSLPCGWLCTAIGLPTMFGYIICGVLLGPSGLNSIK 
SIVQVETLGEFGVFFTLFLVGLEFSPEKLRKVWKISLQGPCYMTLLMIAFGLLWGHLLRIKPT 
QSVFISTCLSLSSTPLVSRFLMGSARGDKEGDIDYSTVLLGMLVTQDVQLGLFMAVMPTLIQA 
GASASSSIVVEVLRILVLIGQILFSLAAVFLLCLVIKKYLIGPYYRKLHMESKGNKEILILGI
SAFIFLMLTVTELLDVSMELGCFLAGALVSSQGPVVTEEIATSIEPIRDFLAIVFFASIGLHV
FPTFVAYELTVLVFLTLSVVVMKFLLAALVLSLILPRSSQYIKWIVSAGLAQVSEFSFVLGSR
ARRAGVISREVYLLILSVTTLSLLLAPVLWRAAITRCVPRPERRSSL 

Signal peptide:

amino acids 1-22 

Transmembrane domains: 

amino acids 282-304, 322-337, 354-370, 379-395, 445-474, 501-520, 
576-598, 641-660 

N-glycosylation sites. 

amino acids 104-108, 174-178, 206-210, 230-234

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 55-59, 673-677 

Tyrosine kinase phosphorylation site. 

amino acids 407-414 

N-myristoylation sites. 

amino acids 116-122, 327-333, 366-372, 401-407, 419-425, 429-435, 
442-448, 525-531, 530-536 

Cell attachment sequence. 

amino acids 404-407 
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FIGURE 231 

GAGAAAAACAACAGGAAGCAGCTTACAAACTCGGTGAACAACTGAGGGAACCAAACCAGAGAC 
GCGCTGAACAGAGAGAATCAGGCTCAAAGCAAGTGGAAGTGGGCAGAGATTCCACCAGGACTG 
GTGCAAGGCGCAGAGCCAGCCAGATTTGAGAAGAAGGCAAAAAGATGCTGGGGAGCAGAGCTG
TAATGCTGCTGTTGCTGCTGCCCTGGACAGCTCAGGGCAGAGCTGTGCCTGGGGGCAGCAGCC 
CTGCCTGGACTCAGTGCCAGCAGCTTTCACAGAAGCTCTGCACACTGGCCTGGAGTGCACATC 
CACTAGTGGGACACATGGATCTAAGAGAAGAGGGAGATGAAGAGACTACAAATGATGTTCCCC 
ATATCCAGTGTGGAGATGGCTGTGACCCCCAAGGACTCAGGGACAACAGTCAGTTCTGCTTGC 
AAAGGATCCACCAGGGTCTGATTTTTTATGAGAAGCTGCTAGGATCGGATATTTTCACAGGGG 
AGCCTTCTCTGCTCCCTGATAGCCCTGTGGGCCAGCTTCATGCCTCCCTACTGGGCCTCAGCC 
AACTCCTGCAGCCTGAGGGTCACCACTGGGAGACTCAGCAGATTCCAAGCCTCAGTCCCAGCC 
AGCCATGGCAGCGTCTCCTTCTCCGCTTCAAAATCCTTCGCAGCCTCCAGGCCTTTGTGGCTG 
TAGCCGCCCGGGTCTTTGCCCATGGAGCAGCAACCCTGAGTCCCTAAAGGCAGCAGCTCAAGG
ATGGCACTCAGATCTCCATGGCCCAGCAAGGCCAAGATAAATCTACCACCCCAGGCACCTGTG 
AGCCAACAGGTTAATTAGTCCATTAATTTTAGTGGGACCTGCATATGTTGAAAATTACCAATA 
CTGACTGACATGTGATGCTGACCTATGATAAGGTTGAGTATTTATTAGATGGGAAGGGAAATT 
TGGGGATTATTTATCCTCCTGGGGACAGTTTGGGGAGGATTATTTATTGTATTTATATTGAAT 
TATGTACTTTTTTCAATAAAGTCTTATTTTTGTGGCTAAAAAAAAAAAAA 
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FIGURE 232 

MLGSRAVMLLLLLPWTAQGRAVPGGSSPAWTQCQQLSQKLCTLAWSAHPLVGHMDLREEGDEE 
TTNDVPHIQCGDGCDPQGLRDNSQFCLQRIHQGLIFYEKLLGSDIFTGEPSLLPDSPVGQLHA 
SLLGLSQLLQPEGHHWETQQIPSLSPSQPWQRLLLRFKILRSLQAFVAVAARVFAHGAATLSP 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Casein kinase II phosphorylation site. 

amino acids 64-67 



N-myristoylation sites. 

amino acids 25-30, 81-86, 122-127 
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FIGURE 233 

CCCACGCGTCCGGCCCTGTAACCAAGATACTGACTGAACATGGCTGGCGGACTCAGGCTGGGGTCTGCAGTGCAG 
^^3^^^^^^'=^°<=T^^CATGAATATGGAGTAGTTTTCTCTAGCAAAGAGTAATCTGGGCCATGGAGTCAGGCCA 

CATAGTGGCTTCCAGCTTCTTAAAGTCTGACAAGAACCGGATAGGGGGAACCTACAAGAAGACCATCTAT^ 

atacaaggatgactcatacacagatgaagtggcccagcctgcctggttgggcttcctggggcSgtct^^ 
tcaagtgggggatgtcattcttattcacctgaagaattttgccactcgtccctatS 

tcccccggggggcagccatatctacaactggaccattccagaaggccatgcacccaccgatgctgacccagcgt^ 

cctcacctggatctaccattctcatgtagatgctccacgagacattgcaactggcctaattSgcct^ 

ctgtaaaagaggagccctggatgggaactcccctcctcaacgccaggatgtagaccaSttcttc^ 

S^^^^^^°^^*^^'^^^*^^^^^'^°^<^^^c^taggatgcatgcaatcaatggctttgtttttgggaact 

Ir^™^^^'''^^°'"°'=^^^^^^°'^°TGSCCTGGCACTTGrrTGGCATGGGCAATGAA^^ 
^™n!^^^^^^°^^^'^'^^^^^°^^TACCCGTGGACACCACACTGATGTGGCTAACAT^ 

tgtgactgctgagatggtgccctgggaacctggtacctggttaattagctgccaagt^cagtScttt^^^ 

Innr^I^S''^^^''''''''^^^^'^^^^CTTGCTCCATGGCCCCTCCT^^ 
IrJrr™™^^S^^*^^T*=^^^^'^^TTTTCCAGAAGAGCTC 

^^IS^^I^^^^^^^-^°^T<5^^C»TTCCAAGAGAAGATGCATTTGGAGGAAGATAG^ 

S^^I^^^^^'^^^'^^'^^^^'^^^^^^^^G^CTATGAAGGCACTGTGTACAATGATGGCTCM^^ 

GGATCCTGCTTGTCTCACTTGGATGTACTTCTCTGCTGCAGATCCCATAAGAGACACAAATTCTGGCCTrrTrr^ 

J^J^^^^^^tgtgttggatgagaacaagagctggtacagcaatgccaatcaagcagctgctatgtSttt^ 

on3^o^^r'^^'^^'^^^^^*^°^°'^^^'^^<=^^'5ACTCCAATCGGATGCATGCCATTAATGGGTT?C^^^ 

cctgcccaggctggacatgtgcaagggtgacacagtggcctggcacctgctcggcctgggcacagaSctS^ 

S^^J^^S^gtcatgttccagggcaacactgtgcagcttcagggcatgaggaagggtS 

tacctttgtcatggccatcatgcagcctgacaaccttgggacatttgagatttattgccaggcagSgccmS 

agaagcagggatgagggcaatctataatgtctcccagtgtcctggccaccaaSSctcSSSSJJS 

^^^^.'^'^'''^''''''^^''^''^''^^^^^c^^'^g^stagagtgggactattgccctSc^^ 

^^^^^'''''^''''''''''^"'^''^^^^^^'^^t'^^tggttacattttcctgagcaacaaggatgggctS 

»^™^^^^''^^^''^^^<^<5G^TACACTGATG6TACATTCAGGATCCCTCGGCCAAGG^ 

^^^^o^S^°'^^^^'^^^^'^^^^"="'ATCAAAGGTGAAGTTGGTGATATCCTGACTGTGGTA^^^ 

$^»^^»^^*'S^'''''"'^^^''°'=^''^^^^^T<^'^^GTGCTA6AATCTA^ 

^J^^^^'^^^'^'^'^^^'^^^^^'^^^catgtatagtggcctggtggggcccttggctatctgS 

»™TnSIS°^°^^^^^'^'^^^'^^^<5T<5'^CATGGATCGGGAATTTGCATTGTTGTTCTTS^^ 

S^I™*^^*^'^'^*'"'"^^<^<^^tgtggcaacccatgggtcccaggatccaggcagtatt^^ 

rS^^I^^n^^^^^''*''''^''^^'^^^'^'^^^'^'=^TGGGAAACTCTATGCCAACCT^^^ 
GTACCAAGGAGAACGAGTGGCCTGGTACATGCTGGCCATGGGCCAAGATGTGGATCTACACACCATCCACTTrc 

CATGGAGACCCTCTTCACTGTTTTTTCTCGAACAGAACACTTAAGCCCTCTCACCGTCATCACCAAAGAGACTGA 

aaaagtgccccccagagacattgaagaaggcaatgtgaagatgctgggcatgcaStoccSta^^ 

gatgctggcctctgttttggttgccattagtgtcacccttctgctcgttgttctggctcttgSg^ 

gtaccaacatcgacagagaaagctacgacgcaataggaggtccatcctggatgacagcttca^ 

^™oSf^S^I^°°^^^^''*^^''^^^^^^^'^«^^GCACATCTGTAGTGCACT 

nS^™^*^^^^^^^^^^^'^*^'^^^'f^^«5TGGAGAAGCAGAAGGAGCAATCAAGCTTATCTGG^^^ 

»™IS^^^*^'^^^^^^^"^™'^^^^C^TCGCAAATTTCAACAGCTACATTATATTTCCT^^ 

^^^^^^^''''''^'^'''^''^^'^^^'^'^^C^CAAAGTAGAGACCAAGAGAAAAACTcS^ 

^AnrJ^J^^^^^'^'''^°'^''^^^^^'^^<^^^T'^^G°CCAAGTGAGCTGTTAAGATAA^ 

^^SS™^r^^^'^^'^^^''<^^^^T''<^^^GTAGGCTGAGTATTGGGAATCCAAATTGAASTG^^ 

CTCCTTGGCAGTGAACTACTTTGAAGAAGTGGTCAATGGGTTGTTGCTGCCATGAGCATGTACAACCTCT^^ 

tagaagctcctcaggaaagccagttctccaagttcttaacctgtggcactgaaaggaatgttg^taS5cto^ 

ATGTTTTAGACAGCAAACCCTATCCATTAAAGTACTTGTTAGACCAAAAAAAAAAAAA
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FIGURE 234 

MWAMESGHLLWALLFMQSLWPQLTDGATRVYYLGIRDVQWNYAPKGRNVITNQPLDSDIVASS 
FLKSDKNRIGGTYKKTIYKEYKDDSYTDEVAQPAWLGFLGPVLQAEVGDVILIHLKNFATRPY 
TIHPHGVFYEKDSEGSLYPDGSSGPLKADDSVPPGGSHIYNWTIPEGHAPTDADPACLTWIYH 
SHVDAPRDIATGLIGPLITCKRGALDGNSPPQRQDVDHDFFLLFSVVDENLSWHLNENIATYC 
SDPASVDKEDETFQESNRMHAINGFVFGNLPELNMCAQKRVAWHLFGMGNEIDVHTAFFHGQM 
LTTRGHHTDVANIFPATFVTAEMVPWEPGTWLISCQVNSHFRDGMQALYKVKSCSMAPPVDLL 
TGKVRQYFIEAHEIQWDYGPMGHDGSTGKNLREPGSISDKFFQKSSSRIGGTYWKVRYEAFQD 
ETFQEKMHLEEDRHLGILGPVIRAEVGDTIQWFYNRASQPFSMQPHGVFYEKDYEGTVYNDG 
SSYPGLVAKPFEKVTYRWTVPPHAGPTAQDPACLTWMYFSAADPIRDTNSGLVGPLLVCRAGA 
LGADGKQKGVDKEFFLLFTVLDENKSWYSNANQAAAMLDFRLLSEDIEGFQDSNRMHAINGFL 
FSNLPRLDMCKGDTVAWHLLGLGTETDVHGVMFQGNTVQLQGMRKGAAMLFPHTFVMAIMQPD 
NLGTFEIYCQAGSHREAGMRAIYNVSQCPGHQATPRQRYQAARIYYIMAEEVEWDYCPDRSWE 
REWHNQSEKDSYGYIFLSNKDGLLGSRYKKAVFREYTDGTFRIPRPRTGPEEHLGILGPLIKG 
EVGDILTWFKNNASRPYSVHAHGVLESTTVWPLAAEPGEWTYQWNIPERSGPGPNDSACVS 
WIYYSAVDPIKDMYSGLVGPLAICQKGILEPHGGRSDMDREFALLFLIFDENKSWYLEENVAT 
HGSQDPGSINLQDETFLESNKMHAINGKLYANLRGLTMYQGERVAWYMLAMGQDVDLHTIHFH 
AESFLYRNGENYRADWDLFPGTFEWEMVASNPGTWLMHCHVTDHVHAGMETLFTVFSRTEH 
LSPLTVITKETEKVPPRDIEEGNVKMLGMQIPIKNVEMLASVLVAISVTLLLWLALGGWWY 
QHRQRKLRRNRRSILDDSFKLLSFKQ

Signal peptide: 
amino acids 1-21 
Transmembrane domain:
amino acids 1109-1130 
N-glycosylation sites. 

amino acids 167-171, 239-243, 591-595, 717-721, 761-765, 832-836, 
876-880, 934-938

Glycosaminoglycan attachment site.

amino acids 871-875 

Tyrosine kinase phosphorylation sites.

amino acids 82-90, 137-145, 494-502, 513-521 
N-myristoylation sites.

amino acids 212-218, 313-319, 498-504, 566-572, 672-678, 778-784,
843-849 

Multicopper oxidases signature 1.

amino acids 344-365, 696-717, 1043-1064 
Multicopper oxidases signature 2.
amino acids 1048-1060 
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FIGURE 235 



GGAAAGAGTGCTGGTACTACAACCAGGAAGTGACAGATAATGTGCTTTAAACTACATTAGAAAAGCTTCTCATAG 

CAAAACTGAGAGATTGAAGCAGTGATTATTTTTACATAGTTGTCATTAAATATTTGGAGCTCTGCTGTGCATAGA 

GATGGCAACATACTTAGAATACACAGCTTTCTGGGCCAGAAATTGATCTTCTGACTTTTGAGCCTTATCTGATTA 

CTGCTTGGTTCATCTTTATTTTGTTAAACTACTCTGTAGGCTGAAAGGGAGAGACTCTCCTTGGTTTGCAGAGCC 

TGACTAGACAGGAATTCTGGCAACTGCTCCAGCAGAACTATGGCACTGAGCTAGGTTTAAATGCTGAGGAGATGG 

AAAACTTGTCACTGTCGATTGAGGATGTGCAGCCAAGAAGTCCAGGAAGAAGCAGCTTGGATGACTCTGGGGAGA 

GAGATGAAAAATTATCCAAGTCAATCAGTTTTACCAGTGAATCAATTAGTCGGGTTTCAGAAACAGAGTCATTCG 

ATGGAAATTCATCAAAAGGAGGATTAGGCAAAGAGGAGTCCCAAAATGAGAAACAGACCAAAAAGAGTCTCTTAC 

CAACTTTGGAAAAGAAGTTAACTAGAGTGCCATCAAAGTCACTGGACTTGAATAAAAATGAATATCTTTCTCTGG 

ACAAAAGCAGCACTTCAGATTCTGTTGATGAAGAAAATGTTCCTGAGAAAGATCTTCATGGAAGACTTTTTATCA 

ACCGTATTTTTCATATCAGTGCTGACAGAATGTTTGAATTGCTCTTTACCAGTTCACGCTTTATGCAGAAATTTG 

CCAGTTCTAGAAATATAATAGATGTAGTATCTACCCCTTGGACTGCAGAACTTGGAGGTGATCAGCTGAGAACGA 

TGACCTACACTATAGTCCTTAATAGTCCACTTACTGGAAAATGCACTGCTGCCACTGAAAAGCAGACACTGTATA 

AAGAAAGTCGGGAAGCACGATTTTATTTGGTAGATTCAGAAGTACTGACACATGATGTCCCCTACCATGATTACT 

TCTATACCGTGAACAGATACTGTATCATCCGATCTTCAAAACAGAAATGCAGGCTAAGAGTTTCCACAGATTTGA 

AATACAGAAAACAGCCATGGGGCCTTGTCAAATCTTTAATTGAAAAGAATTCCTGGAGTTCTTTGGAGGACTATT 

TCAAACAGCTTGAATCAGATTTGTTAATTGAAGAATCTGTATTAAATCAGGCCATTGAAGACCCTGGAAAACTTA 

CTGGCCTACGAAGGAGAAGGCGAACCTTCAACCGAACAGCAGAAACAGTTCCTAAACTTTCCTCTCAGCATTCCT 

CTGGAGATGTGGGCTTAGGTGCCAAAGGGGATATTACAGGAAAGAAAAAGGAAATGGAAAACTATAACGTCACTC 

TTATTGTGGTAATGAGTATTTTTGTGTTGTTATTAGTTTTGTTGAATGTGACACTGTTTCTGAAGCTGTCAAAGA 

TAGAACATGCTGCTCAGTCCTTTTACCGTCTCCGCCTCCAAGAAGAGAAATCTTTAAATTTAGCCTCTGATATGG 

TGTCAAGAGCAGAAACTATTCAGAAGAATAAAGATCAGGCCCATCGTTTAAAGGGAGTGCTCCGAGACTCCATAG 

TGATGCTTGAACAGCTGAAGAGCTCACTCATTATGCTTCAGAAAACGTTTGATCTACTAAATAAGAATAAGACTG 

GCATGGCTGTTGAAAGCTAGTGATCTGAAGGACTAAAACCGCAGAGATACTTGGAACTTAAAGAAAATACCTGGA 

AGT^AAACCAGACGAATGAAGGATTTTGGCATAGAACATTTCTATGTTTTTTCATTATTGAGATTTCTAATATGAA 

CATTTCTTTCAGTAACATTTATTTGATAATTAGTTTCTGCTGGCCTTAATAATCCATCCTTTCACTTCTTATAGA 

TATTTTTAAGCTGTGAATTTCTTCAGTGAACCATGAAATATATTATAGAACTGAATTTCTCTGATACAAAAAGAA 

AATGACACACCCTGAATTGAGTGGTATGGTCTCATTTCTACAGTGAAGTCTGATGCTTTGTTAGCACAGAATCCG 

TACATGTCCAATAGGTCGCTTTTGTAACTGAGATAAGACCAAGAGGATAAACAGGACAATATAAGAAGAAACCTC 

TATGTCATTACTGATTTTAAAGGTTCTGTTTTCAGGCATATAACATTTCCAGGTTTGTGTACTGTAAAGATTATA 

ATGTCTTCATTTATTTAGCATGCAAATTTAATAGTCAAACTTTTTGAATCTGCATGTTGATGATGATTATCAGAA 

AGGGTCTTCTGCCATGCTGTATCTTTATGAAAGAAATAGTTGTTTTTTCTTAAGGTAACTATCAGAGGTGGGATT 

ATCTTGCCTCCTCACTTAGAATACCAACAGTCAAAAGGAAGAACCATCCTCTGAGTTTTAAAAACCAGAAGGTTA 

TGTTAAAATCTGGGCATTTAGTGACAGATCAAATGCATACTTGAACTAAGATTGGCTTCAGCTTAGCAGTCTTTC 

ATGGTGGAAGTGACACATCTGGTTGAAAATAATTTGTGTATTTTCAGTAACCATGTATGGCTTCCTTCTTTATGT 

ATGTGTGTGACTTGTTTTAATTGGTAAGTTATAAGCCAGACATAGATTTTAGCTCTTTAATAAAAACTTCAGGGG 

CACGTATGTCCCAGTACAAGTGTACTGACTATCAAGTTTTAACTCAGATGCAAGCTTTGGCTCTTTCATAAAAAG 

TTTTTATGCATATGTGTCTCCATACAAGTGGCTCATTAAAATAAGAACTTTGTAAACTGACTTAAAATCAGATAT 

TTTTTCAAGAGTTAGGGAAAGTTGAAGTGTTTTACTGTTTTGTCTCTTGAGCCCTTTCTCTGGGGAAAAAATACA 

TATCCATCTATCTATCTATATATAAACTGTGTATACATTCTTACTGTTTGAACAACTATTGCCTTTAATTAAATG 

TTTCATTTTTCTCCAGAGTCCCCAAAGCCACATGGCATTATTATAGTCATTTTTGAGATGCCTGTAGAGAATGAA 

AGTATTGACTCCGTTAGAGGGAAAATGGGTTTCTCTGGGTGAATTCCAACGAAGCATACCTAGGGGTAACAGTGA 

ACCTACCTGGGTTTGTTTTGTTTTGGTAAGGATTTATGTAGTGTCTGGCTGTAAGCAAGAATGAGTGGATTATAA 

ACTTGAAGATTTCTCTGTTAAAGTCACAAAAATGATCGACAAACAATATTTTTGTGATGTTTATTTAAACGTTGT 

ATTTTATAACATACTTCAAGGAAGAGTATCGAAGTAAGTTGCTTTATAAATTAAGACTAAATTCGTATGGATGCA 

GAATTCAATTAATAAAATTTGAGCCTGTTACGTAAATTGAATATTAATAAAATTGAAAATTTCAAAA 
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FIGURE 236 

MENLSLSIEDVQPRSPGRSSLDDSGERDEKLSKSISFTSESISRVSETESFDGNSSKGGLGKE 
ESQNEKQTKKSLLPTLEKKLTRVPSKSLDLNKNEYLSLDKSSTSDSVDEENVPEKDLHGRLFI 
NRIFHISADRMFELLFTSSRFMQKFASSRNIIDVVSTPWTAELGGDQLRTMTYTIVLNSPLTG
KCTAATEKQTLYKESREARFYLVDSEVLTHDVPYHDYFYTVNRYCIIRSSKQKCRLRVSTDLK 
YRKQPWGLVKSLIEKNSWSSLEDYFKQLESDLLIEESVLNQAIEDPGKLTGLRRRRRTFNRTA 
ETVPKLSSQHSSGDVGLGAKGDITGKKKEMENYNVTLIVVMSIFVLLLVLLNVTLFLKLSKIE 
HAAQSFYRLRLQEEKSLNLASDMVSRAETIQKNKDQAHRLKGVLRDSIVMLEQLKSSLIMLQK 
TFDLLNKNKTGMAVES 

Transmembrane domain: 

amino acids 352-371 

N-glycosylation sites. 

amino acids 3-7, 54-58, 312-316, 349-353, 367-371, 449-453 

cAMP- and cGMP-dependent protein kinase phosphorylation sites.

amino acids 81-85, 307-311 

Tyrosine kinase phosphorylation sites.

amino acids 202-211, 246-254, 341-349 

N-myristoylation site. 

amino acids 259-265 

Amidation site.

amino acids 339-343 
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FIGURE 237 

CAGGGGCTGGAGGGCAGGGGAGGGGATGATGTCATTCCTGCTCGGCGCAATCCTGACCCTGCT

CTGGGCGCCCACGGCTCAGGCTGAGGTTCTGCTGCAGCCTGACTTCAATGCTGAAAAGTTCTC 

AGGCCTCTGGTACGTGGTCTCCATGGCATCTGACTGCAGGGTCTTCCTGGGCAAGAAGGACCA 

CCTGTCCATGTCCACCAGGGCCATCAGGCCCACAGAGGAGGGCGGCCTCCACGTCCACATGGA 

GTTCCCGGGGGCGGACGGCTGTAACCAGGTGGATGCCGAGTACCTGAAGGTGGGCTCCGAGGG 

ACACTTCAGAGTCCCGGCCTTGGGCTACCTGGACGTGCGCATCGTGGACACAGACTACAGCTC 

CTTCGCCGTCCTTTACATCTACAAGGAGCTGGAGGGGGCCCTCAGCACCATGGTGCAGCTCTA 

CAGCCGGACCCAGGATGTGAGTCCCCAGGCTCTGAAGTCCTTCCAGGACTTCTACCCGACCCT 

GGGGCTCCCCAAGGACATGATGGTCATGCTGCCCCAGTCAGATGCATGCAACCCTGAGAGCAA 

GGAGGCGCCCT^CACCTCCGGAGCCCCACCCCCGCCCTTCCCAGGTGGAGCCAAAGCAGCAG 

GCGCCTTTGCCCCTGGAGTCAAGACCCACAGCCCTCGGGGACCACCTGGAGTCTCTCCATCCT 

CCACCCCCCGCCTGTGGGATGCCTTGTGGGACGTCTCTTTCTATTCAATAAACAGATGCTGCA 
GCCTCA 
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FIGURE 238 



MMSFLLGAILTLLWAPTAQAEVLLQPDFNAEKFSGLWYVVSMASDCRVFLGKKDHLSMSTRAI 
RPTEEGGLHVHMEFPGADGCNQVDAEYLKVGSEGHFRVPALGYLDVRIVDTDYSSFAVLYIYK 
ELEGALSTMVQLYSRTQDVSPQALKSFQDFYPTLGLPKDMMVMLPQSDACNPESKEAP 

Signal peptide: 

amino acids 1-20 

Tyrosine kinase phosphorylation site. 

amino acids 110-117 

N-myristoylation sites. 

amino acids 7-13, 79-85, 130-136 

Amidation site.

amino acids 50-54 
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FIGURE 239

GGCGCGCTGGTCCAGGTGAGCGGGCGCGTCCCCGCGACGGCGCTGCCTGCCCGAGGCGGTTCA 

CGTAAAGACAGCGAGATCCTGAGGGCCAGCCGGGAAGGAGGCGTGGATATGGAGCTGGCTGCT 

GCCAAGTCCGGGGCCCGCGCCGCTGCCTAGCGCGTCCTGGGGACTCTGTGGGGACGCGCCCCG 

CGCCGCGGCTCGGGGACCCGTAGAGCCCGGCGCTGCGCGCATGGCCCTGCTCTCGCGCCCCGC 

GCTCACCCTCCTGCTCCTCCTCATGGCCGCTGTTGTCAGGTGCCAGGAGCAGGCCCAGACCAC 

CGACTGGAGAGCCACCCTGAAGACCATCCGGAACGGCGTTCATAAGATAGACACGTACCTGAA 

CGCCGCCTTGGACCTCCTGGGAGGCGAGGACGGTCTCTGCCAGTATAAATGCAGTGACGGATC 

TAAGCCTTTCCCACGTTATGGTTATAAACCCTCCCCACCGAATGGATGTGGCTCTCCACTGTT 

TGGTGTTCATCTTAACATTGGTATCCCTTCCCTGACAAAGTGTTGCAACCAACACGACAGGTG 

CTATGAGACCTGTGGCAAAAGCAAGAATGACTGTGATGAAGAATTCCAGTATTGCCTCTCCAA 

GATCTGCCGAGATGTACAGAAAACACTAGGACTAACTCAGCATGTTCAGGCATGTGAAACAAC 

AGTGGAGCTCTTGTTTGACAGTGTTATACATTTAGGTTGTAAACCATATCTGGACAGCCAACG 

AGCCGCATGCAGGTGTCATTATGAAGAAAAAACTGATCTTTGAAGGAGATGCCGACAGCTAGT

GACAGATGAAGATGGAAGAACATAACCTTTGACAAATAACTAATGTTTTTACAACATAAAACT 

GTCTTATTTTTGTGAAAGGATTATTTTGAGACCTTAAAATAATTTATATCTTGATGTTAAAAC 

CTCAAAGCAAAAAAAGTGAGGGAGATAGTGAGGGGAGGGCACGCTTGTCTTCTCAGGTATCTT 

CCCCAGCATTGCTCCCTTACTTAGTATGCCAAATGTCTTGACCAATATCAAAAACAAGTGCTT 

GTTTAGCGGAGAATTTTGAAAAGAGGAATATATAACTCAATTTTCACAACCACATTTACCAAA 

AAAAGAGATCAAATATAAAATTCATCATAATGTCTGTTCAACATTATCTTATTTGGAAAATGG 

GGAAATTATCACTTACAAGTATTTGTTTACTATGAAATTTTAAATACACATTTATGCCTAGAA 

GGAACGGACTTTTTTTTTCTATTTTAATTACACATAATATGTAATTAAAGTACAACATAATAT 

GTTGTTTCTCTGTAGCCCGTTGAGCATATGAGTAAGTCACATTTCTATTAGGACTACTTACAA 

GGACAAGGTTTCCATTTTTCCAGTTGTAAAATTGGAACCATCAGCTGATAACCTCGTAGGGAG 

CAACCCCAGGATAGCTAAGTGTTATGTAATATGCCTAGAAGGTGATGTGAATGCGATTCAGAA 

GCATAGCCACTCCCATTTTATGAGCTACTCACATGACAAATGTCATCTTTTGCTATAACCTTT 

GCCAAGTTAGAGAAAAGATGGATTTAATGAGATAAATGAAAAGATATTTAACCTAAAAAAAAA 
AAAAT^AAAAAAAAAAAAA 




WO 01/40466 PCT/US00/32678

240/550 

FIGURE 240 

MALLSRPALTLLLLLMAAVVRCQEQAQTTDWRATLKTIRNGVHKIDTYLNAALDLLGGEDGLC 
QYKCSDGSKPFPRYGYKPSPPNGCGSPLFGVHLNIGIPSLTKCCNQHDRCYETCGKSKNDCDE 
EFQYCLSKICRDVQKTLGLTQHVQACETTVELLFDSVIHLGCKPYLDSQRAACRCHYEEKTDL 

Important features: 
Signal peptide:

amino acids 1-22 

N-myristoylation sites:

amino acids 57-63,93-99 

Phospholipase A2 histidine active site: 

amino acids 106-114 



Neuraxin and MAP1B proteins repeat proteins Block:

amino acids 109-137 
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FIGURE 241 

GATTCCGAGCGCCTCCACTGCTGGTCCGTTGGCCAGATCAACTCGCCGCGTGGGCCGGCCGTT 

CCCTGAGAGTCTGAGCGCTCGCCGCACCCCCTTCCGAGCTTCTATTGGCCGTAGCAGACGTCC 

GTCTGCCGCTATCTCCGCCCCAATACGGAAGCGGCCTAGTCCTCCGGCTCCGACAGCTGGGTG 

TCCAGGCCATGGGGCAGCCCTGGGCGGCTGGGAGCACGGACGGGGCGCCCGCGCAGCTGCCTC

TCGTGCTCACCGCGCTGTGGGCCGCGGCCGTGGGCCTGGAGCTGGCTTACGTGCTGGTGCTCG 

GTCCCGGGCCGCCGCCGCTGGGACCCCTGGCCCGGGCCTTGCAGCTGGCGCTGGCCGCCTTCC 

AGCTGCTCAACCTGCTGGGCAACGTGGGGCTCTTCCTGCGCTCGGATCCCAGCATCCGTGGCG 

TGATGCTGGCCGGCCGCGGTCTGGGCCAGGGCTGGGCTTACTGCTACCAATGCCAAAGCCAGG

TGCCGCCACGCAGCGGACACTGCTCTGCCTGCCGCGTCTGCATCCTGCGTCGGGACCACCACT 

GCCGCCTGCTGGGCCGCTGCGTGGGCTTCGGCAACTACCGGCCCTTCCTGTGCCTGCTGCTTC 

ATGCCGCCGGCGTCCTGCTCCACGTCTCTGTGCTGCTGGGCCCTGCACTGTCGGCCCTGCTGC 

GAGCCCACACGCCCCTCCACATGGCTGCCCTCCTCCTGCTTCCCTGGCTCATGTTGCTCACAG 

GCAGAGTGTCTCTGGCACAGTTTGCCTTGGCCTTCGTGACGGACACGTGCGTGGCGGGTGCGC 

TGCTGTGCGGGGCTGGGCTGCTCTTCCATGGGATGCTGCTGCTGCGGGGCCAGACCACATGGG 

AGTGGGCTCGGGGCCAGCACTCCTATGACCTGGGTCCCTGCCACAACCTGCAGGCAGCCCTGG 

GGCCCCGCTGGGCCCTCGTCTGGCTCTGGCCCTTCCTGGCCTCCCCATTGCCTGGGGATGGGA 

TCACCTTCCAGACCACAGCAGATGTGGGACACACAGCCTCCTGACTCCAGGAAGAGCCAGAGC

TGTGCAGGGAGGAAGGGGTGAGAGGGGGGCCCCCACACCTAGACTCAGTAAGGAAGTCGGGTT 

GGACCTTAACATCTGCATTGGACAACTCCACCCCTTCCTTGGCCTTGCCCCTGCCCGCCTACA 

CTCCTACGTGTCCAGGGCTTGGGCCGTGACTTAGGCAGAGGAGTGCAGAGGAGGGTCTGGCAG 

GGGCTGCTCAGGCCGCCTAGCTGCCCCTTTGCCAGGTTAATAAAGCACTGACTTGTTAA 



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FIGURE 242 



MGQPWAAGSTDGAPAQLPLVLTALWAAAVGLELAYVLVLGPGPPPLGPLARALQLALAAFQLL
NLLGNVGLFLRSDPSIRGVMLAGRGLGQGWAYCYQCQSQVPPRSGHCSACRVCILRRDHHCRL 
LGRCVGFGNYRPFLCLLLHAAGVLLHVSVLLGPALSALLRAHTPLHMAALLLLPWLMLLTGRV 
SLAQFALAFVTDTCVAGALLCGAGLLFHGMLLLRGQTTWEWARGQHSYDLGPCHNLQAALGPR 
WALVWLWPFLASPLPGDGITFQTTADVGHTAS

Important features:
Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 51-66,143-160,174-191,198-214 
N-myristoylation sites: 

amino acids 2-8,8-14,30-36,81-87,88-94,90-96,206-212 



Leucine zipper pattern: 

amino acids 143-165,150-172,157-179,164-186 
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FIGURE 243 



CTTGTCTTTGTGTCGGTTGTGATTTTCCTAATCTCTGATTTTGCTTTTCTCTCGGACGCTCTC 

CCTCTTCGGACCCATTTTCTCCCGTGCTTCATGCCCTGATAGCCTGGCCCCTTCCCGGCTTCC 

TTCGCTACCGGGGACGCCTCTAGTTTTTCTGAATTTCTGGCTGGCTCCACCCTCCGCGTTCAT 

CTTCCTCAAGAGTTCGCCCCTCTGGGGGCTCCTCTGTGTAATCGTCGCCTTCTCTGGGTATTT 

CTGTGAACTCCGTCTCACACCATCCCGCCATCTTCTCTGCCTTGGCCCCTTTTCTCTGTACAG 

CCAGCTCTGTGTCCTTTTCTTCTCCCCCTCTAAAATCGACTCCTCTTCTCCCTGAGAGCCCCA 

CCTTTGTGCCCCACTCCTCATTTTCCTACGCCTCCCTCTCTCTGCTGGTCCTCTCTCTCCCTG 

CAAGGTTCCATTCCATCAATTTGTTTGTCTTTTGTAGGGGTGGCATCCCCTCTGACTACTGCT 

CCATCCTTTTTTTTTTTTTTTTTTTTTTTTTTGCTTGAGGATTTCACTTCAATCTTTTCTGGT 

TGCGTCTCCACTTGTACTCAGCTTGTTAGGTCCAGGTCCAGTTGTTCTGCATCTGAGGCTGGC 

GTGTGCTGTCTTCTCTGATTGGCCTAATCTCCCTCACCCCCGTGAGATCTGTTGTCAGCCTTC 

GTTTCTCTTTCCTGTGTCCCAGCTTTTCTGCGGGTCTTGGCACCTTTCTTGGCCACAGATTTC 

TGGGTTACAGAGCATGTGTGTCTGAGGCATTGCAGGCAGAAAAGGGTGGCCGACGTGACCTCT 

AGCTGGACTGCTGGGCAGGGGAGCTGTCCTAGATAAAATTGGAAAGAAACAGTGACCCAGAGA 

CAGGTGGACAAAGAATTCGGGGACTGATGGGAACTGAGCTTGGGATCCAGACTGAAACTGATT 

CCAGACTGACCTCTAGCACCCAGGACCCAGACACAGGGCCATGGGACCCCAGCATTTGAGACT

TGTGCAGCTGTTCTGCCTTCTAGGGGCCATCCCCACTCTGCCTCGGGCTGGAGCTCTTTTGTG 

CTATGAAGCAACAGCCTCAAGATTCAGAGCTGTTGCTTTCCATAACTGGAAGTGGCTTCTGAT 

GAGGAACATGGTGTGTAAGCTGCAAGAGGGCTGCGAGGAGACGCTAGTGTTCATTGAGACAGG 

GACTGCAAGGGGAGTTGTGGGCTTTAAAGGCTGCAGCTCGTCTTCGTCTTACCCTGCGCAAAT 

CTCCTACCTTGTTTCCCCACCCGGAGTGTCCATTGCCTCCTACAGTCGCGTCTGCCGGTCTTA 

TCTCTGCAACAACCTCACCAATTTGGAGCCTTTTGTGAAACTCAAGGCCAGCACTCCTAAGTC 

TATCACATCTGCGTCCTGTAGCTGCCCGACCTGTGTGGGCGAGCACATGAAGGATTGCCTCCC 

AAATTTTGTCACCACTAATTCTTGCCCCTTGGCTGCTTCTACGTGTTACAGTTCCACCTTAAA 

ATTTCAGGCAGGGTTTCTCAATACCACCTTCCTCCTCATGGGGTGTGCTCGTGAACATAACCA 

GCTTTTAGCAGATTTTCATCATATTGGGAGCATCAAAGTGACTGAGGTCCTCAACATCTTAGA 

GAAGTCTCAGATTGTTGGTGCAGCATCCTCCAGGCAAGATCCTGCTTGGGGTGTCGTCTTAGG 

CCTCCTGTTTGCCTTCAGGGACTGACCATCTAGCTGCACCCGACAAGCACCCAGACTCTTTCA 

CATAACAAATAAAATAGCAGAGTTCCCTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAA 
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FIGURE 244 

MGPQHLRLVQLFCLLGAIPTLPRAGALLCYEATASRFRAVAFHNWKWLLMRNMVCKLQEGCEE 
TLVFIETGTARGVVGFKGCSSSSSYPAQISYLVSPPGVSIASYSRVCRSYLCNNLTNLEPFVK 
LKASTPKSITSASCSCPTCVGEHMKDCLPNFVTTNSCPLAASTCYSSTLKFQAGFLNTTFLLM 
GCAREHNQLLADFHHIGSIKVTEVLNILEKSQIVGAASSRQDPAWGVVLGLLFAFRD 

Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation sites: 

amino acids 117-121,183-187 

N-myristoylation sites:

amino acids 16-22,25-31,60-66,71-77,81-87,100-106,224-230, 
235-241,239-245 

Prokaryotic membrane lipoprotein lipid attachment site: 

amino acids 181-192 
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FIGURE 245 

GTGGAGTTGGGTGGTGTCGGGAGCCTCTCCCTGAGGGGCACCGCGTCTTCAGGAGCTGGGCCTCCAGTGCGGCGC 
GATGTCAGGCGCGGTGACAGCTCTGTGAGTCCGAGGCCGCGGCCGTGGCGCTGGGCGGCTGCGGGGCCTGACCGG 
TCCGCTCATGGTGCCGCCACGACGCCATCGCGGGGCAGGAAGGCCAGGGGTGCTGAGTTCTTCACCTCCTTTTAG
ACTGAGATCTGCCAAGTTTTCCGGCATTGCTCTTGAGGATCTCAGAAGGGCTCTTAAGACAAGACTGCAAATGGT 
GTGTGTATTTGTCATGAACCGAATGAATTCCCAGAACAGTGGTTTCACTCAGCGCAGGCGAATGGCTCTTGGGAT 
TGTTATTCTTCTGCTTGTTGATGTGATATGGGTTGCTTCCTCTGAACTTACTTCGTATGTTTTTACCCAGTACAA 
CAAACCATTCTTCAGCACCTTTGCAAAAACATCTATGTTTGTTTTGTACCTTTTGGGCTTTATTATTTGGAAGCC 
ATGGAGACAACAGTGTACAAGAGGACTTCGCGGAAAGCATGCTGCTTTTTTTGCAGATGCTGAAGGTTACTTTGC 
TGCTTGCACAACAGATACAACTATGAATAGTTCTTTGAGTGAACCTCTGTATGTGCCTGTGAAATTCCATGATCT 
TCCAAGTGAAAAACCTGAGAGCACAAACATTGATACTGAAAAAACCCCCAAAAAGTCTCGTGTGAGGTTCAGTAA 
TATCATGGAGATTCGACAGCTTCCGTCAAGTCATGCATTGGAAGCAAAGTTGTCTCGCATGTCATATCCTGTGAA
AGAACAAGAATCCATACTGAAAACTGTGGGGAAACTTACTGCAACTCAAGTAGCGAAAATTAGCTTTTTTTTTTG 
CTTTGTGTGGTTTTTGGCAAATTTGTCATATCAAGAAGCACTTTCAGACACACAAGTTGCTATAGTTAATATTTT 
ATCTTCAACTTCCGGACTTTTTACCTTAATCCTTGCTGCAGTATTTCCAAGTAACAGTGGAGATAGATTTACCCT 
TTCTAAACTATTAGCTGTAATTTTAAGCATTGGAGGCGTTGTACTGGTAAACCTGGCAGGGTCTGAAAAACCTGC 
TGGAAGAGACACAGTAGGTTCCATTTGGTCTCTTGCTGGAGCCATGCTCTATGCTGTCTATATTGTTATGATTAA 
GAGAAAAGTAGATAGAGAAGACAAGTTGGATATTCCAATGTTCTTTGGTTTTGTAGGTTTGTTTAATCTGCTGCT 
CTTATGGCCAGGTTTCTTTTTACTTCATTATACTGGATTTGAGGACTTCGAGTTTCCCAATAAAGTAGTATTAAT
GTGCATTATCATTAATGGCCTTATTGGAACAGTACTCTCAGAGTTCCTGTGGTTGTGGGGCTGCTTTCTTACCTC 
ATCATTGATAGGCACACTTGCACTAAGCCTTACAATACCTCTGTCCATAATAGCTGACATGTGTATGCAAAAGGT 
GCAGTTTTCTTGGTTATTTTTTGCAGGAGCTATCCCTGTATTTTTTTCATTTTTTATTGTAACTCTCCTATGCCA 
TTATAATAATTGGGATCCTGTGATGGTGGGAATCAGAAGAATATTTGCTTTTATATGCAGAAAACATCGAATTCA 
GAGAGTTCCAGAAGACAGCGAACAGTGTGAGAGTCTCATTTCTATGCACAGTGTTTCTCAGGAGGATGGAGCTAG 
TTAGCTGTCTGTTGTCTGTAGCCCAGCTTGATAATGGAACTATACAGCGAAGAGACAATCTCTGGCAAGTTTTTG 
TAGAAAAAATGTTTCAGTGCCTAGTCTGAAAAATAACAGTTTGAGTTCTTTGAAACTCTAAAATATATTTTTCTC 
ATACCTGTTTTCTTCATTTTCATAATGAAGCACTTTGCTATGTAGCTGTGTACATATCACTACAGTTATAGGAAG 
TTTCAGTCTACAGTCCATCCAAAGGACCAACCTGCCTTACACATCTCAAGGAATTCAGCTGTTGAAATCATTTGA 
ACTAATCAAGGAATAAATCCTAATGTTCTGGGACTTTATTTTCACATGTTAAATGCTGGAATATATTATGAAAAT 
GTTTTCAAGAAATCACTTAAGTGTTCATAGACCAGTATTTCTGACAGGTAAAATGCTAAAATAAGCTACCTGTAA 
TAAGTGTGGATTATATTTTTGGGTTTTGTAGAATATTGCAAATTAACCACACAAAAAATGTTTAATTTATGCAAC 
AAGCATGTTTGTGCAAATTTCATGGGACTTTAAAAAGAATAAGTATTTGAGAAAATATCTGGTTCACTTACACTA 
CATTTACTGTATTATTCTTTTATAGCATTAGGTGCCTTGTATTTTAAATCTGTGACAAACCATGGCAAATTTTTA 
AAGGGGAAGTATTATTATAAAATGAAGAAATATGTATTTCTAAAGGCTATATTGCTGTAAACTTAATTGATAAAG 
CTCTGTTTAATTTAGAGTTTTGAAGAAATAGTCTCCCTTCAATTAAGAAATTTTCATAATGGAATGATTTAAATT 
GAAGTGACAAAGAGTATTATTAAAATACAATGTTTATAAAAAAA 
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FIGURE 246 



MVPPRRHRGAGRPGVLSSSPPFRLRSAKFSGIALEDLRRALKTRLQMVCVFVMNRMNSQNSGF

TQRRRMALGIVILLLVDVIWVASSELTSYVFTQYNKPFFSTFAKTSMFVLYLLGFIIWKPWRQ 

QCTRGLRGKHAAFFADAEGYFAACTTDTTMNSSLSEPLYVPVKFHDLPSEKPESTNIDTEKTP 

KKSRVRFSNIMEIRQLPSSHALEAKLSRMSYPVKEQESILKTVGKLTATQVAKISFFFCFVWF 

LANLSYQEALSDTQVAIVNILSSTSGLFTLILAAVFPSNSGDRFTLSKLLAVILSIGGVVLVN

LAGSEKPAGRDTVGSIWSLAGAMLYAVYIVMIKRKVDREDKLDIPMFFGFVGLFNLLLLWPGF 

FLLHYTGFEDFEFPNKVVLMCIIINGLIGTVLSEFLWLWGCFLTSSLIGTLALSLTIPLSIIA 

DMCMQKVQFSWLFFAGAIPVFFSFFIVTLLCHYNNWDPVMVGIRRIFAFICRKHRIQRVPEDS 
EQCESLISMHSVSQEDGAS 

Important features:
Transmembrane domain: 

amino acids 69-87,105-118,237-256,266-285,300-316,332-346, 
364-379,399-419,453-472 

N-glycosylation sites:

amino acids 157-161,255-259 

N-myristoylation sites:

amino acids 14-20,329-335,404-410,407-413,418-424 
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FIGURE 247 

CGTCTGTAGAGATATCATGAACTTCAACTTAGCTTTGGTACTTTCTTCCCTGAAGACAGAGGG 
CAGAACTCTGAGTTCCAGAACCATTTTCAACTGTATTGGGGACCAATCACTTGACTCTATTCT 
TGTCTCTCTGACAGATGACGCTACACTCTCCTCTGAATAATGGACACCATTTCTAAAACTGAA 
TCCTGCTACTAAAATAATTCAGATGATATATTTTTCCAATTCTACAATCTTGCTTTGTTTTAT 
TTAGTTGTTTTCTCTCTCTCTTCCCAGTTTTCCAGAGACTGGAGCTAAACTGGGCTTTCAACA 
TCATCATGAAGTTTATCCTCCTCTGGGCCCTCTTGAATCTGACTGTTGCTTTGGCCTTTAATC 
CAGATTACACAGTCAGCTCCACTCCCCCTTACTTGGTCTATTTGAAATCTGACTACTTGCCCT 
GCGCTGGAGTCCTGATCCACCCGCTTTGGGTGATCACAGCTGCACACTGCAATTTACCAAAGC 
TTCGGGTGATATTGGGGGTTACAATCCCAGCAGACTCTAATGAAAAGCATCTGCAAGTGATTG 
GCTATGAGAAGATGATTCATCATCCACACTTCTCAGTCACTTCTATTGATCATGACATCATGC 
TAATCAAGCTGAAAACAGAGGCTGAACTCAATGACTATGTGAAATTAGCCAACCTGCCCTACC 
AAACTATCTCTGAAAATACCATGTGCTCTGTCTCTACCTGGAGCTACAATGTGTGTGATATCT 
ACAAAGAGCCCGATTCACTGCAAACTGTGAACATCTCTGTAATCTCCAAGCCTCAGTGTCGCG 
ATGCCTATAAAACCTACAACATCACGGAAAATATGCTGTGTGTGGGCATTGTGCCAGGAAGGA 
GGCAGCCCTGCAAGGAAGTTTCTGCTGCCCCGGCAATCTGCAATGGGATGCTTCAAGGAATCC 
TGTCTTTTGCGGATGGATGTGTTTTGAGAGCCGATGTTGGCATCTATGCCAAAATTTTTTACT 
ATATACCCTGGATTGAAAATGTAATCCAAAATAACTGAGCTGTGGCAGTTGTGGACCATATGA
CACAGCTTGTCCCCATCGTTCACCTTTAGAATTAAATATAAATTAACTCCTC 
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FIGURE 248 

MKFILLWALLNLTVALAFNPDYTVSSTPPYLVYLKSDYLPCAGVLIHPLWVITAAHCNLPKLR 
VILGVTIPADSNEKHLQVIGYEKMIHHPHFSVTSIDHDIMLIKLKTEAELNDYVKLANLPYQT
ISENTMCSVSTWSYNVCDIYKEPDSLQTVNISVISKPQCRDAYKTYNITENMLCVGIVPGRRQ 
PCKEVSAAPAICNGMLQGILSFADGCVLRADVGIYAKIFYYIPWIENVIQNN

Important features: 
Signal peptide: 

amino acids 1-17 

N-glycosylation sites:

amino acids 11-15,156-160,173-177 

Tyrosine kinase phosphorylation site: 

amino acids 108-117 

N-myristoylation sites: 
amino acids 182-188,203-209 


Amidation site: 

amino acids 185-189 

Serine proteases, trypsin family, histidine active site: 

amino acids 52-58 
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FIGURE 249 

GCGAGGCGGCCGCTGTCTTCTGCTGCGGCTTCCGCGACCACAAGTACTGCTGCGACGACCCGC 

ACAGCTTCTTCCCCTACGAGCACAGCTACATGTGGTGGCTCAGCATTGGCGCTCTCATAGGCC 

TGTCCGTAGCAGCAGTGGTTCTTCTCGCCTTCATTGTTACCGCCTGTGTGCTCTGCTACCTGT 

TCATCAGCTCTAAGCCCCACACAAAGTTGGACCTGGGCTTGAGCTTACAGACAGCAGGCCCTG 

AGGAGGTTTCTCCTGACTGCCAAGGTGTGAACACAGGCATGGCGGCAGAAGTGCCAAAAGTGA 

GCCCTCTCCAGCAGAGTTACTCCTGCTTGAACCCGCAGCTGGAGAGCAATGAGGGGCAGGCTG 

TGAACTCCAAACGCCTCCTCCATCATTGCTTCATGGCCACAGTGACCACCAGTGACATTCCAG 

GCAGCCCTGAGGAAGCCTCTGTACCCAACCCTGACCTATGTGGACCAGTCCCATAAACATTCA 
ATAAATGTCTCCATACCATC/^ 
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FIGURE 250 



MWWLSIGALIGLSVAAVVLLAFIVTACVLCYLFISSKPHTKLDLGLSLQTAGPEEVSPDCQGV
NTGMAAEVPKVSPLQQSYSCLNPQLESNEGQAVNSKRLLHHCFMATVTTSDIPGSPEEASVPN 
PDLCGPVP 

Important features: 
Signal peptide: 

Amino acids 1-26 

N-myristoylation sites: 

Amino acids 7-13,11-17,62-68,93-99 
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FIGURE 251 

GTGGTTTGGATTGAGCCGGGCCCGGCCGGGGCGCCGAGTCGGAGGGGGTGGCAGTGAGCGGCG 

GCAGAGGCTACGGGGCTCGGTTTGGCTGACTGGGGAGTCGGCAGGCGGCAGGAACCATGCGAG 

GCCAGCGGAGCCTGCTGCTGGGCCCGGCCCGCCTCTGCCTCCGCCTCCTTCTGCTGCTGGGTT 

ACAGGCGCCGCTGTCCACCTCTACTCCGGGGTCTAGTACAGCGCTGGCGCTACGGCAAGGTCT 

GCCTGCGCTCCCTGCTCTACAACTCCTTTGGGGGCAGTGACACCGCTGTTGATGCTGCCTTTG 

AGCCTGTCTACTGGCTGGTAGACAACGTGATCCGCTGGTTTGGAGTGGTGTTCGTGGTCCTGG 

TGATCGTGCTGACAGGCTCCATTGTAGCTATCGCCTACCTGTGTGTCCTGCCTCTCATCCTCC 

GAACCTACTCAGTGCCACGACTCTGCTGGCATTTCTTCTATAGCCACTGGAATCTGATCCTGA 

TTGTCTTCCACTACTACCAGGCCATCACCACTCCGCCTGGGTACCCACCCCAGGGCAGGAATG 

ATATCGCCACCGTCTCCATCTGTAAGAAGTGCATTTACCCCAAGCCAGCCCGAACACACCACT 

GCAGCATCTGCAACAGGTGTGTGCTGAAGATGGATCACCACTGCCCCTGGCTAAACAATTGTG 

TGGGCCACTATAACCATCGGTACTTCTTCTCTTTCTGCTTTTTCATGACTCTGGGCTGTGTCT 

ACTGCAGCTATGGAAGTTGGGACCTTTTCCGGGAGGCTTATGCTGCCATTGAGACTTATCACC 

AGACCCCACCACCCACCTTCTCCTTTCGAGAAAGGATGACTCACAAGAGTCTTGTCTACCTCT 

GGTTCCTGTGCAGTTCTGTGGCACTTGCCCTGGGTGCCCTAACTGTATGGCATGCTGTTCTCA 

TCAGTCGAGGTGAGACTAGCATCGAAAGGCACATCAACAAGAAGGAGAGACGTCGGCTACAGG 

CCAAGGGCAGAGTATTTAGGAATCCTTACAACTACGGCTGCTTGGACAACTGGAAGGTATTCC 

TGGGTGTGGATACAGGAAGGCACTGGCTTACTCGGGTGCTCTTACCTTCTAGTCACTTGCCCC 

ATGGGAATGGAATGAGCTGGGAGCCCCCTCCCTGGGTGACTGCTCACTCAGCCTCTGTGATGG 

CAGTGT^GCTGGACTGTGTCAGCCACGACTCGAGCACTCATTCTGCTCCCTATGTTATTTCA 

AGGGCCTCCAAGGGCAGCTTTTCTCAGAATCCTTGATCAAAAAGAGCCAGTGGGCCTGCCTTA 

GGGTACCATGCAGGACAATTCAAGGACCAGCCTTTTTACCACTGCAGAAGAAAGACACAATGT 

GGAGAAATCTTAGGACTGACATCCCTTTACTCAGGCAAACAGAAGTTCCAACCCCAGACTAGG 

GGTCAGGCAGCTAGCTACCTACCTTGCCCAGTGCTGACCCGGACCTCCTCCAGGATACAGCAC 

TGGAGTTGGCCACCACCTCTTCTACTTGCTGTCTGAAAAAACACCTGACTAGTACAGCTGAGA 

TCTTGGCTTCTCAACAGGGCAAAGATACCAGGCCTGCTGCTGAGGTCACTGCCACTTCTCACA 

TGCTGCTTAAGGGAGCACAAATAAAGGTATTCGATTTTTAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 252 



MRGQRSLLLGPARLCLRLLLLLGYRRRCPPLLRGLVQRWRYGKVCLRSLLYNSFGGSDTAVDA 
AFEPVYWLVDNVIRWFGVVFVVLVIVLTGSIVAIAYLCVLPLILRTYSVPRLCWHFFYSHWNL
ILIVFHYYQAITTPPGYPPQGRNDIATVSICKKCIYPKPARTHHCSICNRCVLKMDHHCPWLN 
NCVGHYNHRYFFSFCFFMTLGCVYCSYGSWDLFREAYAAIETYHQTPPPTFSFRERMTHKSLV 
YLWFLCSSVALALGALTVWHAVLISRGETSIERHINKKERRRLQAKGRVFRNPYNYGCLDNWK 
VFLGVDTGRHWLTRVLLPSSHLPHGNGMSWEPPPWVTAHSASVMAV 

Important features:
Transmembrane domain:

amino acids 88-100,202-216,254-274 

N-myristoylation sites:

amino acids 55-61,56-62,92-98,210-216,309-315,319-325,340-346

Prokaryotic membrane lipoprotein lipid attachment site: 

amino acids 201-212 
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FIGURE 253 



GATCAAGCGCCTTCCTTTCCCTTCCTCTCCCTACTTGGCCTTTGCCCTAAGCCAAGACCTGGCCATCAGCCTGGC 

TGCAGGGGCCTGCAGAGCCAGCTGCACTTTTTCAGGTATGGGGGAGGGCCAGGCACCATGAAGCCAGTGTGGGTC

GCCACCCTTCTGTGGATGCTACTGCTGGTGCCCAGGCTGGGGGCCGCCCGGAAGGGGTCCCCAGAAGAGGCCTCC 

TTCTACTATGGAACCTTCCCTCTTGGCTTCTCCTGGGGCGTGGGCAGTTCTGCCTACCAGACGGAGGGCGCCTGG 

GACCAGGACGGGAAAGGGCCTAGCATCTGGGACGTCTTCACACACAGTGGGAAGGGGAAAGTGCTTGGGAATGAG 

ACGGCAGATGTAGCCTGTGACGGCTACTACAAGGTCCAGGAGGACATCATTCTGCTGAGGGAACTGCACGTCAAC 

CACTACCGATTCTCCCTGTCTTGGCCCCGGCTCCTGCCCACAGGCATCCGAGCCGAGCAGGTGAACAAGAAGGGA 

ATCGAATTCTACAGTGATCTTATCGATGCCCTTCTGAGCAGCAACATCACTCCCATCGTGACCTTGCACCACTGG 

GATCTGCCACAGCTGCTCCAGGTCAAATACGGTGGGTGGCAGAATGTGAGCATGGCCAACTACTTCAGAGACTAC 

GCCAACCTGTGCTTTGAGGCCTTTGGGGACCGTGTGAAGCACTGGATCACGTTCAGTGATCCTCGGGCAATGGCA 

GAAAAAGGCTATGAGACGGGCCACCATGCGCCGGGCCTGAAGCTCCGCGGCACCGGCCTGTACAAGGCAGCACAC 

CACATCATTAAGGCCCACGCCAAAACCTGGCATTCTTATAACACCACGTGGCGCAGCAAGCAGCAAGGTCTGGTG 

GGAATTTCACTGAACTGTGACTGGGGGGAACCTGTGGACATTAGTAACCCCAAGGACCTAGAGGCTGCCGAGAGA 

TACCTACAGTTCTGTCTGGGCTGGTTTGCCAACCCCATTTATGCCGGTGACTACCCCCAAGTCATGAAGGACTAC 

ATTGGAAGAAAGAGTGCAGAGCAAGGCCTGGAGATGTCGAGGTTACCGGTGTTCTCACTCCAGGAGAAGAGCTAC 

ATTAAAGGCACATCCGATTTCTTGGGATTAGGTCATTTTACTACTCGGTACATCACGGAAAGGAACTACCCCTCC 

CGCCAGGGGCCCAGCTACCAGAACGATCGTGACTTGATAGAGCTGGTTGACCCAAACTGGCCAGATCTGGGGTCT 

AAATGGCTATATTCTGTGCCATGGGGATTTAGGAGGCTCCTTAACTTTGCTCAGACTCAATACGGTGATCCTCCC 

ATATATGTGATGGAAAATGGAGCATCTCAAAAATTCCACTGTACTCAATTATGTGATGAGTGGAGAATTCAATAC 

CTTAAAGGATACATAAATGAAATGCTAAAAGCTATAAAAGATGGTGCTAATATAAAGGGGTATACTTCCTGGTCT 

CTGTTGGATAAGTTTGAATGGGAGAAAGGATACTCAGATAGATATGGATTCTACTATGTTGAATTTAACGACAGA 

AATAAGCCTCGCTATCCAAAGGCTTCAGTTCAATATTACAAGAAGATTATCATTGCCAATGGGTTTCCCT^TCCA 

AGAGAGGTGGAAAGTTGGTACCTCAAAGCTTTGGAAACTTGCTCTATCAACAATCAGATGCTTGCTGCAGAGCCT 

TTGCTAAGTCACATGCAAATGGTTACGGAGATCGTGGTACCCACTGTCTGCTCCCTCTGTGTCCTCATCACTGCT 

GTTCTACTAATGCTCCTCCTGAGGAGGCAGAGCTGAGACAGGATTATCAATTTTGGAGCTTCATAAGAG7UVTCTT 

CAGGATCTTCCTCCCTTTTCTGCTTTGAGGGTTTCCATACATTGCTGTTTTCAGGTTCTACAATAATTACCTTTT 

TTTCTCTTTCTCTTTTTGGCTTGTGCTGGGATTTAAGAATTAGAAAATAAAAATAAGCAGAAATTA 
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FIGURE 254 



MKPVWVATLLWMLLLVPRLGAARKGSPEEASFYYGTFPLGFSWGVGSSAYQTEGAWDQDGKGPSIWDVFTHSGKG 
KVLGNETADVACDGYYKVQEDIILLRELHVNHYRFSLSWPRLLPTGIRAEQVNKKGIEFYSDLIDALLSSNITPI 
VTLHHWDLPQLLQVKYGGWQNVSMANYFRDYANLCFEAFGDRVKHWITFSDPRAMAEKGYETGHHAPGLKLRGTG 
LYKAAHHIIKAHAKTWHSYNTTWRSKQQGLVGISLNCDWGEPVDISNPKDLEAAERYLQFCLGWFANPIYAGDYP 
QVMKDYIGRKSAEQGLEMSRLPVFSLQEKSYIKGTSDFLGLGHFTTRYITERNYPSRQGPSYQNDRDLIELVDPN
WPDLGSKWLYSVPWGFRRLLNFAQTQYGDPPIYVMENGASQKFHCTQLCDEWRIQYLKGYINEMLKAIKDGANIK 
GYTSWSLLDKFEWEKGYSDRYGFYYVEFNDRNKPRYPKASVQYYKKIIIANGFPNPREVESWYLKALETCSINNQ 
MLAAEPLLSHMQMVTEIVVPTVCSLCVLITAVLLMLLLRRQS 

Important features: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 
amino acids 541-558 

N-glycosylation sites: 

amino acids 80-84,171-175,245-249

Glycosaminoglycan attachment site: 

amino acids 72-76

cAMP- and cGMP-dependent protein kinase phosphorylation sites: 
amino acids 23-27,564-568 

Tyrosine kinase phosphorylation sites: 
amino acids 203-211,347-355,460-468,507-514 

N-myristoylation sites:

amino acids 44-50,79-85,167-173,225-231,257-263,315-321 

Amidation site: 

amino acids 307-311 

Glycosyl hydrolases family 1 active site: 
amino acids 407-416 

Glycosyl hydrolases family 1 N-terminal signature:

amino acids 41-56 

Motif name Glycosyl hydrolases family:

amino acids 37-67
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FIGURE 255 



CGCGAAGATGCGAAAGGTGGTTTTGATCACCGGGGCTAGCAGTGGCATTGGCCTGGCCCTCTG 

CAAGCGGCTGCTGGCGGAAGATGATGAGCTTCATCTGTGTTTGGCGTGCAGGAACATGAGCAA 

GGCAGAAGCTGTCTGTGCTGCTCTGCTGGCCTCTCACCCCACTGCTGAGGTCACCATTGTCCA 

GGTGGATGTCAGCAACCTGCAGTCGGTCTTCCGGGCCTCCAAGGAACTTAAGCAAAGGTTTCA 

GAGATTAGACTGTATATATCTAAATGCTGGGATCATGCCTAATCCACAACTAAATATCAAAGC 

ACTTTTCTTTGGCCTCTTTTCAAGAAAAGTGATTCATATGTTCTCCACAGCTGAAGGCCTGCT 

GACCCAGGGTGATAAGATCACTGCTGATGGACTTCAGGAGGTGTTTGAGACCAATGTCTTTGG 

CCATTTTATCCTGATTCGGGAACTGGAGCCTCTCCTCTGTCACAGTGACAATCCATCTCAGCT 

CATCTGGACATCATCTCGCAGTGCAAGGAAATCTAATTTCAGCCTCGAGGACTTCCAGCACAG 

CAAAGGCAAGGAACCCTACAGCTCTTCCAAATATGCCACTGACCTTTTGAGTGTGGCTTTGAA 

CAGGAACTTCAACCAGCAGGGTCTCTATTCCAATGTGGCCTGTCCAGGTACAGCATTGACCAA 

TTTGACATATGGAATTCTGCCTCCGTTTATATGGACGCTGTTGATGCCGGCAATATTGCTACT 

TCGCTTTTTTGCAAATGCATTCACTTTGACACCATATAATGGAACAGAAGCTCTGGTATGGCT 

TTTCCACCAAAAGCCTGAATCTCTCAATCCTCTGATCAAATATCTGAGTGCCACCACTGGCTT 

TGGAAGAAATTATATTATGACCCAGAAGATGGACCTAGATGAAGACACTGCTGAAAAATTTTA 

TCAAAAGTTACTGGAACTGGAAAAGCACATTAGGGTCACTATTCAAAAAACAGATAATCAGGC 

CAGGCTCAGTGGCTCATGCCTA^TTCCAGCACTTTGGGAGGCCAAGGCAGAAGGATCACTT 

GAGACCAGGAGTTCAAGACCAGCCTGAGAAACATAGTGAGCCCTTGTCTCTACAAAAAGAAAT 

AAAAATAATAGCTGGGTGTGGTGGCATGCGCATGTAGTCCCAGCTACTCAGAAGGATGAGGTG 

GGAGGATCTCTTGAGGCTGGGAGGCAGAGGTTGCAGTGAGCTGAGATTGTGCCACTGCACTCC 

AGCCTGGGTGACAGCGAGACCCTGTCTCAAAATATGTATATATTTAATATATATATAAAACCA 

GAGCTGACAATGACACTCTGGAACATTGCATACCTTCTGTACATTCTGGGGTACATGGATTTC 

TACTGAGTTGGATAATATGCATTTGTAATAAACTATGAACTATGAA 
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FIGURE 256 

MRKVVLITGASSGIGLALCKRLLAEDDELHLCLACRNMSKAEAVCAALLASHPTAEVTIVQVD
VSNLQSVFRASKELKQRFQRLDCIYLNAGIMPNPQLNIKALFFGLFSRKVIHMFSTAEGLLTQ 
GDKITADGLQEVFETNVFGHFILIRELEPLLCHSDNPSQLIWTSSRSARKSNFSLEDFQHSKG 
KEPYSSSKYATDLLSVALNRNFNQQGLYSNVACPGTALTNLTYGILPPFIWTLLMPAILLLRF 
FANAFTLTPYNGTEALVWLFHQKPESLNPLIKYLSATTGFGRNYIMTQKMDLDEDTAEKFYQK 
LLELEKHIRVTIQKTDNQARLSGSCL

Important features:
Transmembrane domain: 

amino acids 234-254 

N-glycosylation sites: 

amino acids 37-41,178-182,229-233,263-267 

Glycosaminoglycan attachment site: 

amino acids 12-16 

N-myristoylation sites:

amino acids 9-15,13-19,15-21,215-221,224-230 
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FIGURE 257 

CGGACGCGTGGGGCCGTATGCGCGGCTCTGTGGAGTGCACCTGGGGTTGGGGGCACTGTGCCC

CCAGCCCCCTGCTCCTTTGGACTCTACTTCTGTTTGCAGCCCCATTTGGCCTGCTGGGGGAGA 

AGACCCGCCAGGTGTCTCTGGAGGTCATCCCTAACTGGCTGGGCCCCCTGCAGAACCTGCTTC 

ATATACGGGCAGTGGGCACCAATTCCACACTGCACTATGTGTGGAGCAGCCTGGGGCCTCTGG 

CAGTGGTAATGGTGGCCACCAACACCCCCCACAGCACCCTGAGCATCAACTGGAGCCTCCTGC 

TATCCCCTGAGCCCGATGGGGGCCTGATGGTGCTCCCTAAGGACAGCATTCAGTTTTCTTCTG 

CCCTTGTTTTTACCAGGCTGCTTGAGTTTGACAGCACCAACGTGTCCGATACGGCAGCAAAGC 

CTTTGGGAAGACCATATCCTCCATACTCCTTGGCCGATTTCTCTTGGAACAACATCACTGATT 

CATTGGATCCTGCCACCCTGAGTGCCACATTTCAAGGCCACCCCATGAACGACCCTACCAGGA 

CTTTTGCCAATGGCAGCCTGGCCTTCAGGGTCCAGGCCTTTTCCAGGTCCAGCCGACCAGCCC 

AACCCCCTCGCCTCCTGCACACAGCAGACACCTGTCAGCTAGAGGTGGCCCTGATTGGAGCCT 

CTCCCCGGGGAAACCGTTCCCTGTTTGGGCTGGAGGTAGCCACATTGGGCCAGGGCCCTGACT 

GCCCCTCAATGCAGGAGCAGCACTCCATCGACGATGAATATGCACCGGCCGTCTTCCAGTTGG 

ACCAGCTACTGTGGGGCTCCCTCCCATCAGGCTTTGCACAGTGGCGACCAGTGGCTTACTCCC 

AGAAGCCGGGGGGCCGAGAATCAGCCCTGCCCTGCCAAGCTTCCCCTCTTCATCCTGCCTTAG 

CATACTCTCTTCCCCAGTCACCCATTGTCCGAGCCTTCTTTGGGTCCCAGAATAACTTCTGTG 

CCTTCAATCTGACGTTCGGGGCTTCCACAGGCCCTGGCTATTGGGACCAACACTACCTCAGCT 

GGTCGATGCTCCTGGGTGTGGGCTTCCCTCCAGTGGACGGCTTGTCCCCACTAGTCCTGGGCA 

TCATGGCAGTGGCCCTGGGTGCCCCAGGGCTCATGCTGCTAGGGGGCGGCTTGGTTCTGCTGC 

TGCACCACAAGAAGTACTCAGAGTACCAGTCCATAAATTAAGGCCCGCTCTCTGGAGGGAAGG 

ACATTACTGAACCTGTCTTGCTGTGCCTCGAAACTCTGGAGGTTGGAGCATCAAGTTCCAGCC 

GGCCCCTTCACTCCCCCATCTTGCTTTTCTGTGGAACCTCAGAGGCCAGCCTCGACTTCCTGG 

AGACCCCCAGGTGGGGCTTCCTTCATACTTTGTTGGGGGACTTTGGAGGCGGGCAGGGGACAG 

GGCTATTGATAAGGTCCCCTTGGTGTTGCCTTCTTGCATCTCCACACATTTCCCTTGGATGGG 

ACTTGCAGGCCTAAATGAGAGGCATTCTGACTGGTTGGCTGCCCTGGAAGGCAAGAAAATAGA 
TTTATTTTTTTTCACAGGGAAAAAAAAAAAA 
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FIGURE 258 

MRGSVECTWGWGHCAPSPLLLWTLLLFAAPFGLLGEKTRQVSLEVIPNWLGPLQNLLHIRAVG
TNSTLHYVWSSLGPLAVVMVATNTPHSTLSINWSLLLSPEPDGGLMVLPKDSIQFSSALVFTR 
LLEFDSTNVSDTAAKPLGRPYPPYSLADFSWNNITDSLDPATLSATFQGHPMNDPTRTFANGS 
LAFRVQAFSRSSRPAQPPRLLHTADTCQLEVALIGASPRGNRSLFGLEVATLGQGPDCPSMQE 
QHSIDDEYAPAVFQLDQLLWGSLPSGFAQWRPVAYSQKPGGRESALPCQASPLHPALAYSLPQ 
SPIVRAFFGSQNNFCAFNLTFGASTGPGYWDQHYLSWSMLLGVGFPPVDGLSPLVLGIMAVAL 
GAPGLMLLGGGLVLLLHHKKYSEYQSIN 

Important features: 
Signal peptide: 

amino acids 1-35 

Transmembrane domain: 

amino acids 365-386 

N-glycosylation sites: 

amino acids 65-69,95-99,134-138,159-163,187-191,230-234,333-337

cAMP- and cGMP-dependent protein kinase phosphorylation site: 

amino acids 397-401 

N-myristoylation sites:

amino acids 3-9,63-69,235-241,273-279,292-298,324-330 



Leucine zipper pattern: 

amino acids 371-393 
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FIGURE 259 

CAGGCGGGCCCCCGCGCGGCAGGGCCCTGGACCCGCGCGGCTCCCGGGGATGGTGAGCAAGGCGCTGCTGCGCCT 

CGTGTCTGCCGTCAACCGCAGGAGGATGAAGCTGCTGCTGGGCATCGCCTTGCTGGCCTACGTCGCCTCTGTTTG 

GGGCAACTTCGTTAATATGAGGTCTATCCAGGAAAATGGTGAACTAAAAATTGAAAGCAAGATTGAAGAGATGGT 

TGAACCACTAAGAGAGAAAATCAGAGATTTAGAAAAAAGCTTTACCCAGAAATACCCACCAGTAAAGTTTTTATC 

AGAAAAGGATCGGAAAAGAATTTTGATAACAGGAGGCGCAGGGTTCGTGGGCTCCCATCTAACTGACAAACTCAT 

GATGGACGGCCACGAGGTGACCGTGGTGGACAATTTCTTCACGGGCAGGAAGAGAAACGTGGAGCACTGGATCGG

ACATGAGAACTTCGAGTTGATTAACCACGACGTGGTGGAGCCCCTCTACATCGAGGTTGACCAGATATACCATCT 

GGCATCTCCAGCCTCCCCTCCAAACTACATGTATAATCCTATCAAGACATTAAAGACCAATACGATTGGGACATT 

AAACATGTTGGGGCTGGCAAAACGAGTCGGTGCCCGTCTGCTCCTGGCCTCCACATCGGAGGTGTATGGAGATCC 

TGAAGTCCACCCTCAAAGTGAGGATTACTGGGGCCACGTGAATCCAATAGGACCTCGGGCCTGCTACGATGAAGG 

CAAACGTGTTGCAGAGACCATGTGCTATGCCTACATGAAGCAGGAAGGCGTGGAAGTGCGAGTGGCCAGAATCTT 

CAACACCTTTGGGCCACGCATGCACATGAACGATGGGCGAGTAGTCAGCAACTTCATCCTGCAGGCGCTCCAGGG

GGAGCCACTCACGGTATACGGATCCGGGTCTCAGACAAGGGCGTTCCAGTACGTCAGCGATCTAGTGAATGGCCT

CGTGGCTCTCATGAACAGCAACGTCAGCAGCCCGGTCAACCTGGGGAACCCAGAAGAACACACAATCCTAGAATT 

TGCTCAGTTAATTAAAAACCTTGTTGGTAGCGGAAGTGAAATTCAGTTTCTCTCCGAAGCCCAGGATGACCCACA 

GAAAAGAAAACCAGACATCAAAAAAGCAAAGCTGATGCTGGGGTGGGAGCCCGTGGTCCCGCTGGAGGAAGGTTT 

AAACAAAGCAATTCACTACTTCCGTAAAGAACTCGAGTACCAGGCAAATAATCAGTACATCCCCAAACCAAAGCC 

TGCCAGAATAAAGAAAGGACGGACTCGCCACAGCT^CTCCTCACTTTTAGGACACAAGACTACCATTGTACAC 

TTGATGGGATGTATTTTTGGCTTTTTTTTGTTGTCGTTTAAAGAAAGACTTTAACAGGTGTCATGAAGAACAAAC 

TGGAATTTCATTCTGAAGCTTGCTTTAATGAAATGGATGTGCCTAAAAGCTCCCCTCAAAAAACTGCAGATTTTG 

CCTTGCACTTTTTGAATCTCTCTTTTTATGTAAAATAGCGTAGATGCATCTCTGCGTATTTTCAAGTTTTTTTAT 

CTTGCTGTGAGAGCATATGTTGTGACTGTCGTTGACAGTTTTATTTACTGGTTTCTTTGTGAAGCTGAAAAGGAA 

CATTAAGCGGGACAAAAAATGCCGATTTTATTTATAAAAGTGGGTACTTAATAAATGAGTCGTTATACTATGCAT 

AAAGAAAAATCCTAGCAGTATTGTCAGGTGGTGGTGCGCCGGCATTGATTTTAGGGCAGATAAAAGAATTCTGTG 

TGAGAGCTTTATGTTTCTCTTTTAATTCAGAGTTTTTCCAAGGTCTACTTTTGAGTTGCAAACTTGACTTTGAAA 

TATTCCTGTTGGTCATGATCAAGGATATTTGAAATCACTACTGTGTTTTGCTGCGTATCTGGGGCGGGGGCAGGT 

TGGGGGGCACAAAGTTAACATATTCTTGGTTAACCATGGTTAAATATGCTATTTTAATAAAATATTGAAACTCA 



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FIGURE 260 



MVSKALLRLVSAVNRRRMKLLLGIALLAYVASVWGNFVNMRSIQENGELKIESKIEEMVEPLR 
EKIRDLEKSFTQKYPPVKFLSEKDRKRILITGGAGFVGSHLTDKLMMDGHEVTVVDNFFTGRK
RNVEHWIGHENFELINHDVVEPLYIEVDQIYHLASPASPPNYMYNPIKTLKTNTIGTLNMLGL
AKRVGARLLLASTSEVYGDPEVHPQSEDYWGHVNPIGPRACYDEGKRVAETMCYAYMKQEGVE 
VRVARIFNTFGPRMHMNDGRVVSNFILQALQGEPLTVYGSGSQTRAFQYVSDLVNGLVALMNS 
NVSSPVNLGNPEEHTILEFAQLIKNLVGSGSEIQFLSEAQDDPQKRKPDIKKAKLMLGWEPVV
PLEEGLNKAIHYFRKELEYQANNQYIPKPKPARIKKGRTRHS 

Important features: 
Signal peptide: 

amino acids 1-32

N-glycosylation site: 
amino acids 316-320 

Tyrosine kinase phosphorylation site: 
amino acids 235-244 

N-myristoylation sites:

amino acids 35-41, 101-107, 383-389 

Amidation sites: 

amino acids 123-127, 233-237
FIGURE 261

GCGTGGTGCGGGGGCGTGGGGAAATCGGGTTGCCCCAGCCGTTACTGGTCCGCGCAGTCAGGG 
CATCCTCCGCATCCTCCACATCCTTCCATGGCTCTGAAGAATAAATTCAGTTGTTTATGGATC 
TTGGGTCTGTGTTTGGTAGCCACTACATCTTCCAAAATCCCATCCATCACTGACCCACACTTT 
ATAGACAACTGCATAGAAGCCCACAACGAATGGCGTGGCAAAGTCAACCCTCCCGCGGCCGAC 
ATGAAATACATGATTTGGGATAAAGGTTTAGCAAAGATGGCTAAAGCATGGGCAAACCAGTGC 
AAATTTGAACATAATGACTGTTTGGATAAATCATATAAATGCTATGCAGCTTTTGAATATGTT 
GGAGAAAATATCTGGTTAGGTGGAATAAAGTCATTCACACCAAGACATGCCATTACGGCTTGG 
TATAATGAAACCCAATTTTATGATTTTGATAGTCTATCATGCTCCAGAGTCTGTGGCCATTAT 
ACACAGTTAGTTTGGGCCAATTCATTTTATGTCGGTTGTGCAGTTGCAATGTGTCCTAACCTT 
GGGGGAGCTTCAACTGCAATATTTGTATGCAACTACGGACCTGCAGGAAATTTTGCAAATATG 
CCTCCTTACGCAAGAGGAGAATCTTGCTCTCTCTGCTCAAAAGAAGAGAAATGTGTAAAGAAC 
CTCTGCAGGACTCCACAACTTATTATACCTAACCAAAATCCATTTCTGAAGCCAACGGGGAGA 
GCACCTCAGCAGACAGCCTTTAATCCATTCAGCTTAGGTTTTCTTCTTCTGAGAATCTTTTAA 
TGTCATTTATATACAAAAGAAATTCTCAAATGTTAAAATAAAGGAATAGTTTATTGCTTAATA 



FIGURE 262



MALKNKFSCLWILGLCLVATTSSKIPSITDPHFIDNCIEAHNEWRGKVNPPAADMKYMIWDKG 
LAKMAKAWANQCKFEHNDCLDKSYKCYAAFEYVGENIWLGGIKSFTPRHAITAWYNETQFYDF 
DSLSCSRVCGHYTQLVWANSFYVGCAVAMCPNLGGASTAIFVCNYGPAGNFANMPPYARGESC 
SLCSKEEKCVKNLCRTPQLIIPNQNPFLKPTGRAPQQTAFNPFSLGFLLLRIF 

Important features:
Signal peptide: 

amino acids 1-23 

N-glycosylation site: 

amino acids 119-123 

N-myristoylation sites: 

amino acids 103-109,150-156,160-166,161-167,175-181 

Extracellular proteins SCP/Tpx-1/Ag5/PR-1/Sc7 signature 1:

amino acids 136-156

Extracellular proteins SCP/Tpx-1/Ag5/PR-1/Sc7 signature 2:

amino acids 166-178
FIGURE 263



CGCCCTCCGACCCGCCCCGCGGCGCATTGTGGGATCTGTCGGCTTGTCAGGTGGTGGAGGAAA 

AGGCGCTCCGTCATGGGGATCCAGACGAGCCCCGTCCTGCTGGCCTCCCTGGGGGTGGGGCTG

GTCACTCTGCTCGGCCTGGCTGTGGGCTCCTACTTGGTTCGGAGGTCCCGCCGGCCTCAGGTC 

ACTCTCCTGGACCCCAATGAAAAGTACCTGCTACGACTGCTAGACAAGACGACTGTGAGCCAC 

AACACCAAGAGGTTCCGCTTTGCCCTGCCCACCGCCCACCACACTCTGGGGCTGCCTGTGGGC 

AAACATATCTACCTCTCCACCCGAATTGATGGCAGCCTGGTCATCAGGCCATACACTCCTGTC 

ACCAGTGATGAGGATCAAGGCTATGTGGATCTTGTCATCAAGGTCTACCTGAAGGGTGTGCAC 

CCCAAATTTCCTGAGGGAGGGAAGATGTCTCAGTACCTGGATAGCCTGAAGGTTGGGGATGTG 

GTGGAGTTTCGGGGGCCAAGCGGGTTGCTCACTTACACTGGAAAAGGGCATTTTAACATTCAG 

CCCAACAAGAAATCTCCACCAGAACCCCGAGTGGCGAAGAAACTGGGAATGATTGCCGGCGGG 

ACAGGAATCACCCCAATGCTACAGCTGATCCGGGCCATCCTGAAAGTCCCTGAAGATCCAACC 

CAGTGCTTTCTGCTTTTTGCCAACCAGACAGAAAAGGATATCATCTTGCGGGAGGACTTAGAG 

GAACTGCAGGCCCGCTATCCCAATCGCTTTAAGCTCTGGTTCACTCTGGATCATCCCCCAAAA 

GATTGGGCCTACAGCAAGGGCTTTGTGACTGCCGACATGATCCGGGAACACCTGCCCGCTCCA 

GGGGATGATGTGCTGGTACTGCTTTGTGGGCCACCCCCAATGGTGCAGCTGGCCTGCCATCCC

AACTTGGACAAACTGGGCTACTCACAAAAGATGCGATTCACCTACS^GCATCCTCCAGCTTC 

CCTGGTGCTGTTCGCTGCAGTTGTTCCCCATCAGTACTCAAGCACTATAAGCCTTAGATTCCT 

TTCCTCAGAGTTTCAGGTTTTTTCAGTTACATCTAGAGCTGAAATCTGGATAGTACCTGCAGG 

AACAATATTCCTGTAGCCATGGAAGAGGGCAAGGCTCAGTCACTCCTTGGATGGCCTCCTAAA 

TCTCCCCGTGGCAACAGGTCCAGGAGAGGCCCATGGAGCAGTCTCTTCCATGGAGTAAGAAGG 

AAGGGAGCATGTACGCTTGGTCCAAGATTGGCTAGTTCCTTGATAGCATCTTACTCTCACCTT 

CTTTGTGTCTGTGATGAAAGGAACAGTCTGTGCAATGGGTTTTACTTAAACTTCACTGTTCAA 

CCTATGAGCAAATCTGTATGTGTGAGTATAAGTTGAGCATAGCATACTTCCAGAGGTGGTNTT

ATGGAGATGGCAAGAAAGGAGGAAATGATTTCTTCAGATNTCAAAGGAGTCTGAAATATCATA 

TTTCTGTGTGTGTCTCTCTCAGCCCCTGCCCAGGCTAGAGGGAAACAGCTACTGATAATCGAA 

AACTGCTGTTTGTGGCANGAACCCCTGGCTGTGCAAATAAATGGGGCTGAGGCCCCTGTGTGA 

TATTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGA
FIGURE 264

MGIQTSPVLLASLGVGLVTLLGLAVGSYLVRRSRRPQVTLLDPNEKYLLRLLDKTTVSHNTKR 
FRFALPTAHHTLGLPVGKHIYLSTRIDGSLVIRPYTPVTSDEDQGYVDLVIKVYLKGVHPKFP 
EGGKMSQYLDSLKVGDVVEFRGPSGLLTYTGKGHFNIQPNKKSPPEPRVAKKLGMIAGGTGIT 
PMLQLIRAILKVPEDPTQCFLLFANQTEKDIILREDLEELQARYPNRFKLWFTLDHPPKDWAY 
SKGFVTADMIREHLPAPGDDVLVLLCGPPPMVQLACHPNLDKLGYSQKMRFTY 

Important features: 
Signal peptide: 

amino acids 1-26 

N-glycosylation site: 

amino acids 214-218 

N-myristoylation sites: 

amino acids 22-28,76-82,128-134,180-186 



FIGURE 265



CCCGTGCCAAGAGTGACGTAAGTACCGCCTATAGAGTCTATAGGCCCACTTGGCTTCGTTAGA 

ACGCGGCTACAATTAATACATAACCTTATGTATCATACACATACGATTTAGGTGACACTATAG 

AATAACATCCACTTTGCCTTTCTCTCCACAGGTGTCCACTCCCAGGTCCAACTGCACCTCGGT 

TCTATCGATAATCTCAGCACCAGCCACTCAGAGCAGGGCACGATGTTGGGGGCCCGCCTCAGG

CTCTGGGTCTGTGCCTTGTGCAGCGTCTGCAGCATGAGCGTCCTCAGAGCCTATCCCAATGCC 

TCCCCACTGCTCGGCTCCAGCTGGGGTGGCCTGATCCACCTGTACACAGCCACAGCCAGGAAC 

AGCTACCACCTGCAGATCCACAAGAATGGCCATGTGGATGGCGCACCCCATCAGACCATCTAC 

AGTGCCCTGATGATCAGATCAGAGGATGCTGGCTTTGTGGTGATTACAGGTGTGATGAGCAGA 

AGATACCTCTGCATGGATTTCAGAGGCAACATTTTTGGATCACACTATTTCGACCCGGAGAAC 

TGCAGGTTCCAACACCAGACGCTGGAAAACGGGTACGACGTCTACCACTCTCCTCAGTATCAC 

TTCCTGGTCAGTCTGGGCCGGGCGAAGAGAGCCTTCCTGCCAGGCATGAACCCACCCCCGTAC 

TCCCAGTTCCTGTCCCGGAGGAACGAGATCCCCCTAATTCACTTCAACACCCCCATACCACGG 

CGGCACACCCGGAGCGCCGAGGACGACTCGGAGCGGGACCCCCTGAACGTGCTGAAGCCCCGG 

GCCCGGATGACCCCGGCCCCGGCCTCCTGTTCACAGGAGCTCCCGAGCGCCGAGGACAACAGC 

CCGATGGCCAGTGACCCATTAGGGGTGGTCAGGGGCGGTCGAGTGAACACGCACGCTGGGGGA 

ACGGGCCCGGAAGGCTGCCGCCCCTTCGCCAAGTTCATCTAGGGTCGCTKC;
FIGURE 266

MLGARLRLWVCALCSVCSMSVLRAYPNASPLLGSSWGGLIHLYTATARNSYHLQIHKNGHVDG 


APHQTIYSALMIRSEDAGFVVITGVMSRRYLCMDFRGNIFGSHYFDPENCRFQHQTLENGYDV 
YHSPQYHFLVSLGRAKRAFLPGMNPPPYSQFLSRRNEIPLIHFNTPIPRRHTRSAEDDSERDP 
LNVLKPRARMTPAPASCSQELPSAEDNSPMASDPLGVVRGGRVNTHAGGTGPEGCRPFAKFI

Important features:
Signal peptide:
amino acids 1-24

cAMP- and cGMP-dependent protein kinase phosphorylation site:

amino acids 175-179

N-myristoylation sites:

amino acids 33-39, 100-106, 225-231, 229-235

HBGF/FGF family proteins:

amino acids 73-124
FIGURE 267

GGCTGAGGGGAG6CCCGGAGCCTTTCT(Ma;CCTG03GGATCCTCTTGCACTGGTGGGTGGAGAGAAGCGCCTGC 

AGCCAACCAGGGTO^GGCTGTGCTCACAGTTTCCTCTGGCGGCATGTAAAGGCTCCACAAAGGAGTTGGGAGTTC 

AAATGAGGCTGCTGCGGACGGCCTGAGGATGGACCCCAAGCCCTGGACCTGCCGAGCGTGGCACTGAGGCAGCGG 

CTGACGCTACTGTGAGGGAAAGAAGGTTGTGAGCAGCCCCGCAGGACCCCTGGCCAGCCCTGGCCCCAGCCTCTG 

CCGGAGCCCTCTGTGGAGGCAGAGCCAGTGGAGCCCAGTGAGGCAGGGCTGCTTGGCAGCCACCGGCCTGCAACT 

CAGGAACCCCTCCAGAGGCCATGGACAGGCTGCCCCGCTGACGGCCAGGGTGAAGCATGTGAGGAGCCGCCCCGG 

AGCCAAGCAGGAGGGAAGAGGCTTTCATAGATTCTATTCACAAAGAATAACCACCATTTTGCAAGGACCATGAGG

CCACTGTGCGTGACATGCTGGTGGCTCGGACTGCTGGCTGCCATGGGAGCTGTTGCAGGCCAGGAGGACGGTTTT 

GAGGGCACTGAGGAGGGCTCGCCAAGAGAGTTCATTTACCTAAACAGGTACAAGCGGGCGGGCGAGTCCCAGGAC 

AAGTGCACCTACACCTTCATTGTGCCCCAGCAGCGGGTCACGGGTGCCATCTGCGTCAACTCCAAGGAGCCTGAG 

GTGCTTCTGGAGAACCGAGTGCATAAGCAGGAGCTAGAGCTGCTCAACAATGAGCTGCTCAAGCAGAAGCGGCAG 

ATCGAGACGCTGCAGCAGCTGGTGGAGGTGGACGGCGGCATTGTGAGCGAGGTGAAGCTGCTGCGCAAGGAGAGC 

CGCAACATGAACTCGCGGGTCACGCAGCTCTACATGCAGCTCCTGCACGAGATCATCCGCAAGCGGGACAACGCG 

TTGGAGCTCTCCCAGCTGGAGAACAGGATCCTGAACCAGACAGCCGACATGCTGCAGCTGGCCAGCAAGTACAAG 

GACCTGGAGCACAAGTACCAGCACCTGGCCACACTGGCCCACAACCAATCAGAGATCATCGCGCAGCTTGAGGAG

CACTGCCAGAGGGTGCCCTCGGCCAGGCCCGTCCCCCAGCCACCCCCCGCTGCCCCGCCCCGGGTCTACCAACCA 

CCCACCTACAACCGCATCATOU^CCAGATCTCTACCAACGAGATCCAGAGTGACawa^CCTGAAGGTGCTGC 

CCCCCTCTGCCCACTATGCCCACTCTCACCAGCCTCCCATCTTCCACCGACAAGCCGTCGGGCCCATGGAGAGAC 

TGCCTGCAGGCCCTGGAGGATGGCCACGACACCAGCTCCATCTACCTGGTGAAGCCGGAGAACACCAACCGCCTC 

ATGCAGGTGTGGTGCGACCAGAGACACGACCCCGGGGGCTGGACCGTCATCCAGAGACGCCTGGATGGCTCTGTT 

AACTTCTTCAGGAACTGGGAGACGTACAAGCAAGGGTTTGGGAACATTGACGGCGAATACTGGCTGGGCCTGGAG 

AACATTTACTGGCTGACGAACCAAGGCAACTACAAACTCCTGGTGACCATGGAGGACTGGTCCGGCCGCAAAGTC 

TTTGCAGAATACGCCAGTTTCCGCCTGGAACCTGAGAGCGAGTATTATAAGCTGCGGCTGGGGCGCTACCATGGC

AATGCGGGTGACTCCTTTACATGGCACAACGGCAAGCAGTTCACCACCCTGGACAGAGATCATGATGTCTACACA 

GGAAACTGTGCCCACTACCAGAAGGGAGGCTGGTGGTATAACGCCTGTGCCCACTCCAACCTCAACGGGGTCTGG 

TACCGCGGGGGCCATTACCGGAGCCGCTACCAGGACGGAGTCTACTGGGCTGAGTTCCGAGGAGGCTCTTACTCA 

CTCAAGAAAGTGGTGATGATGATCCGACCGAACCCCAACACCTTCCACmGCCAGCTCCCCCTCCTGACCTCTC 

GTGGCCATTGCCAGGAGCCCACCCTGGTCACGCTGGCCACAGCACAAAGAACAACTCCTCACCAGTTCATCCTGA 

GGCTGGGAGGACCGGGATGCTGGATTCTGTTTTCCGAAGTCACTGCAGCGGATGATGGAACTGAATCGATACGGT 

GTTTTCTGTCCCTCCTACTTTCCTTCACACCAGACAGCCCCTCATGTCTCCAGGACAGGACAGGACTACAGACAA 
CTCTTTCTTTAAATAAATTAAGTCTCTACAATAAAAAAAA
FIGURE 268

MRPLCVTCWWLGLLAAMGAVAGQEDGFEGTEEGSPREFIYLNRYKRAGESQDKCTYTFIVPQQ
RVTGAICVNSKEPEVLLENRVHKQELELLNNELLKQKRQIETLQQLVEVDGGIVSEVKLLRKE
SRNMNSRVTQLYMQLLHEIIRKRDNALELSQLENRILNQTADMLQLASKYKDLEHKYQHLATL
AHNQSEIIAQLEEHCQRVPSARPVPQPPPAAPPRVYQPPTYNRIINQISTNEIQSDQNLKVLP
PPLPTMPTLTSLPSSTDKPSGPWRDCLQALEDGHDTSSIYLVKPENTNRLMQVWCDQRHDPGG
WTVIQRRLDGSVNFFRNWETYKQGFGNIDGEYWLGLENIYWLTNQGNYKLLVTMEDWSGRKVF
AEYASFRLEPESEYYKLRLGRYHGNAGDSFTWHNGKQFTTLDRDHDVYTGNCAHYQKGGWWYN
ACAHSNLNGVWYRGGHYRSRYQDGVYWAEFRGGSYSLKKVVMMIRPNPNTFH

Important features: 

Signal peptide: 
amino acids 1-22 

N-glycosylation sites: 
amino acids 164-168,192-196 

cAMP- and cGMP-dependent protein kinase phosphorylation site:
amino acids 124-128 

Tyrosine kinase phosphorylation sites:

amino acids 177-184,385-393,385-394,461-468 

N-myristoylation sites: 

amino acids 12-18,18-24,22-28,29-35,114-120,341-347,465-471, 
473-479 

Amidation site: 

amino acids 373-377 

Fibrinogen beta and gamma chains C-terminal domain signature: 

amino acids 438-451 

Fibrinogen beta and gamma chains C-terminal domain proteins:

amino acids 305-343, 365-402, 411-424, 428-458 

Trehalase proteins: 

amino acids 275-292
FIGURE 269

GCCGAGCTGAGCGGATCCTCACATGACTGTGATCCGATTCTTTCCAGCGGCTTCTGCAACCAA 

GCGGGTCTTACCCCCGGTCCTCCGCGTCTCCAGTCCTCGCACCTGGAACCCCAACGTCCCCGA 

GAGTCCCCGAATCCCCGCTCCCAGGCTACCTAAGAGGATGAGCGGTGCTCCGACGGCCGGGGC 

AGCCCTGATGCTCTGCGCCGCCACCGCCGTGCTACTGAGCGCTCAGGGCGGACCCGTGCAGTC 

CAAGTCGCCGCGCTTTGCGTCCTGGGACGAGATGAATGTCCTGGCGCACGGACTCCTGCAGCT 

CGGCCAGGGGCTGCGCGAACACGCGGAGCGCACCCGCAGTCAGCTGAGCGCGCTGGAGCGGCG 

CCTGAGCGCGTGCGGGTCCGCCTGTCAGGGAACCGAGGGGTCCACCGACCTCCCGTTAGCCCC 

TGAGAGCCGGGTGGACCCTGAGGTCCTTCACAGCCTGCAGACACAACTCAAGGCTCAGAACAG 

CAGGATCCAGCAACTCTTCCACAAGGTGGCCCAGCAGCAGCGGCACCTGGAGAAGCAGCACCT 

GCGAATTCAGCATCTGCAAAGCCAGTTTGGCCTCCTGGACCACAAGCACCTAGACCATGAGGT

GGCCAAGCCTGCCCGAAGAAAGAGGCTGCCCGAGATGGCCCAGCCAGTTGACCCGGCTCACAA 

TGTCAGCCGCCTGCACCGGCTGCCCAGGGATTGCCAGGAGCTGTTCCAGGTTGGGGAGAGGCA 

GAGTGGACTATTTGAAATCCAGCCTCAGGGGTCTCCGCCATTTTTGGTGAACTGCAAGATGAC 

CTCAGATGGAGGCTGGACAGTAATTCAGAGGCGCCACGATGGCTCAGTGGACTTCAACCGGCC 

CTGGGAAGCCTACAAGGCGGGGTTTGGGGATCCCCACGGCGAGTTCTGGCTGGGTCTGGAGAA 

GGTGCATAGCATCACGGGGGACCGCAACAGCCGCCTGGCCGTGCAGCTGCGGGACTGGGATGG 

CAACGCCGAGTTGCTGCAGTTCTCCGTGCACCTGGGTGGCGAGGACACGGCCTATAGCCTGCA 

GCTCACTGCACCCGTGGCCGGCCAGCTGGGCGCCACCACCGTCCCACCCAGCGGCCTCTCCGT 

ACCCTTCTCCACTTGGGACCAGGATCACGACCTCCGCAGGGACAAGAACTGCGCCAAGAGCCT 

CTCTGGAGGCTGGTGGTTTGGCACCTGCAGCCATTCCAACCTCAACGGCCAGTACTTCCGCTC

CATCCCACAGCAGCGGCAGAAGCTTAAGAAGGGAATCTTCTGGAAGACCTGGCGGGGCCGCTA 

CTACCCGCTGCAGGCCACCACCATGTTGATCCAGCCCATGGCAGCAGAGGCAGCCTCCTAGCG 

TCCTGGCTGGGCCTGGTCCCAGGCCCACGAAAGACGGTGACTCTTGGCTCTGCCCGAGGATGT 

GGCCGTTCCCTGCCTGGGCAGGGGCTCCAAGGAGGGGCCATCTGGAAACTTGTGGACAGAGAA 

GAAGACCACGACTGGAGAAGCCCCCTTTCTGAGTGCAGGGGGGCTGCATGCGTTGCCTCCTGA 

GATCGAGGCTGCAGGATATGCTCAGACTCTAGAGGCGTGGACCAAGGGGCATGGAGCTTCACT 

CCTTGCTGGCCAGGGAGTTGGGGACTCAGAGGGACCACTTGGGGCCAGCCAGACTGGCCTCAA 

TGGCGGACTCAGTCACATTGACTGACGGGGACCAGGGCTTGTGTGGGTCGAGAGCGCCCTCAT 

GGTGCTGGTGCTGTTGTGTGTAGGTCCCCTGGGGACACAAGCAGGCGCCAATGGTATCTGGGC 

GGAGCTCACAGAGTTCTTGGAATAAAAGCAACCTCAGAACAC 



FIGURE 270



MTVIRFFPAASATKRVLPPVLRVSSPRTWNPNVPESPRIPAPRLPKRMSGAPTAGAALMLCAA 

TAVLLSAQGGPVQSKSPRFASWDEMNVLAHGLLQLGQGLREHAERTRSQLSALERRLSACGSA 

CQGTEGSTDLPLAPESRVDPEVLHSLQTQLKAQNSRIQQLFHKVAQQQRHLEKQHLRIQHLQS 

QFGLLDHKHLDHEVAKPARRKRLPEMAQPVDPAHNVSRLHRLPRDCQELFQVGERQSGLFEIQ 

PQGSPPFLVNCKMTSDGGWTVIQRRHDGSVDFNRPWEAYKAGFGDPHGEFWLGLEKVHSITGD 

RNSRLAVQLRDWDGNAELLQFSVHLGGEDTAYSLQLTAPVAGQLGATTVPPSGLSVPFSTWDQ 

DHDLRRDKNCAKSLSGGWWFGTCSHSNLNGQYFRSIPQQRQKLKKGIFWKTWRGRYYPLQATT
MLIQPMAAEAAS 

Important features: 
Signal peptide: 

Amino acids 1-13 

Transmembrane domain: 

Amino acids 53-70

N-glycosylation site: 

Amino acids 224-228 

cAMP- and cGMP-dependent protein kinase phosphorylation sites: 

Amino acids 46-50; 118-122

N-myristoylation sites: 

Amino acids 50-56;129-135;341-347;357-363 

Fibrinogen beta and gamma chains C-terminal domain signature: 

Amino acids 396-409
FIGURE 271



CGGACGCGTGGGGGAAACCCTTCCGAGAAAACAGCAACAAGCTGAGCTGCTGTGACAGAGGGG 

AACAAGATGGCGGCGCCGAAGGGGAGCCTCTGGGTGAGGACCCAACTGGGGCTCCCGCCGCTG

CTGCTGCTGACCATGGCCTTGGCCGGAGGTTCGGGGACCGCTTCGGCTGAAGCATTTGACTCG 

GTCTTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAGTTGACCTACCCCTTGCACACCTAC 

CCTAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTGCAGGCTGTTTTCAATTTGTCAGTTT 

GTGGATGATGGAATTGACTTAAATCGAACTAAATTGGAATGTGAATCTGCATGTACAGAAGCA 

TATTCCCAATCTGATGAGCAATATGCTTGCCATCTTGGTTGCCAGAATCAGCTGCCATTCGCT 

GAACTGAGACAAGAACAACTTATGTCCCTGATGCCAAAAATGCACCTACTCTTTCCTCTAACT 

CTGGTGAGGTCATTCTGGAGTGACATGATGGACTCCGCACAGAGCTTCATAACCTCTTCATGG 

ACTTTTTATCTTCAAGCCGATGACGGAAAAATAGTTATATTCCAGTCTAAGCCAGAAATCCAG 

TACGCACCACATTTGGAGCAGGAGCCTACAAATTTGAGAGAATCATCTCTAAGCAAAATGTCC 

TATCTGCAAATGAGAAATTCACAAGCGCACAGGAATTTTCTTGAAGATGGAGAAAGTGATGGC 

TTTTTAAGATGCCTCTCTCTTAACTCTGGGTGGATTTTAACTACAACTCTTGTCCTCTCGGTG

ATGGTATTGCTTTGGATTTGTTGTGCAACTGTTGCTACAGCTGTGGAGCAGTATGTTCCCTCT 

GAGAAGCTGAGTATCTATGGTGACTTGGAGTTTATGAATGAACAAAAGCTAAACAGATATCCA 

GCTTCTTCTCTTGTGGTTGTTAGATCTAAAACTGAAGATCATGAAGAAGCAGGGCCTCTACCT 

ACAAAAGTGAATCTTGCTCATTCTGAAATTTAAGCATTTTTCTTTTAAAAGACAAGTGTAATA 

GACATCTAAAATTCCACTCCTCATAGAGCTTTTAAAATGGTTTCATTGGATATAGGCCTTAAG 

AAATCACTATAAAATGCAAATAAAGTTACTCAAATCTGTG
FIGURE 272



MAAPKGSLWVRTQLGLPPLLLLTMALAGGSGTASAEAFDSVLGDTASCHRACQLTYPLHTYPK 

EEELYACQRGCRLFSICQFVDDGIDLNRTKLECESACTEAYSQSDEQYACHLGCQNQLPFAEL 

RQEQLMSLMPKMHLLFPLTLVRSFWSDMMDSAQSFITSSWTFYLQADDGKIVIFQSKPEIQYA 

PHLEQEPTNLRESSLSKMSYLQMRNSQAHRNFLEDGESDGFLRCLSLNSGWILTTTLVLSVMV 

LLWICCATVATAVEQYVPSEKLSIYGDLEFMNEQKLNRYPASSLVVVRSKTEDHEEAGPLPTK 
VNLAHSEI 

Important features: 
Signal peptide: 

amino acids 1-31 

Transmembrane domain: 

amino acids 241-260 

N-glycosylation site: 

amino acids 90-94 

N-myristoylation sites: 

amino acids 28-34,29-35,31-37,86-92
FIGURE 273

CCCACGCGTCCGAACCTCTCCAGCGATGGGAGCCGCCCGCCTGCTGCCCAACCTCACTCTGTG

CTTACAGCTGCTGATTCTCTGCTGTCAAACTCAGTACGTGAGGGACCAGGGCGCCATGACCGA 

CCAGCTGAGCAGGCGGCAGATCCGCGAGTACCAACTCTACAGCAGGACCAGTGGCAAGCACGT 

GCAGGTCACCGGGCGTCGCATCTCCGCCACCGCCGAGGACGGCAACAAGTTTGCCAAGCTCAT 

AGTGGAGACGGACACGTTTGGCAGCCGGGTTCGCATCAAAGGGGCTGAGAGTGAGAAGTACAT 

CTGTATGAACAAGAGGGGCAAGCTCATCGGGAAGCCCAGCGGGAAGAGCAAAGACTGCGTGTT 

CACGGAGATCGTGCTGGAGAACAACTATACGGCCTTCCAGAACGCCCGGCACGAGGGCTGGTT 

CATGGCCTTCACGCGGCAGGGGCGGCCCCGCCAGGCTTCCCGCAGCCGCCAGAACCAGCGCGA 

GGCCCACTTCATCAAGCGCCTCTACCAAGGCCAGCTGCCCTTCCCCAACCACGCCGAGAAGCA 

GAAGCAGTTCGAGTTTGTGGGCTCCGCCCCCACCCGCCGGACCAAGCGCACACGGCGGCCCCA 

GCCCCTCACGTAGTCTGGGAGGCAGGGGGCAGCAGCCCCTGGGCCGCCTCCCCACCCCTTTCC 

CTTCTTAATCCAAGGACTGGGCTGGGGTGGCGGGAGGGGAGCCAGATCCCCGAGGGAGGACCC 

TGAGGGCCGCGAAGCATCCGAGCCCCCAGCTGGGAAGGGGCAGGCCGGTGCCCCAGGGGCGGC 

TGGCACAGTGCCCCCTTCCCGGACGGGTGGCAGGCCCTGGAGAGGAACTGAGTGTCACCCTGA 

TCTCAGGCCACCAGCCTCTGCCGGCCTCCCAGCCGGGCTCCTGAAGCCCGCTGAAAGGTCAGC 

GACTGAAGGCCTTGCAGACAACCGTCTGGAGGTGGCTGTCCTCAAAATCTGCTTCTCGGATCT 

CCCTCAGTCTGCCCCCAGCCCCCAAACTCCTCCTGGCTAGACTGTAGGAAGGGACTTTTGTTT 

GTTTGTTTGTTTCAGGAAAAAAGAAAGGGAGAGAGAGGAAAATAGAGGGTTGTCCACTCCTCA 

CATTCCACGACCCAGGCCTGCACCCCACCCCCAACTCCCAGCCCCGGAATAAAACCATTTTCC 
TGC
FIGURE 274



MGAARLLPNLTLCLQLLILCCQTQYVRDQGAMTDQLSRRQIREYQLYSRTSGKHVQVTGRRIS 



ATAEDGNKFAKLIVETDTFGSRVRIKGAESEKYICMNKRGKLIGKPSGKSKDCVFTEIVLENN 
YTAFQNARHEGWFMAFTRQGRPRQASRSRQNQREAHFIKRLYQGQLPFPNHAEKQKQFEFVGS 
APTRRTKRTRRPQPLT 

Important features: 
Signal peptide: 

Amino acids 1-22 

N-glycosylation sites:
amino acids 9-13, 126-130

cAMP- and cGMP-dependent protein kinase phosphorylation site:

amino acids 60-64

Casein kinase II phosphorylation site:

amino acids 65-69

Tyrosine kinase phosphorylation sites:

amino acids 39-48, 89-97

N-myristoylation sites:

amino acids 69-75, 188-194

Amidation site:

amino acids 58-62

HBGF/FGF family signature:

amino acids 103-128
FIGURE 275



TATTTACCATATCAGATTCACATTCAGTCCTCAGCAAAATGAAGGGCTCCATTTTCACTCTGT

TTTTATTCTCTGTCCTATTTGCCATCTCAGAAGTGCGGAGCAAGGAGTCTGTGAGACTCTGTG 

GGCTAGAATACATACGGACAGTCATCTATATCTGTGCTAGCTCCAGGTGGAGAAGGCATCTGG 

AGGGGATCCCTCAAGCTCAGCAAGCTGAGACAGGAAACTCCTTCCAGCTCCCACATAAACGTG 

AGTTTTCTGAGGAAAATCCAGCGCAAAACCTTCCGAAGGTGGATGCCTCAGGGGAAGACCGTC 

TTTGGGGTGGACAGATGCCCACTGAAGAGCTTTGGAAGTCAAAGAAGCATTCAGTGATGTCAA 

GACAAGATTTACAAACTTTGTGTTGCACTGATGGCTGTTCCATGACTGATTTGAGTGCTCTTT 

GCTAAGACAAGAGCAAATACCCAATGGGTGGCAGAGCTTTATCACATGTTTAATTACAGTGTT

TTACTGCCTGGTAGAACACTAATATTGTGTTATTAAAATGATGGCTTTTGGGTAGGCAAAACT 

TCTTTTCTAAAAGGTATAGCTGAGCGGTTGAAACCACAGTGATCTCTATTTTCTCCCTTTGCC 

AAGGTTAATGAACTGTTCTTTTCAAATTCTACTAATGCTTTGAAATTTCAAATGCTGCGCAAA 
ATTGCAATAAAAATGCTATAAA
FIGURE 276



MKGSIFTLFLFSVLFAISEVRSKESVRLCGLEYIRTVIYICASSRWRRHLEGIPQAQQAETGN 



SFQLPHKREFSEENPAQNLPKVDASGEDRLWGGQMPTEELWKSKKHSVMSRQDLQTLCCTDGC 
SMTDLSALC 

Important features: 
Signal sequence:

amino acids 1-18 

cAMP- and cGMP-dependent protein kinase phosphorylation site:

amino acids 107-111 

N-myristoylation sites: 

amino acids 3-9,52-58,96-102,125-131 

Insulin family signature: 

amino acids 121-136 



Insulin family proteins:

amino acids 28-46
FIGURE 277



GCAGCTGGTTACTGCATTTCTCCATGTGGCAGACAGAGCAAAGCCACAACGCTTTCTCTGCTGGATTAAAGACGG 

CCCACAGACCAGAACTTCCACTATACTACTTAAAATTACATAGGTGGCTTGTCAAATTCAATTGATTAGTATTGT 

AAAAGGAAAAAGAAGTTCCTTCTTACAGCTTGGATTCAACGGTCCAAAACAAAAATGCAGCTGCCATTAAAGTCT 

CAGATGAACAAACTTCTACACTGATTTTTAAAATCAAGAATAAGGGCAGCAAGTTTCTGGATTCACTGAATCAAC 

AGACACAAAAAGCTGGCAATATAGCAACTATGAAGAGAAAAGCTACTAATAAAATTAACCCAACGCATAGAAGAC 

TTTTTTTTCTCTTCTAAAAACAACTAAGTAAAGACTTAAATTTAAACACATCATTTTACAACCTCATTTCAAAAT

GAAGACTTTTACCTGGACCCTAGGTGTGCTATTCTTCCTACTAGTGGACACTGGACATTGCAGAGGTGGACAATT 

CAAAATTAAAAAAATAAACCAGAGAAGATACCCTCGTGCCACAGATGGTAAAGAGGAAGCAAAGAAATGTGCATA 

CACATTCCTGGTACCTGAACAAAGAATAACAGGGCCAATCTGTGTCAACACCAAGGGGCAAGATGCAAGTACCAT 

TAAAGACATGATCACCAGGATGGACCTTGAAAACCTGAAGGATGTGCTCTCCAGGCAGAAGCGGGAGATAGATGT 

TCTGCAACTGGTGGTGGATGTAGATGGAAACATTGTGAATGAGGTAAAGCTGCTGAGAAAGGAAAGCCGTAACAT 

GAACTCTCGTGTTACTCAACTCTATATGCAATTATTACATGAGATTATCCGTAAGAGGGATAATTCACTTGAACT 

TTCCCAACTGGAAAACAAAATCCTCAATGTCACCACAGAAATGTTGAAGATGGCAACAAGATACAGGGAACTAGA 

GGTGAAATACGCTTCCTTGACTGATCTTGTCAATAACCAATCTGTGATGATCACTTTGTTGGAAGAACAGTGCTT 

GAGGATATTTTCCCGACAAGACACCCATGTGTCTCCCCCACTTGTCCAGGTGGTGCCACAACATATTCCTAACAG 

CCAACAGTATACTCCTGGTCTGCTGGGAGGTAACGAGATTCAGAGGGATCCAGGTTATCCCAGAGATTTAATGCC 

ACCACCTGATCTGGCAACTTCTCCCACCAAAAGCCCTTTCAAGATACCACCGGTAACTTTCATCAATGAAGGACC 

ATTCAAAGACTGTCAGCAAGCAAAAGAAGCTGGGCATTCGGTCAGTGGGATTTATATGATTAAACCTGAAAACAG 

CAATGGACCAATGCAGTTATGGTGTGAAAACAGTTTGGACCCTGGGGGTTGGACTGTTATTCAGAAAAGAACAGA 

CGGCTCTGTCAACTTCTTCAGAAATTGGGAAAATTATAAGAAAGGGTTTGGAAACATTGACGGAGAATACTGGCT 

TGGACTGGAAAATATCTATATGCTTAGCAATCAAGATAATTACAAGTTATTGATTGAATTAGAAGACTGGAGTGA 

TAAAAAAGTCTATGCAGAATACAGCAGCTTTCGTCTGGAACCTGAAAGTGAATTCTATAGACTGCGCCTGGGTAC

TTACCAGGGAAATGCAGGGGATTCTATGATGTGGCATAATGGTAAACAATTCACCACACTGGACAGAGATAAAGA 

TATGTATGCAGGAAACTGCGCCCACTTTCATAAAGGAGGCTGGTGGTACAATGCCTGTGCACATTCTAACCTAAA 

TGGAGTATGGTACAGAGGAGGCCATTACAGAAGCAAGCACCAAGATGGAATTTTCTGGGCCGAATACAGAGGCGG 

GTCATACTCCTTAAGAGCAGTTCAGATGATGATCAAGCCTATTGACT^GAGAGACACTCGCCAATTTAAATGA 

CACAGAACTTTGTACTTTTCAGCTCTTAAAAATGTAAATGTTACATGTATATTACTTGGCACAATTTATTTCTAC 

ACAGAAAGTTTTTAAAATGAATTTTACCGTAACTATAAAAGGGAACCTATAAATGTAGTTTCATCTGTCGTCAAT 

TACTGCAGAAAATTATGTGTATCCACAACCTAGTTATTTTAAAAATTATGTTGACTAAATACAAAGTTTGTTTTC 

TAAAATGTAAATATTTGCCACAATGTAAAGCAAATCTTAGCTATATTTTAAATCATAAATAACATGTTCAAGATA 

CTTAACAATTTATTTAAAATCTAAGATTGCTCTAACGTCTAGTGAAAAAAATATTTTTTAAATTTCAGCCAAATA 

ATGCATTTTATTTTATAAAAATACAGACAGAAAATTAGGGAGAAACTTCTAGTTTTGCCAATAGAAAATGTTCTT 

CCATTGAATAAAAGTTATTTCAAATTGAATTTGTGCCTTTCACACGTAATGATTAAATCTGAATTCTTAATAATA 

TATCCTATGCTGATTTTCCCAAAACATGACCCATAGTATTAAATACATATCATTTTTAAAAATAAAAAAAAACCC 

AAAAATAATGCATGCATAATTTAAATGGTCAATTTATAAAGACAAATCTATGAATGAATTTTTCAGTGTTATCTT 

CATATGATATGCTGAACACCAAAATCTCCAGAAATGCATTTTATGTAGTTCTAAAATCAGCAAAATATTGGTATT 

ACAAAAATGCAGAATATTTAGTGTGCTACAGATCTGAATTATAGTTCTAATTTATTATTACTTTTTTTCTAATTT 

ACTGATCTTACTACTACAAAGAAAAAAAAACCCAACCCATCTGCAATTCAAATCAGAAAGTTTGGACAGCTTTAC 

AAGTATTAGTGCATGCTCAGAACAGGTGGGACTAAAACAAACTCAAGGAACTGTTGGCTGTTTTCCCGATACTGA 

GAATTCAACAGCTCCAGAGCAGAAGCCACAGGGGCATAGCTTAGTCCAAACTGCTAATTTCATTTTACAGTGTAT 

GTAACGCTTAGTCTCACAGTGTCTTTAACTCATCTTTGCAATCAACAACTTTACTAGTGACTTTCTGGAACAATT 

TCCTTTCAGGAATACATATTCACTGCTTAGAGGTGACCTTGCCTTAATATATTTGTGAAGTTAAAATTTTAAAGA 

TAGCTCATGAAACTTTTGCTTAAGCAAAAAGAAAACCTCGAATTGAAATGTGTGAGGCAAACTATGCATGGGAAT 

AGCTTAATGTGAAGATAATCATTTGGACAACTCAAATCCATCAACATGACCAATGTTTTTCATCTGCCACATCTC 

AAAATAAAACTTCTGGTGAAACAAATTAAACAAAATATCCAAACCTCAAAAAAAA
FIGURE 278

MKTFTWTLGVLFFLLVDTGHCRGGQFKIKKINQRRYPRATDGKEEAKKCAYTFLVPEQRITGP 
ICVNTKGQDASTIKDMITRMDLENLKDVLSRQKREIDVLQLVVDVDGNIVNEVKLLRKESRNM
NSRVTQLYMQLLHEIIRKRDNSLELSQLENKILNVTTEMLKMATRYRELEVKYASLTDLVNNQ
SVMITLLEEQCLRIFSRQDTHVSPPLVQVVPQHIPNSQQYTPGLLGGNEIQRDPGYPRDLMPP
PDLATSPTKSPFKIPPVTFINEGPFKDCQQAKEAGHSVSGIYMIKPENSNGPMQLWCENSLDP
GGWTVIQKRTDGSVNFFRNWENYKKGFGNIDGEYWLGLENIYMLSNQDNYKLLIELEDWSDKK
VYAEYSSFRLEPESEFYRLRLGTYQGNAGDSMMWHNGKQFTTLDRDKDMYAGNCAHFHKGGWW
YNACAHSNLNGVWYRGGHYRSKHQDGIFWAEYRGGSYSLRAVQMMIKPID

Important features:
Signal sequence: 

Amino acids 1-23 

N-glycosylation sites: 
Amino acids 160-164;188-192 

cAMP- and cGMP-dependent protein kinase phosphorylation site:

Amino acids 120-124 

Tyrosine kinase phosphorylation sites: 

Amino acids 173-180; 387-396 

N-myristoylation sites: 

Amino acids 70-76; 110-116; 232-238; 343-349; 400-406; 467-473;
475-487 

Fibrinogen beta and gamma chains C-terminal domain signature: 

Amino acids 440-453
FIGURE 279

CCCACGCGTCCGCGCAGTCGCGCAGTTCTGCCTCCGCCTGCCAGTCTCGCCCGCGATCCCGGC 

CCGGGGCTGTGGCGTCGACTCCGACCCAGGCAGCCAGCAGCCCGCGCGGGAGCCGGACCGCCG 

CCGGAGGAGCTCGGACGGCATGCTGAGCCCCCTCCTTTGCTGAAGCCCGAGTGCGGAGAAGCC 

CGGGCAAACGCAGGCTAAGGAGACCAAAGCGGCGAAGTCGCGAGACAGCGGACAAGCAGCGGA 

GGAGAAGGAGGAGGAGGCGAACCCAGAGAGGGGCAGCAAAAGAAGCGGTGGTGGTGGGCGTCG 

TGGCCATGGCGGCGGCTATCGCCAGCTCGCTCATCCGTCAGAAGAGGCAAGCCCGCGAGCGCG 

AGAAATCCAACGCCTGCAAGTGTGTCAGCAGCCCCAGCAAAGGCAAGACCAGCTGCGACAAAA 

ACAAGTTAAATGTCTTTTCCCGGGTCAAACTCTTCGGCTCCAAGAAGAGGCGCAGAAGAAGAC 

CAGAGCCTCAGCTTAAGGGTATAGTTACCAAGCTATACAGCCGACAAGGCTACCACTTGCAGC 

TGCAGGCGGATGGAACCATTGATGGCACCAAAGATGAGGACAGCACTTACACTCTGTTTAACC 

TCATCCCTGTGGGTCTGCGAGTGGTGGCTATCCAAGGAGTTCAAACCAAGCTGTACTTGGCAA 

TGAACAGTGAGGGATACTTGTACACCTCGGAACTTTTCACACCTGAGTGCAAATTCAAAGAAT 

CAGTGTTTGAAAATTATTATGTGACATATTCATCAATGATATACCGTCAGCAGCAGTCAGGCC 

GAGGGTGGTATCTGGGTCTGAACAAAGAAGGAGAGATCATGAAAGGCAACCATGTGAAGAAGA 

ACAAGCCTGCAGCTCATTTTCTGCCTAAACCACTGAAAGTGGCCATGTACAAGGAGCCATCAC 

TGCACGATCTCACGGAGTTCTCCCGATCTGGAAGCGGGACCCCAACCAAGAGCAGAAGTGTCT 

CTGGCGTGCTGAACGGAGGCAAATCCATGAGCCACAATGAATCAACGiaSCCAGTGAGGGCAA 

AAGAAGGGCTCTGTAACAGAACCTTACCTCCAGGTGCTGTTGAATTCTTCTAGCAGTCCTTCA 

CCCAAAAGTTCAAATTTGTCAGTGACATTTACCAAACAAACAGGCAGAGTTCACTATTCTATC

TGCCATTAGACCTTCTTATCATCCATACTAAAGC
FIGURE 280

MAAAIASSLIRQKRQAREREKSNACKCVSSPSKGKTSCDKNKLNVFSRVKLFGSKKRRRRRPE 
PQLKGIVTKLYSRQGYHLQLQADGTIDGTKDEDSTYTLFNLIPVGLRVVAIQGVQTKLYLAMN
SEGYLYTSELFTPECKFKESVFENYYVTYSSMIYRQQQSGRGWYLGLNKEGEIMKGNHVKKNK 
PAAHFLPKPLKVAMYKEPSLHDLTEFSRSGSGTPTKSRSVSGVLNGGKSMSHNEST 

Important features:
N-glycosylation site:
Amino acids 242-246

Glycosaminoglycan attachment sites: 

Amino acids 165-169, 218-222 

Tyrosine kinase phosphorylation site: 

Amino acids 93-100 

N-myristoylation sites: 

Amino acids 87-93, 231-237 

ATP/GTP-binding site motif A (P-loop):

Amino acids 231-239 



HBGF/FGF family proteins:

Amino acids 78-94, 102-153
FIGURE 281

CCAGGATGGAGCTGGGGCCTGTATAGCCATATTATTGTTCTATGCT^VCTAGACATGGGGGGGA 

CTTGGTGAAAAAGGTATTATCCAGCCAGAGGGTCTGGGAGCCCTGTCTTACTGAACCTGGGCA 

ACCTGGATATTCTGAGACATATTTTGGGGGGATTTCAGTGAAAAAAGTGGGGGATCCCCTCCA 

TTTAGAGTGTAGCAAAGGAAAAAACACCAAGGTTGGGTTCCTTCCTGACATTGGCAGTGCCCC 

AGTAGGGGTGGGATGAGCGAATATTCCCAAAGCTAAAGTCCCACACCCTGTAGATTACAAGAG 

TGGATTTGGCAGGAGTGTGCCCCAAAATACAGTGGAAAGGTGCCTGAAGATATTTAAACCACG 

TCTTGGAAATTTAGTGGGTCTTGGCTTTGGGATAGGTGAAGTGAGGACAGACACTGGAGAGGA 

GGGAAAGGGGACGTTTTCAATAGGAGGCAAAACTCGAGGGTGGGATCCACTGAGGAGTACATA 

GGCTGCTGGATCTGGTGGAGCCAGCACTGGGCCCACGGGTGGTAACTGGCTGCTGTGGAGGGG 

GGTACGTGAGGGGGGGGTCTGGGGCTTATCCTCAGGTCCTGTGGGTGGGGCAGCGAGTCGGGG 

CCTGAGCGTCAAGAGCATGCCCTAGTGAGCGGGCTCCTCTGGGGGAGCCCAGCGCGCTCCGGG 

CGCCTGCCGGTTTGGGGGTGTCTCCTCCCGGGGCGCTATGGCGGCGCTGGCCAGTAGCCTGAT

CCGGCAGAAGCGGGAGGTCCGCGAGCCCGGGGGCAGCCGGCCGGTGTCGGCGCAGCGGCGCGT 

GTGTCCCCGCGGCACCAAGTCCCTTTGCCAGAAGCAGCTCCTCATCCTGCTGTCCAAGGTGCG 

ACTGTGCGGGGGGCGGCCCGCGCGGCCGGACCGCGGCCCGGAGCCTCAGCTCAAAGGCATCGT 

CACCAAACTGTTCTGCCGCCAGGGTTTCTACCTCCAGGCGAATCCCGACGGAAGCATCCAGGG 

CACCCCAGAGGATACCAGCTCCTTCACCCACTTCAACCTGATCCCTGTGGGCCTCCGTGTGGT 

CACCATCCAGAGCGCCAAGCTGGGTCACTACATGGCCATGAATGCTGAGGGACTGCTCTACAG 

TTCGCCGCATTTCACAGCTGAGTGTCGCTTTAAGGAGTGTGTCTTTGAGAATTACTACGTCCT 

GTACGCCTCTGCTCTCTACCGCCAGCGTCGTTCTGGCCGGGCCTGGTACCTCGGCCTGGACAA 

GGAGGGCCAGGTCATGAAGGGAAACCGAGTTAAGAAGACCAAGGCAGCTGCCCACTTTCTGCC 

CAAGCTCCTGGAGGTGGCCATGTACCAGGAGCCTTCTCTCCACAGTGTCCCCGAGGCCTCCCC 

TTCCAGTCCCCCTGCCCCCTGAAATGTAGTCCCTGGACTGGAGGTTCCCTGCAGTCCCAGTGA
GCCAGCCACCACCACAACCTGT
FIGURE 282

MAALASSLIRQKREVREPGGSRPVSAQRRVCPRGTKSLCQKQLLILLSKVRLCGGRPARPDRG 
PEPQLKGIVTKLFCRQGFYLQANPDGSIQGTPEDTSSFTHFNLIPVGLRVVTIQSAKLGHYMA 
MNAEGLLYSSPHFTAECRFKECVFENYYVLYASALYRQRRSGRAWYLGLDKEGQVMKGNRVKK 
TKAAAHFLPKLLEVAMYQEPSLHSVPEASPSSPPAP

Important features:

Tyrosine kinase phosphorylation site: 

Amino acids 199-207 

N-myristoylation sites: 

Amino acids 54-60; 89-95; 131-137 

HBGF/FGF family signature: 

Amino acids 131-155
FIGURE 283

ATGGCCGCGGCCATCGCTAGCGGCTTGATCCGCCAGAAGCGGCAGGCGCGGGAGCAGCACTGG 
GACCGGCCGTCTGCCAGCAGGAGGCGGAGCAGCCCCAGCAAGAACCGCGGGCTCTGCAACGGC 
AACCTGGTGGATATCTTCTCCAAAGTGCGCATCTTCGGCCTCAAGAAGCGCAGGTTGCGGCGC 
CAAGATCCCCAGCTCAAGGGTATAGTGACCAGGTTATATTGCAGGCAAGGCTACTACTTGCAA 
ATGCACCCCGATGGAGCTCTCGATGGAACCAAGGATGACAGCACTAATTCTACACTCTTCAAC 
CTCATACCAGTGGGACTACGTGTTGTTGCCATCCAGGGAGTGAAAACAGGGTTGTATATAGCC 
ATGAATGGAGAAGGTTACCTCTACCCATCAGAACTTTTTACCCCTGAATGCAAGTTTAAAGAA 
TCTGTTTTTGAAAATTATTATGTAATCTACTCATCCATGTTGTACAGACAACAGGAATCTGGT 
AGAGCCTGGTTTTTGGGATTAAATAAGGAAGGGCAAGCTATGAAAGGGAACAGAGTAAAGAAA 
ACCAAACCAGCAGCTCATTTTCTACCCAAGCCATTGGAAGTTGCCATGTACCGAGAACCATCT 
TTGCATGATGTTGGGGAAACGGTCCCGAAGCCTGGGGTGACGCCAAGTAAAAGCACAAGTGCG 
TCTGCAATAATGAATGGAGGCAAACCAGTCAACAAGAGTAAGACAACATAS
FIGURE 284



MAAAIASGLIRQKRQAREQHWDRPSASRRRSSPSKNRGLCNGNLVDIFSKVRIFGLKKRRLRR 
QDPQLKGIVTRLYCRQGYYLQMHPDGALDGTKDDSTNSTLFNLIPVGLRVVAIQGVKTGLYIA
MNGEGYLYPSELFTPECKFKESVFENYYVIYSSMLYRQQESGRAWFLGLNKEGQAMKGNRVKK 
TKPAAHFLPKPLEVAMYREPSLHDVGETVPKPGVTPSKSTSASAIMNGGKPVNKSKTT 

Important features: 
N-glycosylation sites: 
Amino acids 100-104, 242-246

cAMP- and cGMP-dependent protein kinase phosphorylation sites: 

Amino acids 28-32, 29-33 

Tyrosine kinase phosphorylation site: 

Amino acids 199-207 

N-myristoylation sites: 

Amino acids 38-44, 89-95, 118-124, 122-128, 222-228 



HBGF/FGF family proteins: 
Amino acids 104-155, 171-198
FIGURE 285

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTGGTTCAGGTCCAGGTTTTGCTTTGA 

TCCTTTTCAAAAACTGGAGACACAGAAGAGGGCTCTAGGAAAAAGTTTTGGATGGGATTATGTGGAAACTACCCT 

GCGATTCTCTGCTGCCAGAGCAGGCTCGGCGCTTCCACCCCAGTGCAGCCTTCCCCTGGCGGTGGTGAAAGAGAC 

TCGGGAGTCGCTGCTTCCAAAGTGCCCGCCGTGAGTGAGCTCTCACCCCAGTCAGCCAAATGAGCCTCTTCGGGC

TTCTCCTGCTGACATCTGCCCTGGCCGGCCAGAGACAGGGGACTCAGGCGGAATCCAACCTGAGTAGTAAATTCC 

AGTTTTCCAGCAACAAGGAACAGAACGGAGTACAAGATCCTCAGCATGAGAGAATTATTACTGTGTCTACTAATG 

GAAGTATTCACAGCCCAAGGTTTCCTCATACTTATCCAAGAAATACGGTCTTGGTATGGAGATTAGTAGCAGTAG 

AGGAAAATGTATGGATACAACTTACGTTTGATGAAAGATTTGGGCTTGAAGACCCAGAAGATGACATATGCAAGT 

ATGATTTTGTAGAAGTTGAGGAACCCAGTGATGGAACTATATTAGGGCGCTGGTGTGGTTCTGGTACTGTACCAG 

GAAAACAGATTTCTAAAGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCAGGGT 

TCTGCATCCACTACAACATTGTCATGCCACAATTCACAGAAGCTGTGAGTCCTTCAGTGCTACCCCCTTCAGCTT

TGCCACTGGACCTGCTTAATAATGCTATAACTGCCTTTAGTACCTTGGAAGACCTTATTCGATATCTTGAACCAG 

AGAGATGGCAGTTGGACTTAGAAGATCTATATAGGCCAACTTGGCAACTTCTTGGCAAGGCTTTTGTTTTTGGAA 

GAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAGAGGAGGTAAGATTATACAGCTGCACACCTCGTAACTTCT 

CAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTG 

GTGGGAACTGTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACC 

ACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGC 

ACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCA 

GAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCT 

TCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCA 

ACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTAT

TAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTC 

GATACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAAC 

TCTAAAGCTCCATGTCCTGGGCCTAAAATCGTATAAAATCTGGATTTTTTTTTTTTTTTTTGCTCATATTCACAT 

ATGTAAACCAGAACATTCTATGTACTACAAACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGT 

GTCATGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAATTTCTGCCATTTAGAAGAAGAGAACTACA 

TTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTG 

TTTCATTGTGTACATTTTTATATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCT 

ATTTTTACCAAAGGTATTTAATATTCTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCT 

AAACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCA 

TTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAA 

GACTTTTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGA 

AAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACAT 

AAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGAACACATCCTATTTA 

TTGTGATGTTGTGGTTTTATTATCTTAAACTCTGTTCCATACACTTGTATAAATACATGGATATTTTTATGTACA

GAAGTATGTCTCTTAACCAGTTCACTTATTGTACTCTGGCAATTTAAAAGAAAATCAGTAAAATATTTTGCTTGT 

AAAATGCTTAATATNGTGCCTAGGTTATGTGGTGACTATTTGAATCAAAAATGTATTGAATCATCAAATAAAAGA 

ATGTGGCTATTTTGGGGAGAAAATTAAAAAAAAAAAAAAAAAAAAAGGTTTAGGGATAACAGGGTAATGCGGCC 



WO 01/40466 PCT/US00/32678

286/550 

FIGURE 286 

MSLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERIITVSTNGSIHSPRF 
PHTYPRNTVLVWRLVAVEENVWIQLTFDERFGLEDPEDDICKYDFVEVEEPSDGTILGRWCGS 
GTVPGKQISKGNQIRIRFVSDEYFPSEPGFCIHYNIVMPQFTEAVSPSVLPPSALPLDLLNNA 
ITAFSTLEDLIRYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRVVDLNLLTEEVRLYSCTP
RNFSVSIREELKRTDTIFWPGCLLVKRCGGNCACCLHNCNECQCVPSKVTKKYHEVLQLRPKT 
GVRGLHKSLTDVALEHHEECDCVCRGSTGG 

Important features: 
signal sequence: 

Amino acids 1-14 

N-glycosylation sites: 

Amino acids 25-29;55-59;254-258 

N-myristoyiation sites: 

Amino acids l5-21;ir7-123;127-133;281-287;282-288;319-325 



Amidation site: 

Amino acids 229-233 
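
The N-glycosylation positions listed for FIGURE 286 can be cross-checked against the sequence itself. The following is a minimal sketch, not part of the original disclosure, which assumes the commonly used consensus motif Asn, any residue except Pro, Ser or Thr, any residue except Pro. The function name and the choice of the first 63 residues as a test fragment are illustrative assumptions.

```python
import re

# First 63 residues of the FIGURE 286 polypeptide, copied from the listing above.
FRAGMENT = "MSLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERIITVSTNGSIHSPRF"

# Assumed consensus motif: N, then any residue except P, then S or T,
# then any residue except P. The lookahead lets overlapping motifs be found.
N_GLYC = re.compile(r"(?=N[^P][ST][^P])")


def n_glycosylation_starts(seq):
    """Return 1-based start positions of candidate N-glycosylation motifs."""
    return [m.start() + 1 for m in N_GLYC.finditer(seq)]


print(n_glycosylation_starts(FRAGMENT))  # [25, 55]
```

The two start positions agree with the first two listed sites (25-29 and 55-59); the listing appears to report each site as a five-residue range beginning at the asparagine.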
FIGURE 287 

CAGCGCTGACTGCGCCGCGGAGAAAGCCAGTGGGAACCCAGACCCATAGGAGACCCGCGTCCC 

CGCTCGGCCTGGCCAGGCCCCGCGCTATGGAGTTCCTCTGGGCCCCTCTCTTGGGTCTGTGCT 

GCAGTCTGGCCGCTGCTGATCGCCACACCGTCTTCTGGAACAGTTCAAATCCCAAGTTCCGGA 

ATGAGGACTACACCATACATGTGCAGCTGAATGACTACGTGGACATCATCTGTCCGCACTATG 

AAGATCACTCTGTGGCAGACGCTGCCATGGAGCAGTACATACTGTACCTGGTGGAGCATGAGG 

AGTACCAGCTGTGCCAGCCCCAGTCCAAGGACCAAGTCCGCTGGCAGTGCAACCGGCCCAGTG 

CCAAGCATGGCCCGGAGAAGCTGTCTGAGAAGTTCCAGCGCTTCACACCTTTCACCCTGGGCA 

AGGAGTTCAAAGAAGGACACAGCTACTACTACATCTCCAAACCCATCCACCAGCATGAAGACC 

GCTGCTTGAGGTTGAAGGTGACTGTCAGTGGCAAAATCACTCACAGTCCTCAGGCCCATGACA 

ATCCACAGGAGAAGAGACTTGCAGCAGATGACCCAGAGGTGCGGGTTCTACATAGCATCGGTC 

ACAGTGCTGCCCCACGCCTCTTCCCACTTGCCTGGACTGTGCTGCTCCTTCCACTTCTGCTGC 

TGCAAACCCCGTSAAGGTGTGTGCCACACCTGGCCTTAAAGAGGGACAGGCTGAAGAGAGGGA 

GAGGCACTCCAAACCTGTCTTGGGGCCACTTTCAGAGCCCCCAGCCCTGGGAACCACTCCCAC 

CACAGGCATAAGCTATCACCTAGCAGCCTCAAAACGGGTCAATATTAAGGTTTTCAACCGGAA 

GGAGGCCAACCAGCCCGACAGTGCCATCCCCACCTTCACCTCGGAGGGATGGAGAAAGAAGTG 

GAGACAGTCCTTTCCCACCATTCCTGCCTTTAAGCCAAAGAAACAAGCTGTGCAGGCATGGTC 

CCTTAAGGCACAGTGGGAGCTGAGCTGGAAGGGGCCACGTGGATGGGCAAAGCTTGTCAAAGA 

TGCCCCCTTCAGGAGAGAGCCAGGATGCCCAGATGAACTGACTGAAGGAAAAGCAAGAAACAG 

TTTdTTGCTTGGAAGCCAGGTACAGGAGAGGCAGCATGCTTGGGCTGACCCAGCATCTCCCAG 

CAAGACCTCATCTGTGGAGCTGCCACAGAGAAGTTTGTAGCCAGGTACTGCATTCTCTCCCAT 

CCTGGGGCAGCACTCCCCAGAGCTGTGCCAGCAGGGGGGCTGTGCCAACCTGTTCTTAGAGTG 

TAGCTGTAAGGGCAGTGCCCATGTGTACATTCTGCCTAGAGTGTAGCCTAAAGGGCAGGGCCC 

ACGTGTATAGTATCTGTATATAAGTTGCTGTGTGTCTGTCCTGATTTCTACAACTGGAGTTTT 

TTTATACAATGTTCTTTGTCTCAAAATAAAGCAATGTGTTTTTTCGG 
FIGURE 288 



MEFLWAPLLGLCCSLAAADRHTVFWNSSNPKFRNEDYTIHVQLNDYVDIICPHYEDHSADAAM 
EQYILYLVEHEEYQLCQPQSKDQVRWQCNRPSAKHGPEKLSEKFQRFTPFTLGKEFKEGHSYY 
YISKPIHQHEDRCLRLKVTVSGKITHSPQAHDNPQEKRLAADDPEVRVLHSIGHSAAPRLFPL 
AWTVLLLPLLLLQTP 

Important features: 
Signal sequence: 
Amino acids 1-17 

N-glycosylation site: 

Amino acids 26-30 

Tyrosine kinase phosphorylation site: 

Amino acids 118-127 



N-myristoylation site: 

Amino acids 10-16 
FIGURE 289 

CGGACGCGTGGGCGGACGCGTGGGCGGCCCACGGCGCCCGCGGGCTGGGGCGGTCGCTTCTTC 

CTTCTCCGTGGCCTACGAGGGTCCCCAGCCTGGGTAAAGATSGCCCCATGGCCCCCGAAGGGC 

CTAGTCCCAGCTGTGCTCTGGGGCCTCAGCCTCTTCCTCAACCTCCCAGGACCTATCTGGCTC 

CAGCCCTCTCCACCTCCCCAGTCTTCTCCCCCGCCTCAGCCCCATCCGTGTCATACCTGCCGG 

GGACTGGTTGACAGCTTTAACAAGGGCCTGGAGAGAACCATCCGGGACAACTTTGGAGGTGGA 

AACACTGCCTGGGAGGAAGAGAATTTGTCCAAATACAAAGACAGTGAGACCCGCCTGGTAGAG 

GTGCTGGAGGGTGTGTGCAGCAAGTCAGACTTCGAGTGCCACCGCCTGCTGGAGCTGAGTGAG 

GAGCTGGTGGAGAGCTGGTGGTTTCACAAGCAGCAGGAGGCCCCGGACCTCTTCCAGTGGCTG 

TGCTCAGATTCCCTGAAGCTCTGCTGCCCCGCAGGCACCTTCGGGCCCTCCTGCCTTCCCTGT 

CCTGGGGGAACAGAGAGGCCCTGCGGTGGCTACGGGCAGTGTGAAGGAGAAGGGACACGAGGG 

GGCAGCGGGCACTGTGACTGCCAAGCCGGCTACGGGGGTGAGGCCTGTGGCCAGTGTGGCCTT 

GGCTACTTTGAGGCAGAACGCAACGCCAGCCATCTGGTATGTTCGGCTTGTTTTGGCCCCTGT 

GCCCGATGCTCAGGACCTGAGGAATCAAACTGTTTGCAATGCAAGAAGGGCTGGGCCCTGCAT 

CACCTCAAGTGTGTAGACATTGATGAGTGTGGCACAGAGGGAGCCAACTGTGGAGCTGACCAA 

TTGTGCGTGAACACTGAGGGCTCCTATGAGTGCCGAGACTGTGCCAAGGCCTGCCTAGGCTGC 

ATGGGGGCAGGGCCAGGTCGCTGTAAGAAGTGTAGCCCTGGCTATCAGCAGGTGGGCTCCAAG 

TGTCTCGATGTGGATGAGTGTGAGACAGAGGTGTGTCCGGGAGAGAACAAGCAGTGTGAAAAC 

ACCGAGGGCGGTTATCGCTGCATCTGTGCCGAGGGCTACAAGCAGATGGAAGGCATCTGTGTG 

AAGGAGCAGATCCCAGAGTCAGCAGGCTTCTTCTCAGAGATGACAGAAGACGAGTTGGTGGTG 

CTGCAGCAGATGTTCTTTGGCATCATCATCTGTGCACTGGCCACGCTGGCTGCTAAGGGCGAC 

TTGGTGTTCACCGCCATCTTCATTGGGGCTGTGGCGGCCATGACTGGCTACTGGTTGTCAGAG 

CGCAGTGACCGTGTGCTGGAGGGCTTCATCAAGGGCAGA2AATCGCGGCCACCACCTGTAGGA 

CCTCCTCCCACCCACGCTGCCCCCAGAGCTTGGGCTGCCCTCCTGCTGGACACTCAGGACAGC 

TTGGTTTATTTTTGAGAGTGGGGTAAGCACCCCTACCTGCCTTACAGAGCAGCCCAGGTACCC 

AGGCCCGGGCAGACAAGGCCCCTGGGGTAAAAAGTAGCCCTGAAGGTGGATACCATGAGCTCT 

TCACCTGGCGGGGACTGGCAGGCTTCACAATGTGTGAATTTCAAAAGTTTTTCCTTAATGGTG 

GCTGCTAGAGCTTTGGCCCCTGCTTAGGATTAGGTGGTCCTCACAGGGGTGGGGCCATCACAG 

CTCCCTCCTGCCAGCTGCATGCTGCCAGTTCCTGTTCTGTGTTCACCACATCCCCACACCCCA 

TTGCCACTTATTTATTCATCTCAGGAAATAAAGAAAGGTCTTGGAAAGTTAAAAAAAAAAAAA 
AAAAAAAAAAA 
FIGURE 290 

MAPWPPKGLVPAVLWGLSLFLNLPGPIWLQPSPPPQSSPPPQPHPCHTCRGLVDSFNKGLERT 
IRDNFGGGNTAWEEENLSKYKDSETRLVEVLEGVCSKSDFECHRLLELSEELVESWWFHKQQE 
APDLFQWLCSDSLKLCCPAGTFGPSCLPCPGGTERPCGGYGQCEGEGTRGGSGHCDCQAGYGG 
EACGQCGLGYFEAERNASHLVCSACFGPCARCSGPEESNCLQCKKGWALHHLKCVDIDECGTE 
GANCGADQFCVNTEGSYECRDCAKACLGCMGAGPGRCKKCSPGYQQVGSKCLDVDECETEVCP 
GENKQCENTEGGYRCICAEGYKQMEGICVKEQIPESAGFFSEMTEDELVVLQQMFFGIIICAL 
ATLAAKGDLVFTAIFIGAVAAMTGYWLSERSDRVLEGFIKGR 

Important features: 
Signal sequence: 

Amino acids 1-29 

Transmembrane domain: 

Amino acids 342-392 

N-glycosylation sites: 
Amino acids 79-83; 205-209 

cAMP- and cGMP-dependent protein kinase phosphorylation site: 

Amino acids 290-294 

Aspartic acid and asparagine hydroxylation site: 

Amino acids 321-333 

EGF-like domain cysteine pattern signature: 

Amino acids 181-193 
FIGURE 291 



CAGGTCCAACTGCACCTCGGTTCTATCGATTGAATTCCCCGGGGATCCTCTAGAGATCCCTCGACCTCGACCCAC 

GCGTCCGAACACAGGTCCTTGTTGCTGCAGAGAAGCAGTTGTTTTGCTGGAAGGAGGGAGTGCGCGGGCTGCCCC 

GGGCTCCTCCCTGCCGCCTCCTCTCAGTGGATGGTTCCAGGCACCCTGTCTGGGGCAGGGAGGGCACAGGCCTGC 

ACATCGAAGGTGGGGTGGGACCAGGCTGCCCCTCGCCCCAGCATCCAAGTCCTCCCTTGGGCGCCCGTGGCCCTG 

CAGACTCTCAGGGCTAAGGTCCTCTGTTGCTTTTTGGTTCCACCTTAGAAGAGGCTCCGCTTGACTAAGAGTAGC 

TTGAAGGAGGCACCATCCAGGAGCTGCATCTGCTCTGGTGGGCGCTTCTCCTGGGCCTGGCTCAGGCCTGCCCTG 

AGCCCTGCGACTGTGGGGAAAAGTATGGCTTCCAGATCGCCGACTGTGCCTACCGCGACCTAGAATCCGTGCCGC 

CTGGCTTCCCGGCCAATGTGACTACACTGAGCCTGTCAGCCAACCGGCTGCCAGGCTTGCCGGAGGGTGCCTTCA 

GGGAGGTGCCCCTGCTGCAGTCGCTGTGGCTGGCACACAATGAGATCCGCACGGTGGCCGCCGGAGCCCTGGCCT 

CTCTGAGCCATCTCAAGAGCCTGGACCTCAGCCACAATCTCATCTCTGACTTTGCCTGGAGCGACCTGCACAACC 

TCAGTGCCCTCCAATTGCTCAAGATGGACAGCAACGAGCTGACCTTCATCCCCCGCGACGCCTTCCGCAGCCTCC 

GTGCTCTGCGCTCGCTGCAACTCAACCACAACCGCTTGCACACATTGGCCGAGGGCACCTTCACCCCGCTCACCG 

CGCTGTCCCACCTGCAGATCAACGAGAACCCCTTCGACTGCACCTGCGGCATCGTGTGGCTCAAGACATGGGCCC 

TGACCACGGCCGTGTCCATCCCGGAGCAGGACAACATCGCCTGCACCTCACCCCATGTGCTCAAGGGTACACCGC 

TGAGCCGCCTGCCGCCACTGCCATGCTCGGCGCCCTCAGTGCAGCTCAGCTACCAACCCAGCCAGGATGGTGCCG 

AGCTGCGGCCTGGTTTTGTGCTGGCACTGCACTGTGATGTGGACGGGCAGCCGGCCCCTCAGCTTCACTGGCACA 

TCCAGATACCCAGTGGCATTGTGGAGATCACCAGCCCCAACGTGGGCACTGATGGGCGTGCCCTGCCTGGCACCC 

CTGTGGCCAGCTCCCAGCCGCGCTTCCAGGCCTTTGCCAATGGCAGCCTGCTTATCCCCGACTTTGGCAAGCTGG 

AGGAAGGCACCTACAGCTGCCTGGCCACCAATGAGCTGGGCAGTGCTGAGAGCTCAGTGGACGTGGCACTGGCCA 

CGCCCGGTGAGGGTGGTGAGGACACACTGGGGCGCAGGTTCCATGGCAAAGCGGTTGAGGGAAAGGGCTGCTATA 

CGGTTGACAACGAGGTGCAGCCATCAGGGCCGGAGGACAATGTGGTCATCATCTACCTCAGCCGTGCTGGGAACC 

CTGAGGCTGCAGTCGCAGAAGGGGTCCCTGGGCAGCTGCCCCCAGGCCTGCTCCTGCTGGGCCAAAGCCTCCTCC 

TCTTCTTCTTCCTCACCTCCTTC2A6CCCCACCCAGGGCTTCCCTAACTCCTCCCCTTGCCCCTACCAATGCCCC 

TTTAAGTGCTGCAGGGGTCTGGGGTTGGCAACTCCTGAGGCCTGCATGGGTGACTTCACATTTTCCTACCTCTCC 

TTCTAATCTCTTCTAGAGCACCTGCTATCCCCAACTTCTAGACCTGCTCCAAACTAGTGACTAGGATAGAATTTG 

ATCCCCTAACTCACTGTCTGCGGTGCTCATTGCTGCTAACAGCATTGCCTGTGCTCTCCTCTCAGGGGCAGCATG 

CTAACGGGGCGACGTCCTAATCCAACTGGGAGAAGCCTCAGTGGTGGAATTCCAGGCACTGTGACTGTCAAGCTG 

GCAAGGGCCAGGATTGGGGGAATGGAGCTGGGGCTTAGCTGGGAGGTGGTCTGAAGCAGACAGGGAATGGGAGAG 

GAGGATGGGAAGTAGACAGTGGCTGGTATGGCTCTGAGGCTCCCTGGGGCCTGCTCAAGCTCCTCCTGCTCCTTG 

CTGTTTTCTGATGATTTGGGGGCTTGGGAGTCCCTTTGTCCTCATCTGAGACTGAAATGTGGGGATCCAGGATGG 

CCTTCCTTCCrCTTACCCTTCCTCCCTCAGCCTGCAACCTCTATCCTGGAACCTGTCCTCCCTTTCTCCCCAACT 

ATGCATCTGTTGTCTGCTCCTCTGCAAAGGCCAGCCAGCTTGGGAGCAGCAGAGAAATAAACAGCATTTCTGATG 

CCAAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGTCGACCT 



FIGURE 292 



MQELHLLWWALLLGLAQACPEPCDCGEKYGFQIADCAYRDLESVPPGFPANVTTLSLSANRLP 
GLPEGAFREVPLLQSLWLAHNEIRTVAAGALASLSHLKSLDLSHNLISDFAWSDLHNLSALQL 
LKMDSNELTFIPRDAFRSLRALRSLQLNHNRLHTLAEGTFTPLTALSHLQINENPFDCTCGIV 
WLKTWALTTAVSIPEQDNIACTSPHVLKGTPLSRLPPLPCSAPSVQLSYQPSQDGAELRPGFV 
LALHCDVDGQPAPQLHWHIQIPSGIVEITSPNVGTDGRALPGTPVASSQPRFQAFANGSLLIP 
DFGKLEEGTYSCLATNELGSAESSVDVALATPGEGGEDTLGRRFHGKAVEGKGCYTVDNEVQP 
SGPEDNVVIIYLSRAGNPEAAVAEGVPGQLPPGLLLLGQSLLLFFFLTSF 

Important features: 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 
amino acids 403-418 

N-glycosylation sites: 

Amino acids 51-55,120-124,309-313 

Tyrosine kinase phosphorylation site: 
amino acids 319-326 

N-myristoylation sites: 

amino acids 14-20, 64-70, 92-98, 218-224, 294-300, 323-329, 334-340, 
350-356,394-400 

Amidation site: 

amino acids 355-359 

Leucine Rich Repeat: 

amino acids 51-74,75-98, 99-122,123-146,147-170 

Leucine rich repeat C-terminal domain: 

amino acids 180-230 
FIGURE 293 

ACTTGGAGCAAGCGGCGGCGGCGGAGACAGAGGCAGAGGCAGAAGCTGGGGCTCCGTCCTCGCCTCCCACGAGCG 

ATCCCCGAGGAGAGCCGCGGCCCTCGGCGAGGCGAAGAGGCCGACGAGGAAGACCCGGGTGGCTGCGCCCCTGCC 

TCGCTTCCCAGGCGCCGGCGGCTGCAGCCTTGCCCCTCTTGCTCGCCTTGAAAATGGAAAAGATGCTCGCAGGCT 

GCTTTCTGCTGATCCTCGGACAGATCGTCCTCCTCCCTGCCGAGGCCAGGGAGCGGTCACGTGGGAGGTCCATCT 

CTAGGGGCAGACACGCTCGGACCCACCCGCAGACGGCCCTTCTGGAGAGTTCCTGTGAGAACAAGCGGGCAGACC 

TGGTTTTCATCATTGACAGCTCTCGCAGTGTCAACACCCATGACTATGCAAAGGTCAAGGAGTTCATCGTGGACA 

TCTTGCAATTCTTGGACATTGGTCCTGATGTCACCCGAGTGGGCCTGCTCCAATATGGCAGCACTGTCAAGAATG 

AGTTCTCCCTCAAGACCTTCAAGAGGAAGTCCGAGGTGGAGCGTGCTGTCAAGAGGATGCGGCATCTGTCCACGG 

^^^^^'^'^^^^'^'^^'^^'^^^^'^CATCCAGTATGCCCTGAACATCGCATTCTCAGAAGCAGAGGGGGCCCGGCCCCTGA 

GGGAGAATGTGCCACGGGTCATAATGATCGTGACAGATGGGAGACCTCAGGACTCCGTGGCCGAGGTGGCTGCTA 

AGGCACGGGACACGGGCATCCTAATCTTTGCCATTGGTGTGGGCCAGGTAGACTTCAACACCTTGAAGTCCATTG 

GGAGTGAGCCCCATGAGGACCATGTCTTCCTTGTGGCCAATTTCAGCCAGATTGAGACGCTGACCTCCGTGTTCC 

AGAAGAAGTTGTGCACGGCCCACATGTGCAGCACCCTGGAGCATAACTGTGCCCACTTCTGCATCAACATCCCTG 

GCTCATACGTCTGCAGGTGCAAACAAGGCTACATTCTCAACTCGGATCAGACGACTTGCAGAATCCAGGATCTGT 

GTGCCATGGAGGACCACAACTGTGAGCAGCTCTGTGTGAATGTGCCGGGCTCCTTCGTCTGCCAGTGCTACAGTG 

GCTACGCCCTGGCTGAGGATGGGAAGAGGTGTGTGGCTGTGGACTACTGTGCCTCAGAAAACCACGGATGTGAAC 

^^^^IS?^J^'^®="^*^'^'^^°^TC^^CCTTTGCCAGTGCCATGAAGGATTTGCTCTTAACCCAGATGAAAAAA 

CGTGCACAAGGATCAACTACTGTGCACTGAACAAACCGGGCTGTGAGCATGAGTGCGTCAACATGGAGGAGAGCT 

^H~^H^^^^*^^^^^*^^^^°^°°^'^^^^CTCTGGACCCCAATGGCAAAACCTGCAGCCGAGTGGACCACTGTGCAC 

AGCAGGACCATGGCTGTGAGCAGCTGTGTCTGAACACGGAGGATTCCTTCGTCTGCCAGTGCTCAGAAGGCTTCC 

TCATCAACGAGGACCTCAAGACCTGCTCCCGGGTGGATTACTGCCTGCTGAGTGACCATGGTTGTGAATACTCCT 

GTGTCAACATGGACAGATCCTTTGCCTGTCAGTGTCCTGAGGGACACGTGCTCCGCAGCGATGGGAAGACGTGTG 

CAAAATTGGACTCTTGTGCTCTGGGGGACCACGGTTGTGAACATTCGTGTGTAAGCAGTGAAGATTCGTTTGTGT 

GCCAGTGCTTTGAAGGTTATATACTCCGTGAAGATGGAAAAACCTGCAGAAGGAAAGATGTCTGCCAAGCTATAG 

ACCATGGCTGTGAACACATTTGTGTGAACAGTGACGACTCATACACGTGCGAGTGCTTGGAGGGATTCCGGCTCG 

CTGAGGATGGGAAACGCTGCCGAAGGAAGGATGTCTGCAAATCAACCCACCATGGCTGC6AACACATTTGTGTTA 

ATAATGGGAATTCCTACATCTGCAAATGCTCAGAGGGATTTGTTCTAGCTGAGGACGGAAGACGGTGCAAGAAAT 

GCACTGAAGGCCCAATTGACCTGGTCTTTGTGATCGATGGATCCAAGAGTCTTGGAGAAGAGAATTTTGAGGTCG 

TGAAGCAGTTTGTCACTGGAATTATAGATTCCTTGACAATTTCCCCCAAAGCCGCTCGAGTGGGGCTGCTCCAGT 

ATTCCACACAGGTCCACACAGAGTTCACTCTGAGAAACTTCAACTCAGCCAAAGACATGAAAAAAGCCGTGGCCC 

ACATGAAATACATGGGAAAGGGCTCTATGACTGGGCTGGCCCTGAAACACATGTTTGAGAGAAGTTTTACCCAAG 

»^^S^*^°°*^^^^*^*^^^^^^'^^^^^^^°^'^°^^^^^'^<^C*GCCATTGTGTTCACCGACGGACGGGCTCAGGATG 

ACGTCTCCGAGTGGGCCAGTAAAGCCAAGGCCAATGGTATCACTATGTATGCTGTTGGGGTAGGAAAAGCCATTG 

AGGAGGAACTACAAGAGATTGCCTCTGAGCCCACAAACAAGCATCTCTTCTATGCCGAAGACTTCAGCACAATGG 

^IS^S^^^°'^*^'^'^'^^^^°^°'^<=ATCTGTGAAGCTCTAGAAGACTCCGATGGAAGACAGGACTCTCCAG 

^^^™^^°^^^^^^^*^°^'''=*=^'='^^'=C^CAGAATCTGAGCCAGTCACCATAAATATCCAAGACCTACTTT 

CCTGTTCTAATTTTGCAGTGCAACACAGATATCTGTTTGAAGAAGACAATCTTTTACGGTCTACACAAAAGCTTT 

CCCATTCAACAAAACCTTCAGGAAGCCCTTTGGAAGAAAAACACGATCAATGCAAATGTGAAAACCTTATAATGT 

TCCAGAACCTTGCAAACGAAGAAGTAAGAAAATTAACACAGCGCTTAGAAGAAATGACACAGAGAATGGAAGCCC 

TGGAAAATCGCCTGAGATACAGATSaAGATTAGAAATCGCGACACATTTGTAGTCATTGTATCACGGATTACAAT 

GAACGCAGTGCAGAGCCCCAAAGCTCAGGCTATTGTTAAATCAATAATGTTGTGAAGTAAAACAATCAGTACTGA 

GAAACCTGGTTTGCCACAGAACAAAGACAAGAAGTATACACTAACTTGTATAAATTTATCTAGGAAAAAAATCCT 

TCAGAATTCTAAGATGAATTTACCAGGTGAGAATGAATAAGCTATGCAAGGTATTTTGTAATATACTG'I'GGACAC 

AACTTGCTTCTGCCTCATCCTGCCTTAGTGTGCAATCTCATTTGACTATACGATAAAGTTTGCACAGTCTTACTT 

CTGTAGAACACTGGCCATAGGAAATGCTGTTTTTTTGTACTGGACTTTACCTTGATATATGTATATGGATGTATG 

CATAAAATCATAGGACATATGTACTTGTGGAACAAGTTGGATTTTTTATACAATATTAAAATTCACCACTTCAG 
FIGURE 294 



MEKMLAGCFLLILGQIVLLPAEARERSRGRSISRGRHARTHPQTALLESSCENKRADLVFIID 
SSRSVNTHDYAKVKEFIVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKR 
MRHLSTGTMTGLAIQYALNIAFSEAEGARPLRENVPRVIMIVTDGRPQDSVAEVAAKARDTGI 
LIFAIGVGQVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQKKLCTAHMCSTLEHNCAHF 
CINIPGSYVCRCKQGYILNSDQTTCRIQDLCAMEDHNCEQLCVNVPGSFVCQCYSGYALAEDG 
KRCVAVDYCASENHGCEHECVNADGSYLCQCHEGFALNPDEKTCTRINYCALNKPGCEHECVN 
MEESYYCRCHRGYTLDPNGKTCSRVDHCAQQDHGCEQLCLNTEDSFVCQCSEGFLINEDLKTC 
SRVDYCLLSDHGCEYSCVNMDRSFACQCPEGHVLRSDGKTCAKLDSCALGDHGCEHSCVSSED 
SFVCQCFEGYILREDGKTCRRKDVCQAIDHGCEHICVNSDDSYTCECLEGFRLAEDGKRCRRK 
DVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGPIDLVFVIDGSKSLGEENF 
EVVKQFVTGIIDSLTISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKYMGKGSMT 
GLALKHMFERSFTQGEGARPLSTRVPRAAIVFTDGRAQDDVSEWASKAKANGITMYAVGVGKA 
IEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKGICEALEDSDGRQDSPAGELPKTVQQPT 
ESEPVTINIQDLLSCSNFAVQHRYLFEEDNLLRSTQKLSHSTKPSGSPLEEKHDQCKCENLIM 
FQNLANEEVRKLTQRLEEMTQRMEALENRLRYR 

Important features: 
Signal sequence: 

Amino acids 1-23 

N-glycosylation site: 

Amino acids 221-225 

cAMP- and cGMP-dependent protein kinase phosphorylation sites: 

Amino acids 115-119; 606-610; 892-896 

N-myristoylation sites: 

Amino acids 133-139; 258-264; 299-305; 340-346; 453-459; 494-500; 
639-645; 690-694; 752-758; 792-798 

Amidation sites: 

Amino acids 314-318; 560-564; 601-605 

Aspartic acid and asparagine hydroxylation sites: 

Amino acids 253-265; 294-306; 335-347; 376-388; 417-429; 
458-470; 540-552; 581-593 
FIGURE 295 

GGCCGGAGCAGCACGGCCGCAGGACCTGGAGCTCCGGCTGCGTCTTCCCGCAGCGCTACCCGC 

C^gCGCCTGCCGCGCCGGGCCGCGCTGGGGCTCCTGCCGCTTCTGCTGCTGCTGCCGCCCGC 

GCCGGAGGCCGCCAAGAAGCCGACGCCCTGCCACCGGTGCCGGGGGCTGGTGGACAAGTTTAA 

CCAGGGGATGGTGGACACCGCAAAGAAGAACTTTGGCGGCGGGAACACGGCTTGGGAGGAAAA 

GACGCTGTCCAAGTACGAGTCCAGCGAGATTCGCCTGCTGGAGATCCTGGAGGGGCTGTGCGA 

GAGCAGCGACTTCGAATGCAATCAGATGCTAGAGGCGCAGGAGGAGCACCTGGAGGCCTGGTG 

GCTGCAGCTGAAGAGCGAATATCCTGACTTATTCGAGTGGTTTTGTGTGAAGACACTGAAAGT 

GTGCTGCTCTCCAGGAACCTACGGTCCCGACTGTCTCGCATGCCAGGGCGGATCCCAGAGGCC 

CTGCAGCGGGAATGGCCACTGCAGCGGAGATGGGAGCAGACAGGGCGACGGGTCCTGCCGGTG 

CCACATGGGGTACCAGGGCCCGCTGTGCACTGACTGCATGGACGGCTACTTCAGCTCGCTCCG 

GAACGAGACCCACAGCATCTGCACAGCCTGTGACGAGTCCTGCAAGACGTGCTCGGGCCTGAC 

CAACAGAGACTGCGGCGAGTGTGAAGTGGGCTGGGTGCTGGACGAGGGCGCCTGTGTGGATGT 

GGACGAGTGTGCGGCCGAGCCGCCTCCCTGCAGCGCTGCGCAGTTCTGTAAGAACGCCAACGG 

CTCCTACACGTGCGAAGAGTGTGACTCCAGCTGTGTGGGCTGCACAGGGGAAGGCCCAGGAAA 

CTGTAAAGAGTGTATCTCTGGCTACGCGAGGGAGCACGGACAGTGTGCAGATGTGGACGAGTG 

CTCACTAGCAGAAAAAACCTGTGTGAGGAAAAACGAAAACTGCTACAATACTCCAGGGAGCTA 

CGTCTGTGTGTGTCCTGACGGCTTCGAAGAAACGGAAGATGCCTGTGTGCCGCCGGCAGAGGC 

TGAAGCCACAGAAGGAGAAAGCCCGACACAGCTGCCCTCCCGCGAAGACCTG^ATGTGCCGG 

ACTTACCCTTTAAATTATTCAGAAGGATGTCCCGTGGAAAATGTGGCCCTGAGGATGCCGTCT 

CCTGCAGTGGACAGCGGCGGGGAGAGGCTGCCTGCTCTCTAACGGTTGATTCTCATTTGTCCC 

TTAAACAGCTGCATTTCTTGGTTGTTCTTAAACAGACTTGTATATTTTGATACAGTTCTTTGT 

AATAAAATTGACCATTGTAGGTAATCAGGAGGAAAAAAAAA 
FIGURE 296 

MRLPRRAALGLLPLLLLLPPAPEAAKKPTPCHRCRGLVDKFNQGMVDTAKKNFGGGNTAWEEK 
TLSKYESSEIRLLEILEGLCESSDFECNQMLEAQEEHLEAWWLQLKSEYPDLFEWFCVKTLKV 
CCSPGTYGPDCLACQGGSQRPCSGNGHCSGDGSRQGDGSCRCHMGYQGPLCTDCMDGYFSSLR 
NETHSICTACDESCKTCSGLTNRDCGECEVGWVLDEGACVDVDECAAEPPPCSAAQFCKNANG 
SYTCEECDSSCVGCTGEGPGNCKECISGYAREHGQCADVDECSLAEKTCVRKNENCYNTPGSY 
VCVCPDGFEETEDACVPPAEAEATEGESPTQLPSREDL 

Important features: 

Signal peptide: 

Amino acids 1-24 

N-glycosylation sites: 

Amino acids 190-194;251-255 

Glycosaminoglycan attachment sites: 

Amino acids 149-153; 155-159 

cAMP- and cGMP-dependent protein kinase phosphorylation site: 

Amino acids 26-30 

Tyrosine kinase phosphorylation site: 

Amino acids 303-310 

N-myristoylation sites: 

Amino acids 44-50; 54-60; 55-61; 81-87; 150-156; 158-164; 164-170; 
252-258; 313-319 

Aspartic acid and asparagine hydroxylation site: 

Amino acids 308-320 

EGF-like domain cysteine pattern signature: 

Amino acids 166-178 



Leucine zipper pattern: 

Amino acids 94-116 
FIGURE 297 

GACATCGGAGGTGGGCTAGCACTGAAACTGCTTTTCAAGACGAGGAAGAGGAGGAGAAAGAGAAAGAAGAGGAAG 

ATGTTGGGCAACATTTATTTAACATGCTCCACAGCCCGGACCCTGGCATCATGCTGCTATTCCTGCAAATACTGA 

AGAAGCATGGGATTTAAATATTTTACTTCTAAATAAATGAATTACTCAATCTCCTATGACCATCTATACATACTC 

CACCTTCAAAAAGTACATCAATATTATATCATTAAGGAAATAGTAACCTTCTCTTCTCCAATATGCATGACATTT 

TTGGACAATGCAATTGTGGCACTGGCACTTATTTCAGTGAAGAAAAACTTTGTGGTTCTATGGCATTCATCATTT 

GACAAATGCAAGCATCTTCCTTATCAATCAGCTCCTATTGAACTTACTAGCACTGACTGTGGAATCCTTAAGGGC 

CCATTACATTTCTGAAGAAGAAAGCTAAG»TCAAGGACATGCCACTCCGAATTCATGTGCTACTTGGCCTAGCTA 

TCACTACACTAGTACAAGCTGTAGATAAAAAAGTGGATTGTCCACGGTTATGTACGTGTGAAATCAGGCCTTGGT 

TTACACCCAGATCCATTTATATGGAAGCATCTACAGTGGATTGTAATGATTTAGGTCTTTTAACTTTCCCAGCCA 

GATTGCCAGCTAACACACAGATTCTTCTCCTACAGACTAACAATATTGCAAAAATTGAATACTCCACAGACTTTC 

CAGTAAACCTTACTGGCCTGGATTTATCTCAAAACAATTTATCTTCAGTCACCAATATTAATGTAAAAAAGATGC 

CTCAGCTCCTTTCTGTGTACCTAGAGGAAAACAAACTTACTGAACTGCCTGAAAAATGTCTGTCCGAACTGAGCA 

ACTTACAAGAACTCTATATTAATCACAACTTGCTTTCTACAATTTCACCTGGAGCCTTTATTGGCCTACATAATC 

TTCTTCGACTTCATCTCAATTCAAATAGATTGCAGATGATCAACAGTAAGTGGTTTGATGCTCTTCCAAATCTAG 

AGATTCTGATGATTGGGGAAAATCCAATTATCAGAATCAAAGACATGAACTTTAAGCCTCTTATCAATCTTCGCA 

GCCTGGTTATAGCTGGTATAAACCTCACAGAAATACCAGATAACGCCTTGGTTGGACTGGAAAACTTAGAAAGCA 

TCTCTTTTTACGATAACAGGCTTATTAAAGTACCCCATGTTGCTCTTCAAAAAGTTGTAAATCTCAAATTTTTGG 

ATCTAAATAAAAATCCTATTAATAGAATACGAAGGGGTGATTTTAGCAATATGCTACACTTAAAAGAGTTGGGGA 

TAAATAATATGCCTGAGCTGATTTCCATCGATAGTCtTGCTGTGGATAACCTGCCAGATTTAAGAAAAATAGAAG 

CTACTAACAACCCTAGATTGTCTTACATTCACCCCAATGCATTTTTCAGACTCCCCAAGCTGGAATCACTCATGC 

TGAACAGCAATGCTCTCAGTGCCCTGTACCATGGTACCATTGAGTCTCTGCCAAACCTCAAGGAAATCAGCATAC 

ACAGTAACCCCATCAGGTGTGACTGTGTCATCCGTTGGATGAACATGAACAAAACCAACATTCGATTCATGGAGC 

CAGATTCACTGTTTTGCGTGGACCCACCTGAATTCCAAGGTCAGAATGTTCGGCAAGTGCATTTCAGGGACATGA 

TGGAAATTTGTCTCCCTCTTATAGCTCCTGAGAGCTTTCCTTCTAATCTAAATGTAGAAGCTGGGAGCTATGTTT 

CCTTTCACTGTAGAGCTACTGCAGAACCACAGCCTGAAATCTACTGGATAACACCTTCTGGTCAAAAACTCTTGC 

CTAATACCCTGACAGACAAGTTCTATGTCCATTCTGAGGGAACACTAGATATAAATGGCGTAACTCCCAAAGAAG 

GGGGTTTATATACTTGTATAGCAACTAACCTAGTTGGCGCTGACTTGAAGTCTGTTATGATCAAAGTGGATGGAT 

CTTTTCCACAAGATAACAATGGCTCTTTGAATATTAAAATAAGAGATATTCAGGCCAATTCAGTTTTGGTGTCCT 

GGAAAGCAAGTTCTAAAATTCTCAAATCTAGTGTTAAATGGACAGCCTTTGTCAAGACTGAAAATTCTCATGCTG 

CGCAAAGTGCTCGAATACCATCTGATGTCAAGGTATATAATCTTACTCATCTGAATCCATCAACTGAGTATAAAA 

TTTGTATTGATATTCCCACCATCTATCAGAAAAACAGAAAAAAATGTGTAAATGTCACCACCAAAGGTTTGCACC 

CTGATCAAAAAGAGTATGAAAAGAATAATACCACAACACTTATGGCCTGTCTTGGAGGCCTTCTGGGGATTATTG 

GTGTGATATGTCTTATCAGCTGCCTCTCTCCAGAAATGAACTGTGATGGTGGACACAGCTATGTGAGGAATTACT 

TACAGAAACCAACCTTTGCATTAGGTGAGCTTTATCCTCCTCTGATAAATCTCTGGGAAGCAGGAAAAGAAAAAA 

GTACATCACTGAAAGTAAAAGCAACTGTTATAGGTTTACCAACAAATATGTCCTAAAAACCACCAAGG 
CTCCAAAAATGAAC 
FIGURE 298 

MKDMPLRIHVLLGLAITTLVQAVDKKVDCPRLCTCEIRPWFTPRSIYMEASTVDCNDLGLLTF 
PARLPANTQILLLQTNNIAKIEYSTDFPVNLTGLDLSQNNLSSVTNINVKKMPQLLSVYLEEN 
KLTELPEKCLSELSNLQELYINHNLLSTISPGAFIGLHNLLRLHLNSNRLQMINSKWFDALPN 
LEILMIGENPIIRIKDMNFKPLINLRSLVIAGINLTEIPDNALVGLENLESISFYDNRLIKVP 
HVALQKVVNLKFLDLNKNPINRIRRGDFSNMLHLKELGINNMPELISIDSLAVDNLPDLRKIE 
ATNNPRLSYIHPNAFFRLPKLESLMLNSNALSALYHGTIESLPNLKEISIHSNPIRCDCVIRW 
MNMNKTNIRFMEPDSLFCVDPPEFQGQNVRQVHFRDMMEICLPLIAPESFPSNLNVEAGSYVS 
FHCRATAEPQPEIYWITPSGQKLLPNTLTDKFYVHSEGTLDINGVTPKEGGLYTCIATNLVGA 
DLKSVMIKVDGSFPQDNNGSLNIKIRDIQANSVLVSWKASSKILKSSVKWTAFVKTENSHAAQ 
SARIPSDVKVYNLTHLNPSTEYKICIDIPTIYQKNRKKCVNVTTKGLHPDQKEYEKNNTTTLM 
ACLGGLLGIIGVICLISCLSPEMNCDGGHSYVRNYLQKPTFALGELYPPLINLWEAGKEKSTS 
LKVKATVIGLPTNMS 

Important features: 

Signal sequence: 

amino acids 1-22 

Transmembrane domain: 

amino acids 633-650 

N-glycosylation site. 

amino acids 93-97, 103-107, 223-227, 382-386, 522-526, 579-583, 
608-612, 624-628, 625-629 

Casein kinase II phosphorylation site. 

amino acids 51-55, 95-99, 242-246, 468-472, 487-491 

Tyrosine kinase phosphorylation site. 

amino acids 570-579 

N-myristoylation site. 

amino acids 13-19, 96-102, 158-164, 221-227, 352-358, 437-443, 
491-497, 492-498, 634-640, 702-708 



Cell attachment sequence. 

amino acids 277-280 
FIGURE 299 

GCTGTGGGAACCTCTCCACGCGCACGAACTCAGCCAACGATTTCTGATAGATTTTTGGGAGTT 
TGACCAGAGATGCAAGGGGTGAAGGAGCGCTTCCTACCGTTAGGGAACTCTGGGGACAGAGCG 
CCCCGGCCGCCTGATGGCCGAGGCAGGGTGCGACCCAGGACCCAGGACGGCGTCGGGAACCAT 
ACCATGGCCCGGATCCCCAAGACCCTAAAGTTCGTCGTCGTCATCGTCGCGGTCCTGCTGCCA 
GTCCTAGCTTACTCTGCCACCACTGCCCGGCAGGAGGAAGTTCCCCAGCAGACAGTGGCCCCA 
CAGCAACAGAGGCACAGCTTCAAGGGGGAGGAGTGTCCAGCAGGATCTCATAGATCAGAACAT 
ACTGGAGCCTGTAACCCGTGCACAGAGGGTGTGGATTACACCAACGCTTCCAACAATGAACCT 
TCTTGCTTCCCATGTACAGTTTGTAAATCAGATCAAAAACATAAAAGTTCCTGCACCATGACC 
AGAGACACAGTGTGTCAGTGTAAAGAAGGCACCTTCCGGAATGAAAACTCCCCAGAGATGTGC 
CGGAAGTGTAGCAGGTGCCCTAGTGGGGAAGTCCAAGTCAGTAATTGTACGTCCTGGGATGAT 
ATCCAGTGTGTTGAAGAATTTGGTGCCAATGCCACTGTGGAAACCCCAGCTGCTGAAGAGACA 
ATGAACACCAGCCCGGGGACTCCTGCCCCAGCTGCTGAAGAGACAATGAACACCAGCCCAGGG 
ACTCCTGCCCCAGCTGCTGAAGAGACAATGACCACCAGCCCGGGGACTCCTGCCCCAGCTGCT 
GAAGAGACAATGACCACCAGCCCGGGGACTCCTGCCCCAGCTGCTGAAGAGACAATGACCACC 
AGCCCGGGGACTCCTGCCTCTTCTCATTACCTCTCATGCACCATCGTAGGGATCATAGTTCTA 
ATTGTGeXTCTGATTGTGTTTGTTI^AAGACTTCACTGTGGAAGAAATTCCTTCCTTACCTG 

AAAGGTTCAGGTAGGCGCTGGCTGAGGGCGGGGGGCGCTGGACACTCTCTGCCCTGCCTCCCT 
CTGCTGTGTTCCCACAGACAGAAACGCCTGC 
FIGURE 300 

MARIPKTLKFVVVIVAVLLPVLAYSATTARQEEVPQQTVAPQQQRHSFKGEECPAGSHRSEHT 
GACNPCTEGVDYTNASNNEPSCFPCTVCKSDQKHKSSCTMTRDTVCQCKEGTFRNENSPEMCR 
KCSRCPSGEVQVSNCTSWDDIQCVEEFGANATVETPAAEETMNTSPGTPAPAAEETMNTSPGT 

PAPAAEETMTTSPGTPAPAAEETMTTSPGTPAPAAEETMTTSPGTPASSHYLSCTIVGIIVLI 
VLLIVFV 

Important features: 
Signal peptide: 

Amino acids 1-29 

Transmembrane domain: 

Amino acids 240-259 

N-glycosylation site: 

Amino acids 77-81; 140-144; 156-160 

cAMP- and cGMP-dependent protein kinase phosphorylation site: 

Amino acids 126-130 



N-myristoylation sites: 

Amino acids 56-62;72-78; 114-120; 154-160;233-239 
FIGURE 301 

CACAAGCATCTTAATTTGAATCCACAAAGTTTCATGTAATGAAAAGAAATACATAATTTTAAT 

TCAACCCGAGTGTTTTCCAAGAAGATTGTATTTGCTTAAATTGCTACAGTAATTCAAGAGACA 

GCCCTGTCTGGACACAGAGTTACTGTGGATTTTTAAGAGACTCAGTTAAAGAATTTAGGAATT 

TCTGATTCATTTAAAGGATTTACAAATTCATCAACCCCTGAAAACTAAAGCAAATTGAACAGG 

AAAAAAAAAAAGAAGATGGGTTTTTTAAGTCCAATATATGTTATTTTCTTCTTTTTTGGAGTC 

AAAGTACATTGCCAATATGAAACTTATCAGTGGGATGAAGACTATGACCAAGAGCCAGATGAT 

GATTACCAAACAGGATTCCCATTTCGTCAAAATGTAGACTACGGAGTTCCTTTTCATCAGTAT 

ACTTTAGGCTGTGTCAGTGAATGCTTCTGTCCAACTAACTTTCCATCATCAATGTACTGTGAT 

AATCGCAAACTCAAGACTATCCCAAATATTCCGATGCACATTCAGCAACTCTACCTTCAGTTC 

AATGAAATTGAGGCTGTGACTGCAAATTCATTCATCAATGCAACTCATCTTAAAGAAATTAAC 

CTCAGCCACAACAAAATTAAATCTCAAAAGATTGATTATGGTGTGTTTGCTAAGCTTCCAAAT 

CTACTACAACTTCATCTAGAGCATAATAATTTAGAAGAATTTCCATTTCCTCTTCCTAAATCT 

CTGGAAAGACTCCTTCTTGGTTACAATGAAATCTCCAAACTGCAGACAAATGCTATGGATGGG 

CTAGTAAACTTG^^CCATGCTTGATCTCTGTTATAATTATCTTCATGATTCTCTGCTAAAAGAC 

AAAATCTTTGCCAAAATGGAAAAACTAATGCAGCTCAACCTCTGCAGTAACAGATTAGAATCA 

ATGCCTCCTGGTTTGCCTTCTTCACTTATGTATCTGTCTTTAGAAAATAATTCAATTTCTTCT 

ATACCCGAAAAATACTTCGACAAACTTCCAAAACTTCATACTCTAAGAATGTCACACAACAAA 

CTACAAGACATCCCATATAATATTTTTAATCTTCCCAACATTGTAGAACTCAGTGTTGGACAC 

AACAAATTGAAGCAAGCATTCTATATTCCAAGAAATTTGGAACACCTATACCTACAAAATAAT 

GAAATAGAAAAGATGAATCTTACAGTGATGTGTCCTTCTATTGACCCACTACATTACCACCAT 

TTAACATACATTCGTGTGGACCAAAATAAACTAAAAGAACCAATAAGCTCATACATCTTCTTC 

TGCTTCCCTCATATACACACTATTTATTATGGTGAACAACGAAGCACTAATGGTCAAACAATA 

CAACTAAAGACACAAGTTTTCAGGAGATTTCCAGATGATGATGATGAAAGTGAAGATCACGAT 

GATCCTGACAATGCTCATGAGAGCCCAGAACAAGAAGGAGCAGAAGGGCACTTTGACCTTCAT 

TATTATGAAAATCAAGAAT^AAGAAACTATATAGGTATACACTTACGACTTCACAAAACCTA 

TACTTAATATAGTAAATCTAAGTAAACATGTATTACTCAAAGTAATATATTTAGAATTATGTA 

TTAGTATAAGATCAGAATTGAATTTAAGTTGTTGGTGACATCTGCATCATTTCATAGGATTAG 

AACTTACTCAAAATAATGTAAATCTTTAAAAATATAAATTAGAATGACAAGTGGGAATCATAA 

ATTAAACGTTAATGGTTTCTTATGCTCTTTTTAAATATAGAAATATCATGTTAAAGAAAAAAA 
AAAAAAA 
FIGURE 302 

MGFLSPIYVIFFFFGVKVHCQYETYQWDEDYDQEPDDDYQTGFPFRQNVDYGVPFHQYTLGCV 
SECFCPTNFPSSMYCDNRKLKTIPNIPMHIQQLYLQFNEIEAVTANSFINATHLKEINLSHNK 
IKSQKIDYGVFAKLPNLLQLHLEHNNLEEFPFPLPKSLERLLLGYNEISKLQTNAMDGLVNLT 
MLDLCYNYLHDSLLKDKIFAKMEKLMQLNLCSNRLESMPPGLPSSLMYLSLENNSISSIPEKY 
FDKLPKLHTLRMSHNKLQDIPYNIFNLPNIVELSVGHNKLKQAFYIPRNLEHLYLQNNEIEKM 
NLTVMCPSIDPLHYHHLTYIRVDQNKLKEPISSYIFFCFPHIHTIYYGEQRSTNGQTIQLKTQ 
VFRRFPDDDDESEDHDDPDNAHESPEQEGAEGHFDLHYYENQE 

Important features: 
N-glycosylation sites: 

Amino acids 113-117; 121-125; 187-191;242-246; 316-320 

Tyrosine kinase phosphorylation sites: 

Amino acids 268-275; 300-307 

N-myristoylation site: 

Amino acids 230-236 



Leucine zipper patterns: 

Amino acids 146-168;217-239 
FIGURE 303 

GCCCGGGACTGGCGCAAGGTGCCCAAGCAAGGAAAGAAATAATGAAGAGACACATGTGTTAGC 

TGCAGCCTTTTGAAACACGCAAGAAGGAAATCAATAGTGTGGACAGGGCTGGAACCTTTACCA 

CGCTTGTTGGAGTAGATGAGGAATGGGCTCGTGATTATGCTGACATTCCAGCATGAATCTGGT 

AGACCTGTGGTTAACCCGTTCCCTCTCCATGTGTCTCCTCCTACAAAGTTTTGTTCTTATGAT 

ACTGTGCTTTCATTCTGCCAGTATGTGTCCCAAGGGCTGTCTTTGTTCTTCCTCTGGGGGTTT 

AAATGTCACCTGTAGCAATGCAAATCTCAAGGAAATACCTAGAGATCTTCCTCCTGAAACAGT 

CTTACTGTATCTGGACTCCAATCAGATCACATCTATTCCCAATGAAATTTTTAAGGACCTCCA 

TCAACTGAGAGTTCTCAACCTGTCCAAAAATGGCATTGAGTTTATCGATGAGCATGCCTTCAA 

AGGAGTAGGTGAAACCTTGCAGACTCTGGACTTGTCCGACAATCGGATTCAAAGTGTGCACAA 

AAATGCCTTCAATAACCTGAAGGCCAGGGCCAGAATTGCCAACAACCCCTGGCACTGCGACTG 

TACTCTACAGCAAGTTCTGAGGAGCATGGCGTCCAATCATGAGACAGCCCACAACGTGATCTG 

TAAAACGTCCGTGTTGGATGAACATGCTGGCAGACCATTCCTCAATGCTGCCAACGACGCTGA 

CCTTTGTAACCTCCCTAAAAAAACTACCGATTATGCCATGCTGGTCACCATGTTTGGCTGGTT 

CACTATGGTGATCTCATATGTGGTATATTATGTGAGGCAAAATCAGGAGGATGCCCGGAGACA 

CCTCGAATACTTGAAATCCCTGCCAAGCAGGCAGAAGAAAGCAGATGAACCTGATGATATTAG 

CACTGTGGTA31AGTGTCCAAACTGACTGTCATTGAGAAAGAAAGAAAGTAGTTTGCGATTGCA 

GTAGAAATAAGTGGTTTACTTCTCCCATCCATTGTAAACATTTGAAACTTTGTATTTCAGTTT 

TTTTTGAATTATGCCACTGCTGAACTTTTAACAAACACTACAACATAAATAATTTGAGTTTAG 

GTGATCCACCCCTTAATTGTACCCCCGATGGTATATTTCTGAGTAAGCTACTATCTGAACATT 

AGTTAGATCCATCTCACTATTTAATAATGAAATTTATTTTTTTAATTTAAAAGCAAATAAAAG 

CTTAACTTTGAACCATGGGAAAAAAAAAAAAAAAAAAAAAAAACA 
FIGURE 304 

MNLVDLWLTRSLSMCLLLQSFVLMILCFHSASMCPKGCLCSSSGGLNVTCSNANLKEIPRDLP 

PETVLLYLDSNQITSIPNEIFKDLHQLRVLNLSKNGIEFIDEHAFKGVAETLQTLDLSDNRIQ 

SVHKNAFNNLKARARIANNPWHCDCTLQQVLRSMASNHETAHNVICKTSVLDEHAGRPFLNAA 

NDADLCNLPKKTTDYAMLVTMFGWFTMVISYVVYYVRQNQEDARRHLEYLKSLPSRQKKADEP 
DDISTVV 

Important features: 
Signal sequence: 

Amino acids 1-33 

Transmembrane domain: 

Amino acids 204-219 

N-glycosylation sites: 
Amino acids 47-51; 94-98 

cAMP- and cGMP-dependent protein kinase phosphorylation site: 

Amino acids 199-203 

Casein kinase II phosphorylation site. 

amino acids 162-166, 175-179 



N-myristoylation sites: 

Amino acids 37-43; 45-51; 110-116 
FIGURE 305 

CGCCACCACTGCGGCCACCGCCAATSAAACGCCTCCCGCTCCTAGTGGTTTTTTCCACTTTGTTGAATTGTTCCT 
ATACTCAAAATTGCACCAAGACACCTTGTCTCCCAAATGCAAAATGTGAAATACGCAATGGAATTGAAGCCTGCT 
ATTGCAACATGGGATTTTCAGGAAATGGTGTCACAATTTGTGAAGATGATAATGAATGTGGAAATTTAACTCAGT 
CCTGTGGCGAAAATGCTAATTGCACTAACACAGAAGGAAGTTATTATTGTATGTGTGTACCTGGCTTCAGATCCA 
GCAGTAACCAAGACAGGTTTATCACTAATGATGGAACCGTCTGTATAGAAAATGTGAATGCAAACTGCCATTTAG 
ATAATGTCTGTATAGCTGCAAATATTAATAAAACTTTAACAAAAATCAGATCCATAAAAGAACCTGTGGCTTTGC 
TACAAGAAGTCTATAGAAATTCTGTGACAGATCTTTCACCAACAGATATAATTACATATATAGAAATATTAGCTG 
AATCATCTTCATTACTAGGTTACAAGAACAACACTATCTCAGCCAAGGACACCCTTTCTAACTCAACTCTTACTG 
AATTTGTAAAAACCGTGAATAATTTTGTTCAAAGGGATACATTTGTAGTTTGGGACAAGTTATCTGTGAATCATA 
GGAGAACACATCTTACAAAACTCATGCACACTGTTGAACAA6CTACTTTAAGGATATCCCAGAGCTTCCAAAAGA 
CCACAGAGTTTGATACAAATTCAACGGATATAGCTCTCAAAGTTTTCTTTTTTGATTCATATAACATGAAACATA 
TTCATCCTCATATGAATATGGATGGAGACTACATAAATATATTTCCAAAGAGAAAAGCTGCATATGATTCAAATG 
GCAATGTTGCAGTTGCATTTTTATATTATAAGAGTATTGGTCCTTTGCTTTCATCATCTGACAACTTCTTATTGA 
AACCTCAAAATTATGATAATTCTGAAGAGGAGGAAAGAGTCATATCTTCAGTAATTTCAGTCTCAATGAGCTCAA 

GTCTATGTGCATTTTGGAATTACTCACCTGATACCATGAATGGCAGCTGGTCTTCAGAGGGCTGTGAGCT6ACAT 

ACTCAAATGAGACCCACACCTCATGCCGCTGTAATCACCTGACACATTTTGCAATTTTGATGTCCTCTGGTCCTT 

CCATTGGTATTAAAGATTATAATATTCTTACAAGGATCACTCAACTAGGAATAATTATTTCACTGATTTGTCTTG 

^E^T^^S^^^^^'^^^^^'^^^^'^^^'^^^'^TC^^^'^^TTCAAAGCACCAGGACAACAATTC^^ 

GTAGCCTATTTCTTGCTGAACTTGTTTTTCTTGTTGGGATCAATACAAATACTAATAAGCTCTTCTGTTCAATCA 

TTGCCGGACTGCTACAGTACTTCTTTTTAGCTGCTTTTGCATGGATGTGCATTGAAGGCATACATCTCTATCTCA 

TTGTTGTGGGTGTCATCTACAACAAGGGATTTTTGCACAAGAATTTTTATATCTTTGGCTATCTAAGCCCAGCCG 

TGGTAGTTGGATTTTCGGCAGCACTAGGATACAGATATTATGGCACAACCAAAGTATGTTGGCTTAGCACCGAAA 

ACAACTTTATTTGGAGTTTTATAGGACCAGCATGCCTAATCATTCTTGTTAATCTCTTGGCTTTTGGAGTCATCA 

TATACAAAGTTTTTCGTCACACTGCAGGGTTGAAACCAGAAGTTAGTTGCTTTGAGAACATAAGGTCTTGTGCAA 

GAGGAGCCCTCGCTCTTCTGTTCCTTCTCGGCACCACCTGGATCTTTGGGGTTCTCCATGTTGTGCACGCATCAG 

TGGTTACAGCTTACCTCTTCACAGTCAGCAATGCTTTCCAGGGGATGTTCATTTTTTTATTCCTGTGTGTTTTAT 

CTAGAAAGATTCAAGAAGAATATTACAGATTGTTCAAAAATGTCCCCTGTTGTTTTGGATGTTTAAGGTAAACAT 

^^^^^'^^'^^^'^^^''^^^^^^'^^^^^^^T'^^^^TTCCAAGCTGTGGATGACCAATGTATAAAA^ 

CATCAAATTATCCAATTATTAACTACTAGACAAAAAGTATTTTAAATCAGTTTTTCTGTTTATGCTATAGGAACT 

^I^^^^^^^^^^^'^^^^^'^^'^'^^'^^'^^^A'^^'^^'^TATGTTTTTCTATGTGAAATA 

GTATTGCAGATATTTGGAAAGTAATTGGTTTCTCAGGAGTGATATCACTGCACCCAAGGAAAGATTTTCTTTCTA 

ACACGAGAAGTATATGAATGTCCTGAAGGAAACCACTGGCTTGATATTTCTGTGACTCGTGTTGCCTTTGAAACT 

^^!!S?S^'^^'^^'^^^^<='^GTAATGAGCTCCATTACAGAAAGTGGAACATAAGAGAATGAAGGGGCAGAATATCAAA 

CAGTGAAAAGGGAATGATAAGATGTATTTTGAATGAACTGTTTTTTCTGTAGACTAGCTGAGAAATTGTTGACAT 

AAAATAAAGAATTGAAGAAACACATTTTACCATTTTGTGAATTGTTCTGAACTTAAATGTCCACTAAAACAACTT 

^n^^n.I^^'^'^^^^^^^^^^^'^'^^^T'^'^^T^TATTCTAAAAAAAAAAAAAAGGTTTACCTCCAC^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 306 

MKRLPLLVVFSTLLNCSYTQNCTKTPCLPNAKCEIRNGIEACYCNMGFSGNGVTICEDDNECGNLTQSCGENANC 

TNTEGSYYCMCVPGFRSSSNQDRFITNDGTVCIENVNANCHLDNVCIAANINKTLTKIRSIKEPVALLQEVYRNS 

VTDLSPTDIITYIEILAESSSLLGYKNNTISAKDTLSNSTLTEFVKTVNNFVQRDTFVVWDKLSVNHRRTHLTKL 

MHTVEQATLRISQSFQKTTEFDTNSTDIALKVFFFDSYNMKHIHPHMNMDGDYINIFPKRKAAYDSNGNVAVAFL 

YYKSIGPLLSSSDNFLLKPQNYDNSEEEERVISSVISVSMSSNPPTLYELEKITFTLSHRKVTDRYRSLCAFWNY 

SPDTMNGSWSSEGCELTYSNETHTSCRCNHLTHFAILMSSGPSIGIKDYNILTRITQLGIIISLICLAICIFTFW 

FFSEIQSTRTTIHECNLCCSLFLAELVFLVGINTNTNKLFCSIIAGLLHYFFLAAFAWMCIEGIHLYLIVVGVIYN 

KGFLHKNFYIFGYLSPAVVVGFSAALGYRYYGTTKVCWLSTENNFIWSFIGPACLIILVNLLAFGVIIYKVFRHT 

AGLKPEVSCFENIRSCARGALALLFLLGTTWIFGVLHVVHASVVTAYLFTVSNAFQGMFIFLFLCVLSRKIQEEY 
YRLFKNVPCCFGCLR 

Important features: 
Signal peptide: 

Amino acids 1-19 

Transmembrane domain: 

Amino acids 431-450; 494-515; 573-594; 619-636; 646-664 
N-glycosylation sites: 

Amino acids 15-19;21-25; 64-68; 74-78; 127-131; 177-181; 
188-192;249-253;381-385; 395-399 

Glycosaminoglycan attachment site: 
Amino acids 49-53 

cAMP- and cGMP-dependent protein kinase phosphorylation site: 
Amino acids 360-364 

Tyrosine kinase phosphorylation sites: 

Amino acids 36-44; 670-677 

N-myristoylation sites: 

Amino acids 38-44;50-56;52-58;80-86; 382-388;388-394; 
434-440; 480-486; 521-527 



Aspartic acid and asparagine hydroxylation site: 
Amino acids 75-87 
FIGURE 307 



CCAGGCCGGGAGGCGACGCGCCCAGCCGTCTAAACGGGAACAGCCCTGGCTGAGGGAGCTGCAGCGCAGCAGAGT 

ATCTGACGGCGCCAGGTTGCGTAGGTGCGGCACGAGGAGTTTTCCCGGCAGCGAGGAGGTCCTGAGCAGCMSGC 

CCGGAGGAGCGCCTTCCCTGCCGCCGCGCTCTGGCTCTGGAGCATCCTCCTGTGCCTGCTGGCACTGCGGGCGGA 

GGCCGGGCCGCCGCAGGAGGAGAGCCTGTACCTATGGATCGATGCTCACCAGGCAAGAGTACTCATAGGATTTGA 

AGAAGATATCCTGATTGTTTCAGAGGGGAAAATGGCACCTTTTACACATGATTTCAGAAAAGCGCAACAGAGAAT 

GCCAGCTATTCCTGTCAATATCCATTCCATGAATTTTACCTGGCAAGCTGCAGGGCAGGCAGAATACTTCTATGA 

ATTCCTGTCCTTGCGCTCCCTGGATAAAGGCATCATGGCAGATCCAACCGTCAATGTCCCTCTGCTGGGAACAGT 

GCCTCACAAGGCATCAGTTGTTCAAGTTGGTTTCCCATGTCTTGGAAAACAGGATGGGGTGGCAGCATTTGAAGT 

GGATGTGATTGTTATGAATTCTGAAGGCAACACCATTCTCCAAACACCTCAAAATGCTATCTTCTTTAAAACATG 

TCAACAAGCTGAGTGCCCAGGCGGGTGCCGAAATGGAGGCTTTTGTAATGAAAGACGCATCTGCGAGTGTCCTGA 

TGGGTTCCACGGACCTCACTGTGAGAAAGCCCTTTGTACCCCACGATGTATGAATGGTGGACTTTGTGTGACTCC 

TGGTTTCTGCATCTGCCCACCTGGATTCTATGGAGTGAACTGTGACAAAGCAAACTGCTCAACCACCTGCTTTAA 

TGGAGGGACCTGTTTCTACCCTGGAAAATGTATTTGCCCTCCAGGACTAGAGGGAGAGCAGTGTGAAATCAGCAA 

ATGCCCACAACCCTGTCGAAATGGAGGTAAATGCATTGGTAAAAGCAAATGTAAGTGTTCCAAAGGTTACCAGGG

AGACCTCTGTTCAAAGCCTGTCTGCGAGCCTGGCTGTGGTGCACATGGAACCTGCCATGAACCCAACAAATGCCA 

ATGTCAAGAAGGTTGGCATGGAAGACACTGCAATAAAAGGTACGAAGCCAGCCTCATACATGCCCTGAGGCCAGC 

AGGCGCCCAGCTCAGGCAGCACACGCCTTCACTTAAAAAGGCCGAGGAGCGGCGGGATCCACCTGAATCCAATTA 

CATCTGGSGAACTCCGACATCTGAAACGTTTTAAGTTACACCAAGTTCATAGCCTTTGTTAACCT^ 

GAATGTTCAAATAATGTTCATTACACTTAAGAATACTGGCCTGAATTTTATTAGCTTCATTATAAATCACTGAGC 

TGATATTTACTCTTCCTTTTAAGTTTTCTAAGTACGTCTGTAGCATGATGGTATAGATTTTCTTGTTTCAGTGCT 

TTGGGACAGATTTTATATTATGTCAATTGATCAGGTTAAAATTTTCAGTGTGTAGTTGGCAGATATTTTCAAAAT 

TACAATGCATTTATGGTGTCTGGGGGCAGGGGAACATCAGAAAGGTTAAATTGGGCAAAAATGCGTAAGTCACAA 

GAATTTGGATGGTGCAGTTAATGTTGAAGTTACAGCATTTCAGATTTTATTGTCAGATATTTAGATGTTTGTTAC 

ATTTTTAAAAATTGCTCTTAATTTTTAAACTCTCAATACAATATATTTTGACCTTACCATTATTCCAGAGATTCA 

GTATTAAAAAAAAAAAAATTACACTGTGGTAGTGGCATTTAAACAATATAATATATTCTAAACACAATGAAATAG 

GGAATATAATGTATGAACTTTTTGCATTGGCTTGAAGCAATATAATATATTGTAAACAAAACACAGCTCTTACCT 

AATAAACATTTTATACTGTTTGTATGTATAAAATAAAGGTGCTGCTTTAGTTTTTTGGAAAAAAAAAAAAAAAAA 



AAAAAAAA 
FIGURE 308



MARRSAFPAAALWLWSILLCLLALRAEAGPPQEESLYLWIDAHQARVLIGFEEDILIVSEGKM 
APFTHDFRKAQQRMPAIPVNIHSMNFTWQAAGQAEYFYEFLSLRSLDKGIMADPTVNVPLLGT 
VPHKASVVQVGFPCLGKQDGVAAFEVDVIVMNSEGNTILQTPQNAIFFKTCQQAECPGGCRNG 
GFCNERRICECPDGFHGPHCEKALCTPRCMNGGLCVTPGFCICPPGFYGVNCDKANCSTTCFN
GGTCFYPGKCICPPGLEGEQCEISKCPQPCRNGGKCIGKSKCKCSKGYQGDLCSKPVCEPGCG 
AHGTCHEPNKCQCQEGWHGRHCNKRYEASLIHALRPAGAQLRQHTPSLKKAEERRDPPESNYIW 

Important features: 
Signal sequence: 

Amino acids 1-28 

N-glycosylation sites: 

Amino acids 88-92; 245-249 

Tyrosine kinase phosphorylation site: 

Amino acids 370-378 

N-myristoylation sites:

Amino acids 184-190; 185-191; 189-195; 315-321

ATP/GTP-binding site motif A (P-loop):

Amino acids 285-293



EGF-like domain cysteine pattern signatures:

Amino acids 198-210; 230-242; 262-274; 294-306; 326-338
FIGURE 309



CCCACGCGTCCGGTCTCGCTCGCTCGCGCAGCGGCGGCAGCAGAGGTCGCGCACAGATGCGGG 

TTAGACTGGCGGGGGGAGGAGGCGGAGGAGGGAAGGAAGCTGCATGCATGAGACCCACAGACT 

CTTGCAAGCTGGATGCCCTCTGTGGATGAAAGATGTATCATGGAATGAACCCGAGCAATGGAG 

ATGGATTTCTAGAGCAGCAGCAGCAGCAGCAGCAACCTCAGTCCCCCCAGAGACTCTTGGCCG 

ISoZ^S'^^'^^^^^'^^^^^'^^^^^^'rG'^GCTTCGGCCCTGCACAGCTCACGGGCGGGTTCGATG 

JSSnSSS^'^^'^^^^^^^^^^^^^^^^T^^CCG^GAATGGCTTCAGGACCCCCAGCGGAGGGG 

^!nnIJr^*^^^^^''^^^^^^^^^^^'^^^CTGCCAAGACGGATTCAAGCTGAAGGGCGCTA 

CAAAGAGACTGTGTTTGAAGCATTTTAATGGAACCCTAGGCTGGATCCCAAGTGATAATTCCA 

TCTGTGTGCAAGAAGATTGCCGTATCCCTCAAATCGAAGATGCTGAGATTCATAACAAGACAT 

ATAGACATGGAGAGAAGCTAATCATCACTTGTCATGAAGGATTCAAGATCCGGTACCCCGACC 

TACACAATATGGTTTCATTATGTCGCGATGATGGAACGTGGAATAATCTGCCCATCTGTCAAG 

GCTGCCTGAGACCTCTAGCCTCTTCTAATGGCTATGTAAACATCTCTGAGCTCCAGACCTCCT 

TCCCGGTGGGGACTGTGATCTCCTATCGCTGCTTTCCCGGATTTAAACTTGATGGGTCTGCGT 

ATCTTGAGTGCTTACAAAACCTTATCTGGTCGTCCAGCCCACCCCGGTGCCTTGCTCTGGAAG 

CCCAAGTCTGTCCACTACCTCCAATGGTGAGTCACGGAGATTTCGTCTGCCACCCGCGGCCTT 

GTGAGCGCTACAACCACGGAACTGTGGTGGAGTTTTACTGCGATCCTGGCTACAGCCTCACCA 

GCGACTACAAGTACATCACCTGCCAGTATGGAGAGTGGTTTCCTTCTTATCAAGTCTACTGCA 

TCAAATCAGAGCAAACGTGGCCCAGCACCCATGAGACCCTCCTGACCACGTGGAAGATTGTGG 

CGTTCACGGCAACCAGTGTGCTGCTGGTGCTGCTGCTCGTCATCCTGGCCAGGATGTTCCAGA 

CCAAGTTCAAGGCCCACTTTCCCCCCAGGGGGCCTCCCCGGAGTTCCAGCAGTGACCCTGACT 

TTGTGGTGGTAGACGGCGTGCCCGTCATGCTCCCGTCCTATGACGAAGCTGTGAGTGGCGGCT 

TGAGTGCCTTAGGCCCCGGGTACATGGCCTCTGTGGGCCAGGGCTGCCCCTTACCCGTGGACG 

JSSS^^^o^^^^^^^'^^^^^^^'^^^^^^^GG^^^ACGGACACAGGCCCAGGGGAGTCAGAAA 

CCTGTGACAGCGTCTCAGGCTCTTCTGAGCTGCTCCAAAGTCTGTATTCACCTCCCAGGTGCC 

AAGAGAGCACCCACCCTGCTTCGGACAACCCTGACATAATTGCCAGCACGGCAGAGGAGGTGG 

CATCCACCAGCCCAGGCATCCATCATGCCCACTGGGTGTTGTTCCTAAGAAACTGATTGATTA

AAAAATTTCCCAAAGTGTCCTGAAGTGTCTCTTCAAATACATGTTGATCTGTGGAGTTGATTC 

CTTTCCTTCTCTTGGTTTTAGACAAATGTAAACAAAGCTCTGATCCTTAAAATTGCTATGCTG 

ATAGAGTGGTGAGGGCTGGAAGCTTGATCAAGTCCTGTTTCTTCTTGACACAGACTGATTAAA 
FIGURE 310

MYHGMNPSNGDGFLEQQQQQQQPQSPQRLLAVILWFQLALCFGPAQLTGGFDDLQVCADPGIP 
ENGFRTPSGGVFFEGSVARFHCQDGFKLKGATKRLCLKHFNGTLGWIPSDNSICVQEDCRIPQ 
IEDAEIHNKTYRHGEKLIITCHEGFKIRYPDLHNMVSLCRDDGTWNNLPICQGCLRPLASSNG
YVNISELQTSFPVGTVISYRCFPGFKLDGSAYLECLQNLIWSSSPPRCLALEAQVCPLPPMVS
HGDFVCHPRPCERYNHGTVVEFYCDPGYSLTSDYKYITCQYGEWFPSYQVYCIKSEQTWPSTH
ETLLTTWKIVAFTATSVLLVLLLVILARMFQTKFKAHFPPRGPPRSSSSDPDFVVVDGVPVML
PSYDEAVSGGLSALGPGYMASVGQGCPLPVDDQSPPAYPGSGDTDTGPGESETCDSVSGSSEL 
LQSLYSPPRCQESTHPASDNPDIIASTAEEVASTSPGIHHAHWVLFLRN 

Important features: 
Signal sequence: 

amino acids 1-41 

Transmembrane domain: 

amino acids 325-344 

N-glycosylation site. 

amino acids 104-108, 134-138, 192-196 
Casein kinase II phosphorylation site. 

amino acids 8-12, 146-150, 252-256, 270-274, 313-317, 362-366, 
364-368, 380-384, 467-471, 468-472 

N-myristoylation site. 

amino acids 4-10, 61-67, 169-175, 203-209, 387-393, 418-424,
478-484 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 394-405 
FIGURE 311

CAGCGCGTGGCCGGCGCCGCTGTGGGGACAGCATGAGCGGCGGTTGGATGGCGCAGGTTGGAG

CGTGGCGAACAGGGGCTCTGGGCCTGGCGCTGCTGCTGCTGCTCGGCCTCGGACTAGGCCTGG 

AGGCCGCCGCGAGCCCGCTTTCCACCCCGACCTCTGCCCAGGCCGCAGGCCCCAGCTCAGGCT 

CGTGCCCACCCACCAAGTTCCAGTGCCGCACCAGTGGCTTATGCGTGCCCCTCACCTGGCGCT 

GCGACAGGGACTTGGACTGCAGCGATGGCAGCGATGAGGAGGAGTGCAGGATTGAGCCATGTA 

CCCAGAAAGGGCAATGCCCACCGCCCCCTGGCCTCCCCTGCCCCTGCACCGGCGTCAGTGACT 

GCTCTGGGGGAACTGACAAGAAACTGCGCAACTGCAGCCGCCTGGCCTGCCTAGCAGGCGAGC 

TCCGTTGCACGCTGAGCGATGACTGCATTCCACTCACGTGGCGCTGCGACGGCCACCCAGACT 

GTCCCGACTCCAGCGACGAGCTCGGCTGTGGAACCAATGAGATCCTCCCGGAAGGGGATGCCA 

CAACCATGGGGCCCCCTGTGACCCTGGAGAGTGTCACCTCTCTCAGGAATGCCACAACCATGG 

GGCCCCCTGTGACCCTGGAGAGTGTCCCCTCTGTCGGGAATGCCACATCCTCCTCTGCCGGAG 

ACCAGTCTGGAAGCCCAACTGCCTATGGGGTTATTGCAGCTGCTGCGGTGCTCAGTGCAAGCC 

TGGTCACCGCCACCCTCCTCCTTTTGTCCTGGCTCCGAGCCCAGGAGCGCCTCCGCCCACTGG 

GGTTACTGGTGGCCATGAAGGAGTCCCTGCTGCTGTCAGAACAGAAGACCTCGCTGCCCTGAG 

GACAAGCACTTGCCACCACCGTCACTCAGCCCTGGGCGTAGCCGGACAGGAGGAGAGCAGTGA 

TGCGGATGGGTACCCGGGCACACCAGCCCTCAGAGACCTGAGTTCTTCTGGCCACGTGGAACC 

TCGAACCCGAGCTCCTGCAGAAGTGGCCCTGGAGATTGAGGGTCCCTGGACACTCCCTATGGA 

GATCCGGGGAGCTAGGATGGGGAACCTGCCACAGCCAGAACTGAGGGGCTGGCCCCAGGCAGC 

TCCCAGGGGGTAGAACGGCCCTGTGCTTAAGACACTCCCTGCTGCCCCGTCTGAGGGTGGCGA 
TTAAAGTTGCTTC 
FIGURE 312

MSGGWMAQVGAWRTGALGLALLLLLGLGLGLEAAASPLSTPTSAQAAGPSSGSCPPTKFQCRT 
SGLCVPLTWRCDRDLDCSDGSDEEECRIEPCTQKGQCPPPPGLPCPCTGVSDCSGGTDKKLRN 
CSRLACLAGELRCTLSDDCIPLTWRCDGHPDCPDSSDELGCGTNEILPEGDATTMGPPVTLES 
VTSLRNATTMGPPVTLESVPSVGNATSSSAGDQSGSPTAYGVIAAAAVLSASLVTATLLLLSW 
LRAQERLRPLGLLVAMKESLLLSEQKTSLP 

Important features: 
Signal sequence: 

Amino acids 1-30 

Transmembrane domain: 

Amino acids 231-248 

N-glycosylation sites: 

Amino acids 126-130; 195-199; 213-217 

Casein kinase II phosphorylation site. 

amino acids 84-88, 140-144, 161-165, 218-222. 

N-myristoylation sites: 

Amino acids 3-9; 10-16; 26-32; 30-36; 112-118; 166-172; 212-218;
224-230; 230-236; 263-269

Prokaryotic membrane lipoprotein lipid attachment site: 

Amino acids 44-55 



Leucine zipper pattern: 

Amino acids 17-39 
FIGURE 313



CGGACGCGTGGGCGTCCGGCGGTCGCAGAGCCAGGAGGCGGAGGCGCGCGGGCCAGCCTGGGCCCCAGCCCACAC 

CTTCACCAGGGCCCAGGAGCCACCATCTGGCGATGTCCACTGGGGCTACTGCTGTTGCTGCCGCTGGCTGGCCAC 

TTGGCTCTGGGTGCCCAGCAGGGTCGTGGGCGCCGGGAGCTAGCACCGGGTCTGCACCTGCGGGGCATCCGGGAC 

GCGGGAGGCCGGTACTGCCAGGAGCAGGACCTGTGCTGCCGCGGCCGTGCCGACGACTGTGCCCTGCCCTACCTG 

GGCGCCATCTGTTACTGTGACCTCTTCTGCAACCGCACGGTCTCCGACTGCTGCCCTGACTTCTGGGACTTCTGC 

CTCGGCGTGCCACCCCCTTTTCCCCCGATCCAAGGATGTATGCATGGAGGTCGTATCTATCCAGTCTTGGGAACG 

TACTGGGACAACTGTAACCGTTGCACCTGCCAGGAGAACAGGCAGTGGCATGGTGGATCCAGACATGATCAAAGC

CATCAACCAGGGCAACTATGGCTGGCAGGCTGGGAACCACAGCGCCTTCTGGGGCATGACCCTGGATGAGGGCAT 

TCGCTACCGCCTGGGCACCATCCGCCCATCTTCCTCGGTCATGAACATGCATGAAATTTATACAGTGCTGAACCC

AGGGGAGGTGCTTCCCACAGCCTTCGAGGCCTCTGAGAAGTGGCCCAACCTGATTCATGAGCCTCTTGACCAAGG 

CAACTGTGCAGGCTCCTGGGCCTTCTCCACAGCAGCTGTGGCATCCGATCGTGTCTCAATCCATTCTCTGGGACA 

CATGACGCCTGTCCTGTCGCCCCAGAACCTGCTGTCTTGTGACACCCACCAGCAGCAGGGCTGCCGCGGTGGGCG 

TCTCGATGGTGCCTGGTGGTTCCTGCGTCGCCGAGGGGTGGTGTCTGACCACTGCTACCCCTTCTCGGGCCGTGA

ACGAGACGAGGCTGGCCCTGCGCCCCCCTGTATGATGCACAGCCGAGCCATGGGTCGGGGCAAGCGCCAGGCCAC 

TGCCCACTGCCCCAACAGCTATGTTAATAACAATGACATCTACCAGGTCACTCCTGTCTACCGCCTCGGCTCCAA 

CGACAAGGAGATCATGAAGGAGCTGATGGAGAATGGCCCTGTCCAAGCCCTCATGGAGGTGCATGAGGACTTCTT 

CCTATACAAGGGAGGCATCTACAGCCACACGCCAGTGAGCCTTGGGAGGCCAGAGAGATACCGCCGGCATGGGAC 

CCACTCAGTCAAGATCACAGGATGGGGAGAGGAGACGCTGCCAGATGGAAGGACGCTCAAATACTGGACTGCGGC 

CAACTCCTGGGGCCCAGCCTGGGGCGAGAGGGGCCACTTCCGCATCGTGCGCGGCGTCAATGAGTGCGACATCGA 

GAGCTTCGTGCTGGGCGTCTGGGGCCGCGTGGGCATGGAGGACATGGGTCATCACTGAGGCTGCGGGCACCACGC 

GGGGTCCGGCCTGGGATCCAGGCTAAGGGCCGGCGGAAGAGGCCCCAATGGGGCGGTGACCCCAGCCTCGCCCGA 

CAGAGCCCGGGGCGCAGGCGGGCGCCAGGGCGCTAATCCCGGCGCGGGTTCCGCTGACGCAGCGCCCCGCCTGGG 

AGCCGCGGGCAGGCGAGACTGGCGGAGCCCCCAGACCTCCCAGTGGGGACGGGGCAGGGCCTGGCCTGGGAAGAG 

CACAGCTGCAGATCCCAGGCCTCTGGCGCCCCCACTCAAGACTACOUU^GCCAGGACACCTCAAGTCTCCAGCCC 

CAATACCCCACCCCAATCCCGTATTCTTTTTTTTTTTTTTTTAGACAGGGTCTTGCTCCGTTGCCCAGGTTGGAG 

TGCAGTGGCCCATCAGGGCTCACTGTAACCTCCGACTCCTGGGTTCAAGTGACCCTCCCACCTCAGCCTCTCAAG 

TAGCTGGGACTACAGGTGCACCACCACACCTGGCTAATTTTTGTATTTTTTGTAAAGAGGGGGGTCTCACTGTGT 

TGCCCAGGCTGGTTTCGAACTCCTGGGCTCAAGCGGTCCACCTGCCTCCGCCTCCCAAAGTGCTGGGATTGCAGG 

CATGAGCCACTGCACCCAGCCCTGTATTCTTATTCTTCAGATATTTATTTTTCTTTTCACTGTTTTAAAATAAAA 
CCAAAGTATTGATAAAAAAAAA 
FIGURE 314



MWRCPLGLLLLLPLAGHLALGAQQGRGRRELAPGLHLRGIRDAGGRYCQEQDLCCRGRADDCA 
LPYLGAICYCDLFCNRTVSDCCPDFWDFCLGVPPPFPPIQGCMHGGRIYPVLGTYWDNCNRCT 
CQENRQWHGGSRHDQSHQPGQLWLAGWEPQRLLGHDPG 

Important features:
N-glycosylation site.
amino acids 78-82, 161-165

Casein kinase II phosphorylation site.

amino acids 80-84, 117-121, 126-130, 169-173, 205-209, 296-300,
411-415

N-myristoylation site.

amino acids 21-27, 39-45, 44-50, 104-110, 160-164, 224-230,
269-275, 378-384, 442-448

Amidation site. 

amino acids 26-30, 318-322 

Eukaryotic thiol (cysteine) proteases histidine active site. 

amino acids 398-409 
FIGURE 315



CGGACGCGTGGGCCCCTGGTGGGCCCAGCAAGATGGATCTACTGTGGATCCTGCCCTCCCTGT

GGCTTCTCCTGCTTGGGGGGCCTGCCTGCCTGAAGACCCAGGAACACCCCAGCTGCCCAGGAC 

CCAGGGAACTGGAAGCCAGCAAAGTTGTCCTCCTGCCCAGTTGTCCCGGAGCTCCAGGAAGTC 

CTGGGGAGAAGGGAGCCCCAGGTCCTCAAGGGCCACCTGGACCACCAGGCAAGATGGGCCCCA 

AGGGTGAGCCAGGCCCCAGAAACTGCCGGGAGCTGTTGAGCCAGGGCGCCACCTTGAGCGGCT 

GGTACCATCTGTGCCTACCTGAGGGCAGGGCCCTCCCAGTCTTTTGTGACATGGACACCGAGG 

GGGGCGGCTGGCTGGTGTTTCAGAGGCGCCAGGATGGTTCTGTGGATTTCTTCCGCTCTTGGT 

CCTCCTACAGAGCAGGTTTTGGGAACCAAGAGTCTGAATTCTGGCTGGGAAATGAGAATTTGC 

ACCAGCTTACTCTCCAGGGTAACTGGGAGCTGCGGGTAGAGCTGGAAGACTTTAATGGTAACC 

GTACTTTCGCCCACTATGCGACCTTCCGCCTCCTCGGTGAGGTAGACCACTACCAGCTGGCAC 

TGGGCAAGTTCTCAGAGGGCACTGCAGGGGATTCCCTGAGCCTCCACAGTGGGAGGCCCTTTA 

CCACCTATGACGCTGACCACGATTCAAGCAACAGCAACTGTGCAGTGATTGTCCACGGTGCCT 

GGTGGTATGCATCCTGTTACCGATCAAATCTCAATGGTCGCTATGCAGTGTCTGAGGCTGCCG 

CCCACAAATATGGCATTGACTGGGCCTCAGGCCGTGGTGTGGGCCACCCCTACCGCAGGGTTC 

GGATGATGCTTCGATASGGCACTCTGGCAGCCAGTGCCCTTATCTCTCCTGTACAGCTTCCGG 

ATCGTCAGCCACCTTGCCTTTGCCAACCACCTCTGCTTGCCTGTCCACATTTAAAAATAAAAT 
CATTTTAGCCCTTTCA 




FIGURE 316 

MDLLWILPSLWLLLLGGPACLKTQEHPSCPGPRELEASKVVLLPSCPGAPGSPGEKGAPGPQG 
PPGPPGKMGPKGEPGPRNCRELLSQGATLSGWYHLCLPEGRALPVFCDMDTEGGGWLVFQRRQ 
DGSVDFFRSWSSYRAGFGNQESEFWLGNENLHQLTLQGNWELRVELEDFNGNRTFAHYATFRL 
LGEVDHYQLALGKFSEGTAGDSLSLHSGRPFTTYDADHDSSNSNCAVIVHGAWWYASCYRSNL 
NGRYAVSEAAAHKYGIDWASGRGVGHPYRRVRMMLR 

Important features:
Signal peptide:

Amino acids 1-16

N-glycosylation site:

Amino acids 178-182

Glycosaminoglycan attachment site:

Amino acids 272-276

Tyrosine kinase phosphorylation site:

Amino acids 188-197

N-myristoylation sites:

Amino acids 16-22; 89-95; 144-150; 267-273

Fibrinogen beta and gamma chains C-terminal domain signature:

Amino acids 242-255
FIGURE 317



CCCAAGCCAGCCGAGCCGCCAGAGCCGCGGGCCGCGGGGGTGTCGCGGGCCCAACCCCAGGAT 

GCTCCCCTGCGCCTCCTGCCTACCCGGGTCTCTACTGCTCTGGGCGCTGCTACTGTTGCTCTT

GGGATCAGCTTCTCCTCAGGATTCTGAAGAGCCCGACAGCTACACGGAATGCACAGATGGCTA 

TGAGTGGGACCCAGACAGCCAGCACTGCCGGGATGTCAACGAGTGTCTGACCATCCCTGAGGC 

CTGCAAGGGGGAAATGAAGTGCATCAACCACTACGGGGGCTACTTGTGCCTGCCCCGCTCCGC 

TGCCGTCATCAACGACCTACATGGCGAGGGACCCCCGCCACCAGTGCCTCCCGCTCAACACCC 

CAACCCCTGCCCACCAGGCTATGAGCCCGACGATCAGGACAGCTGTGTGGATGTGGACGAGTG 

TGCCCAGGCCCTGCACGACTGTCGCCCCAGCCAGGACTGCCATAACTTGCCTGGCTCCTATCA

GTGCACCTGCCCTGATGGTTACCGCAAGATCGGGCCCGAGTGTGTGGACATAGACGAGTGCCG

CTACCGCTACTGCCAGCACCGCTGCGTGAACCTGCCTGGCTCCTTCCGCTGCCAGTGCGAGCC 

GGGCTTCCAGCTGGGGCCTAACAACCGCTCCTGTGTTGATGTGAACGAGTGTGACATGGGGGC 

CCCATGCGAGCAGCGCTGCTTCAACTCCTATGGGACCTTCCTGTGTCGCTGCCACCAGGGCTA 

TGAGCTGCATCGGGATGGCTTCTCCTGCAGTGATATTGATGAGTGTAGCTACTCCAGCTACCT 

CTGTCAGTACCGCTGCGTCAACGAGCCAGGCCGTTTCTCCTGCCACTGCCCACAGGGTTACCA 

GCTGCTGGCCACACGCCTCTGCCAAGACATTGATGAGTGTGAGTCTGGTGCGCACCAGTGCTC 

CGAGGCCCAAACCTGTGTCAACTTCCATGGGGGCTACCGCTGCGTGGACACCAACCGCTGCGT 

GGAGCCCTACATCCAGGTCTCTGAGAACCGCTGTCTCTGCCCGGCCTCCAACCCTCTATGTCG 

AGAGCAGCCTTCATCCATTGTGCACCGCTACATGACCATCACCTCGGAGCGGAGCGTGCCCGC 

TGACGTGTTCCAGATCCAGGCGACCTCCGTCTACCCCGGTGCCTACAATGCCTTTCAGATCCG 

TGCTGGAAACTCGCAGGGGGACTTTTACATTAGGCAAATCAACAACGTCAGCGCCATGCTGGT 

CCTCGCCCGGCCGGTGACGGGCCCCCGGGAGTACGTGCTGGACCTGGAGATGGTCACCATGAA 

TTCCCTCATGAGCTACCGGGCCAGCTCTGTACTGAGGCTCACCGTCTTTGTAGGGGCCTACAC 

CTTCTSaGGAGCAGGAGGGAGCCACCCTCCCTGCAGCTACCCTAGCTGAGGAGCCTGTTGTGA 

GGGGCAGAATGAGAAAGGCAATAAAGGGAGAAAGAAAGTCCTGGTGGCTGAGGTGGGCGGGTC 

ACACTGCAGGAAGCCTCAGGCTGGGGCAGGGTGGCACTTGGGGGGGCAGGCCAAGTTCACCTA 

AATGGGGGTCTCTATATGTTCAGGCCCAGGGGCCCCCATTGACAGGAGCTGGGAGCTCTGCAC 

CACGAGCTTCAGTCACCCCGAGAGGAGAGGAGGTAACGAGGAGGGCGGACTCCAGGCCCCGGC 

CCAGAGATTTGGACTTGGCTGGCTTGCAGGGGTCCTAAGAAACTCCACTCTGGACAGCGCCAG 

GAGGCCCTGGGTTCCATTCCTAACTCTGCCTCAAACTGTACATTTGGATAAGCCCTAGTAGTT 

CCCTGGGCCTGTTTTTGTATAAAACGAGGCAACTGGAAAAAAAAAAAA 
FIGURE 318



MLPCASCLPGSLLLWALLLLLLGSASPQDSEEPDSYTECTDGYEWDPDSQHCRDVNECLTIPE 
ACKGEMKCINHYGGYLCLPRSAAVINDLHGEGPPPPVPPAQHPNPCPPGYEPDDQDSCVDVDE 
CAQALHDCRPSQDCHNLPGSYQCTCPDGYRKIGPECVDIDECRYRYCQHRCVNLPGSFRCQCE 
PGFQLGPNNRSCVDVNECDMGAPCEQRCFNSYGTFLCRCHQGYELHRDGFSCSDIDECSYSSY 
LCQYRCVNEPGRFSCHCPQGYQLLATRLCQDIDECESGAHQCSEAQTCVNFHGGYRCVDTNRC 
VEPYIQVSENRCLCPASNPLCREQPSSIVHRYMTITSERSVPADVFQIQATSVYPGAYNAFQI 
RAGNSQGDFYIRQINNVSAMLVLARPVTGPREYVLDLEMVTMNSLMSYRASSVLRLTVFVGAYTF 

Important features:
Signal sequence:

Amino acids 1-25

N-glycosylation sites:

Amino acids 198-202; 394-398

N-myristoylation sites:

Amino acids 76-82; 145-151; 182-188; 222-228; 290-296; 305-311;
371-377; 381-387

Aspartic acid and asparagine hydroxylation sites:

Amino acids 140-152; 177-189; 217-229; 258-270
FIGURE 319



GCTGGGGACATGAGAGGCACACCGAAGACCCACCTCCTGGCCTTCTCCCTCCTCTGCCTCCTC 

TCAAAGGTGCGTACCCAGCTGTGCCCGACACCATGTACCTGCCCCTGGCCACCTCCCCGATGC 

CCGCTGGGAGTACCCCTGGTGCTGGATGGCTGTGGCTGCTGCCGGGTATGTGCACGGCGGCTG 

GGGGAGCCCTGCGACCAACTCCACGTCTGCGACGCCAGCCAGGGCCTGGTCTGCCAGCCCGGG 

GCAGGACCCGGTGGCCGGGGGGCCCTGTGCCTCTTGGCAGAGGACGACAGCAGCTGTGAGGTG 

AACGGCCGCCTGTATCGGGAAGGGGAGACCTTCCAGCCCCACTGCAGCATCCGCTGCCGCTGC 

GAGGACGGCGGCTTCACCTGCGTGCCGCTGTGCAGCGAGGATGTGCGGCTGCCCAGCTGGGAC 

TGCCCCCACCCCAGGAGGGTCGAGGTCCTGGGCAAGTGCTGCCCTGAGTGGGTGTGCGGCCAA 

GGAGGGGGACTGGGGACCCAGCCCCTTCCAGCCCAAGGACCCCAGTTTTCTGGCCTTGTCTCT 

TCCCTGCCCCCTGGTGTCCCCTGCCCAGAATGGAGCACGGCCTGGGGACCCTGCTCGACCACC 

TGTGGGCTGGGCATGGCCACCCGGGTGTCCAACCAGAACCGCTTCTGCCGACTGGAGACCCAG 

CGCCGCCTGTGCCTGTCCAGGCCCTGCCCACCCTCCAGGGGTCGCAGTCCACAAAACAGTGCC 

TTC!^AGCCGGGCTGGGAATGGGGACACGGTGTCCACCATCCCCAGCTGGTGGCCCTGTGCC 

TGGGCCCTGGGCTGATGGAAGATGGTCCGTGCCCAGGCCCTTGGCTGCAGGCAACACTTTAGC 

TTGGGTCCACCATGCAGAACACCAATATTAACACGCTGCCTGGTCTGTCTGGATCCCGAGGTA 

TGGCAGAGGTGCAAGACCTAGTCCCCTTTCCTCTAACTCACTGCCTAGGAGGCTGGCCAAGGT 

GTCCAGGGTCCTCTAGCCCACTCCCTGCCTACACACACAGCCTATATCAAACATGCACACGGG 

CGAGCTTTCTCTCCGACTTCCCCTGGGCAAGAGATGGGACAAGCAGTCCCTTAATATTGAGGC 

TGCAGCAGGTGCTGGGCTGGACTGGCCATTTTTCTGGGGGTAGGATGAAGAGAAGGCACACAG 

AGATTCTGGATCTCCTGCTGCCTTTTCTGGAGTTTGTAAAATTGTTCCTGAATACAAGCCTAT 
GCGTGA 
FIGURE 320



MRGTPKTHLLAFSLLCLLSKVRTQLCPTPCTCPWPPPRCPLGVPLVLDGCGCCRVCARRLGEP 
CDQLHVCDASQGLVCQPGAGPGGRGALCLLAEDDSSCEVNGRLYREGETFQPHCSIRCRCEDG 
GFTCVPLCSEDVRLPSWDCPHPRRVEVLGKCCPEWVCGQGGGLGTQPLPAQGPQFSGLVSSLP 
PGVPCPEWSTAWGPCSTTCGLGMATRVSNQNRFCRLETQRRLCLSRPCPPSRGRSPQNSAF 

Important features:

Signal sequence:
Amino acids 1-23

N-myristoylation sites:

Amino acids 3-9; 49-55; 81-87; 85-91; 126-132; 164-170; 166-172;
167-173; 183-189; 209-215

Insulin-like growth factor binding proteins signature:

Amino acids 49-65

von Willebrand C1 domain:

Amino acids 107-124

Thrombospondin 1 Homology Block:

Amino acids 201-216



IGF binding protein site:
Amino acids 49-58
FIGURE 321



AGAACCTCAGAAATGTGAGTTATTTGGGAATGGCTGTTTGTAAATGTCCTTACGTAAGCCAAG 
AGGAGGTCTTGACTTGGGGTCCCAGGGGTACCGCAGATCCCAGGGACTGGAGCAGCACTAGCA 
AGCTCTGGAGGATGAGCCAGGAGTCTGGAATTGAGGCTGAGCCAAAGACCCCAGGGCCGTCTC
AGTCTCATAAAAGGGGATCAGGCAGGAGGAGTTTGGGAGAAACCTGAGAAGGGCCTGATTTGC
AGCATCATGATGGGCCTCTCCTTGGCCTCTGCTGTGCTCCTGGCCTCCCTCCTGAGTCTCCAC
CTTGGAACTGCCACACGTGGGAGTGACATATCCAAGACCTGCTGCTTCCAATACAGCCACAAG 
CCCCTTCCCTGGACCTGGGTGCGAAGCTATGAATTCACCAGTAACAGCTGCTCCCAGCGGGCT 
GTGATATTCACTACCAAAAGAGGCAAGAAAGTCTGTACCCATCCAAGGAAAAAATGGGTGCAA 
AAATACATTTCTTTACTGAAAACTCCGAAACAATTGTGACTCAGCTGAATTTTCATCCGAGGA 
CGCTTGGACCCCGCTCTTGGCTCTGCAGCCCTCTGGGGAGCCTGCGGAATCTTTTCTGAAGGC 
TACATGGACCCGCTGGGGAGGAGAGGGTGTTTCCTCCCAGAGTTACTTTAATAAAGGTTGTTC 
ATAGAGTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 322



MMGLSLASAVLLASLLSLHLGTATRGSDISKTCCFQYSHKPLPWTWVRSYEFTSNSCSQRAVI 
FTTKRGKKVCTHPRKKWVQKYISLLKTPKQL 

Important features: 
Signal peptide: 
amino acids 1-23 

N-myristoylation sites. 

amino acids 3-9, 26-32 

Amidation site. 

amino acids 68-72 

Small cytokines (intercrine/chemokine).

amino acids 23-88 
FIGURE 323

ACCGAGCCGAGCGGACCGAAGGCGCGCCCGAGATGCAGGTGAGCAAGAGGATGCTGGCGGGGGGCGTGAGGAGCA 

TGCCCAGCCCCCTCCTGGCCTGCTGGCAGCCCATCCTCCTGCTGGTGCTGGGCTCAGTGCTGTCAGGCTCGGCCA 

CGGGCTGCCCGCCCCGCTGCGAGTGCTCCGCCCAGGACCGCGCTGTGCTGTGCCACCGCAAGTGCTTTGTGGCAG 

TCCCCGAGGGCATCCCCACCGAGACGGGCCTGCTGGACCTAGGCAAGAACCGCATCAAAACGCTCAACCAGGACG 

AGTTCGCCAGCTTCCCGCACCTGGAGGAGCTGGAGCTCAACGAGAACATCGTGAGCGCCGTGGAGCCCGGCGCCT 

TCAACAACCTCTTCAACCTCCGGACGCTGGGTCTCCGCAGCAACCGCCTGAAGCTCATCCCGCTAGGCGTCTTCA 

CTGGCCTCAGCAACCTGACCAAGCAGGACATCAGCGAGAACAAGATCGTTATCCTACTGGACTACATGTTTCAGG

ACCTGTACAACCTCAAGTCACTGGAGGTTGGCGACAATGACCTCGTCTACATCTCTCACCGCGCCTTCAGCGGCC 

TCAACAGCCTGGAGCAGCTGACGCTGGAGAAATGCAACCTGACCTCCATCCCCACCGAGGCGCTGTCCCACCTGC 

ACGGCCTCATCGTCCTGAGGCTCCGGCACCTCAACATCAATGCCATCCGGGACTACTCCTTCAAGAGGCTGTACC 

GACTCAAGGTCTTGGAGATCTCCCACTGGCCCTACTTGGACACCATGACACCCAACTGCCTCTACGGCCTCAACC 

TGACGTCCCTGTCCATCACACACTGCAATCTGACCGCTGTGCCCTACCTGGCCGTCCGCCACCTAGTCTATCTCC 

GCTTCCTCAACCTCTCCTACAACCCCATCAGCACCATTGAGGGCTCCATGTTGCATGAGCTGCTCCGGCTGCAGG 

AGATCCAGCTGGTGGGCGGGCAGCTGGCCGTGGTGGAGCCCTATGCCTTCCGCGGCCTCAACTACCTGCGCGTGC 

TCAATGTCTCTGGCAACCAGCTGACCACACTGGAGGAATCAGTCTTCCACTCGGTGGGCAACCTGGAGACACTCA 

TCCTGGACTCCAACCCGCTGGCCTGCGACTGTCGGCTCCTGTGGGTGTTCCGGCGCCGCTGGCGGCTCAACTTCA 

ACCGGCAGCAGCCCACGTGCGCCACGCCCGAGTTTGTCCAGGGCAAGGAGTTCAAGGACTTCCCTGATGTGCTAC 

TGCCCAACTACTTCACCTGCCGCCGCGCCCGCATCCGGGACCGCAAGGCCCAGCAGGTGTTTGTGGACGAGGGCC 

ACACGGTGCAGTTTGTGTGCCGGGCCGATGGCGACCCGCCGCCCGCCATCCTCTGGCTCTCACCCCGAAAGCACC 

TGGTCTCAGCCAAGAGCAATGGGCGGCTCACAGTCTTCCCTGATGGCACGCTGGAGGTGCGCTACGCCCAGGTAC 

AGGACAACGGCACGTACCTGTGCATCGCGGCCAACGCGGGCGGCAACGACTCCATGCCCGCCCACCTGCATGTGC 

GCAGCTACTCGCCCGACTGGCCCCATCAGCCCAACAAGACCTTCGCTTTCATCTCCAACCAGCCGGGCGAGGGAG 

AGGCCAACAGCACCCGCGCCACTGTGCCTTTCCCCTTCGACATCAAGACCCTCATCATCGCCACCACCATGGGCT 

TCATCTCTTTCCTGGGCGTCGTCCTCTTCTGCCTGGTGCTGCTGTTTCTCTGGAGCCGGGGCAAGGGCAACACAA 

AGCACAACATCGAGATCGAGTATGTGCCCCGAAAGTCGGACGCAGGCATCAGCTCCGCCGACGCGCCCCGCAAGT 

TCAACATGAAGATGATATGAGGCCGGGGCGGGGGGCAGGGACCCCCGGGCGGCCGGGCAGGGGAAGGGGCCTGGT 

CGCCACCTGCTCACTCTCCAGTCCTTCCCACCTCCTCCCTACCCTTCTACACACGTTCTCTTTCTCCCTCCCGCC 

TCCGTCCCCTGCTGCCCCCCGCCAGCCCTCACCACCTGCCCTCCTTCTACCAGGACCTCAGAAGCCCAGACCTGG 

GGACCCCACCTACACAGGGGCATTGACAGACTGGAGTTGAAAGCCGACGAACCGACACGCGGCAGAGTCAATAAT 

TCAATAAAAAAGTTACGAACTTTCTCTGTAACTTGGGTTTCAATAATTATGGATTTTTATGAAAACTTGAAATAA 
TAAAAAGAGAAAAAAACTAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 324



MQVSKRMLAGGVRSMPSPLLACWQPILLLVLGSVLSGSATGCPPRCECSAQDRAVLCHRKCFVAVPEGIPTETRL 



LDLGKNRIKTLNQDEFASFPHLEELELNENIVSAVEPGAFNNLFNLRTLGLRSNRLKLIPLGVFTGLSNLTKQDI 
SENKIVILLDYMFQDLYNLKSLEVGDNDLVYISHRAFSGLNSLEQLTLEKCNLTSIPTEALSHLHGLIVLRLRHL 
NINAIRDYSFKRLYRLKVLEISHWPYLDTMTPNCLYGLNLTSLSITHCNLTAVPYLAVRHLVYLRFLNLSYNPIS
TIEGSMLHELLRLQEIQLVGGQLAVVEPYAFRGLNYLRVLNVSGNQLTTLEESVFHSVGNLETLILDSNPLACDC
RLLWVFRRRWRLNFNRQQPTCATPEFVQGKEFKDFPDVLLPNYFTCRRARIRDRKAQQVFVDEGHTVQFVCRADG
DPPPAILWLSPRKHLVSAKSNGRLTVFPDGTLEVRYAQVQDNGTYLCIAANAGGNDSMPAHLHVRSYSPDWPHQP
NKTFAFISNQPGEGEANSTRATVPFPFDIKTLIIATTMGFISFLGVVLFCLVLLFLWSRGKGNTKHNIEIEYVPR
KSDAGISSADAPRKFNMKMI

Important features:
Signal sequence:

amino acids 1-41

Transmembrane domain:

amino acids 556-578

N-glycosylation site.

amino acids 144-148, 202-206, 264-268, 274-278, 293-297, 341-345, 492-496,
505-509, 526-530, 542-546

Casein kinase II phosphorylation site.

amino acids 49-53, 108-112, 146-150, 300-304, 348-352, 349-353, 607-611

Tyrosine kinase phosphorylation site.

amino acids 590-598

N-myristoylation site.

amino acids 10-16, 32-38, 37-43, 113-119, 125-131, 137-143, 262-268, 320-326,
344-350, 359-365, 493-499, 503-509, 605-611

Prokaryotic membrane lipoprotein lipid attachment site.

amino acids 32-43
FIGURE 325

CCCACGCGTCCGCCCACGCGTCCGAGGGACAAGAGAGAAGAGAGACTGAAACAGGGAGAAGAG 

GCAGGAGAGGAGGAGGTGGGGAGAGCACGAAGCTGGAGGCCGACACTGAGGGAGGGCGGGAGG 

AGGTGAAGAAGGAGAGAGGGGAGAAGAGGCAGGAGCTGGAAAGGAGAGAGGGAGGAGGAGGAG 

GAGATGCGGGATGGAGACCTGGAGTTAGGTGGCTTGGGAGAGCTTAATGAAAAGAGAACGGAG 

AGGAGGTGTGGGTTAGGAACCAAGAGGTAGCCCTGTGGGCAGCAGAAGGCTGAGAGGAGTAGG 

AAGATCAGGAGCTAGAGGGAGACTGGAGGGTTCCGGGAAAAGAGCAGAGGAAAGAGGAAAGAC 

ACAGAGAGACGGGAGAGAGAAGAAGAGTGGGTTTGAAGGGCGGATCTCAGTCCCTGGCTGCTT 

TGGCATTTGGGGAACTGGGACTCCCTGTGGGGAGGAGAGGAAAGCTGGAAGTCCTGGAGGGAC 

AGGGTCCCAGAAGGAGGGGACAGAGGAGCTGAGAGAGGGGGGCAGGGCGTTGGGCAGGGGTCC 

CTCGGAGGCCTCCTGGGGATGGGGGCTGCAGCTCGTCTGAGCGCCCCTCGAGCGCTGGTACTC 

TGGGCTGCACTGGGGGCAGCAGCTCACATCGGACCAGCACCTGACCCCGAGGACTGGTGGAGC 

TACAAGGATAATCTCCAGGGAAACTTCGTGCCAGGGCCTCCTTTCTGGGGCCTGGTGAATGCA 

GCGTGGAGTCTGTGTGCTGTGGGGAAGCGGCAGAGCCCCGTGGATGTGGAGCTGAAGAGGGTT 

CTTTATGACCCCTTTCTGCCCCCATTAAGGCTCAGCACTGGAGGAGAGAAGCTCCGGGGAACC 

TTGTACAACACCGGCCGACATGTCTCCTTCCTGCCTGCACCCCGACCTGTGGTCAATGTGTCT

GGAGGTCCCCTCCTTTACAGCCACCGACTCAGTGAACTGCGGCTGCTGTTTGGAGCTCGCGAC 

GGAGCCGGCTCGGAACATCAGATCAACCACCAGGGCTTCTCTGCTGAGGTGCAGCTCATTCAC 

TTCAACCAGGAACTCTACGGGAATTTCAGCGCTGCCTCCCGCGGCCCCAATGGCCTGGCCATT 

CTCAGCCTCTTTGTCAACGTTGCCAGTACCTCTAACCCATTCCTCAGTCGCCTCCTTAACCGC 

GACACCATCACTCGCATCTCCTACAAGAATGATGCCTACTTTCTTCAAGACCTGAGCCTGGAG 

CTCCTGTTCCCTGAATCCTTCGGCTTCATCACCTATCAGGGCTCTCTCAGCACCCCGCCCTGC 

TCCGAGACTGTCACCTGGATCCTCATTGACCGGGCCCTCAATATCACCTCCCTTCAGATGCAC 

TCCCTGAGACTCCTGAGCCAGAATCCTCCATCTCAGATCTTCCAGAGCCTCAGCGGTAACAGC 

CGGCCCCTGCAGCCCTTGGCCCACAGGGCACTGAGGGGCAACAGGGACCCCCGGCACCCCGAG 

AGGCGCTGCCGAGGCCCCAACTACCGCCTGCATGTGGATGGTGTCCCCCATGGTCGCTGAGAC

TCCCCTTCGAGGATTGCACCCGCCCGTCCTAAGCCTCCCCACAAGGCGAGGGGAGTTACCCCT 
AAAACAAAGCTATTAAAGGGACAGAATACTTA 
FIGURE 326

MGAAARLSAPRALVLWAALGAAAHIGPAPDPEDWWSYKDNLQGNFVPGPPFWGLVNAAWSLCA

VGKRQSPVDVELKRVLYDPFLPPLRLSTGGEKLRGTLYNTGRHVSFLPAPRPVVNVSGGPLLY

SHRLSELRLLFGARDGAGSEHQINHQGFSAEVQLIHFNQELYGNFSAASRGPNGLAILSLFVN 

VASTSNPFLSRLLNRDTITRISYKNDAYFLQDLSLELLFPESFGFITYQGSLSTPPCSETVTW 

ILIDRALNITSLQMHSLRLLSQNPPSQIFQSLSGNSRPLQPLAHRALRGNRDPRHPERRCRGP 
NYRLHVDGVPHGR 

Important features: 
Signal peptide: 

Amino acids 1-23 

Transmembrane domain: 

Amino acids 177-199 

N-glycosylation sites: 

Amino acids 118-122; 170-174; 260-264



Eukaryotic-type carbonic anhydrases proteins:

Amino acids 222-271; 128-165; 45-93
FIGURE 327



GGACTAATCTGTGGGAGCAGTTTATTCCAGTATCACCCAGGGTGCAGCCACACCAGGACTGTGTTGAAGGGTGTT 

TTTTTTCTTTTAAATGTAATACCTCCTCATCTTTTCTTCTTACACAGTGTCTGAGAACATTTACATT^^ 

GTAGTACATGGTGGATAACTTCTACTTTTAGGAGGACTACTCTCTTCTGACAGTCCTAGACTGGTCTTCTACACT 

AAGACACCATGAAGGAGTATGTGCTCCTATTATTCCTGGCTTTGTGCTCTGCCAAACCCTTCTTTAGCCCTTra^ 

ACATCGCACTGAAGAATATGATGCTGAAGGATATGGAAGACACAGATGATGATGATGATGATGATGATGATGATG 

ATGATGATGAGGACAACTCTCTTTTTCCAACAAGAGAGCCAAGAAGCCATTTTTTTCCATTTGATCTGTTTCCAA 

TGTGTCCATTTGGATGTCAGTGCTATTCACGAGTTGTACATTGCTCAGATTTAGGTTTGACCTCAGTCCCAACCA 

ACATTCCATTTGATACTCGAATGCTTGATCTTCAAAACAATAAAATTAAGGAAATCAAAGAAAATGATTTTA^ 

GACTCACTTCACTTTATGGTCTGATCCTGAACAACAACAAGCTAACGAAGATTCACCCAAAAGCCTTTCTAACCA 

CAAAGAAGTTGCGAAGGCTGTATCTGTCCCACAATCAACTAAGTGAAATACCACTTAATCTTCCCAAATCATTAG 

CAGAACTCAGAATTCATGAAAATAAAGTTAAGAAAATACAAAAGGACACATTCAAAGGAATGAATGCTTTACACG 

TTTTGGAAATGAGTGCAAACCCTCTTGATAATAATGGGATAGAGCCAGGGGCATTTGAAGGGGTGACGGTGTTCC 

ATATCAGAATTGCAGAAGCAAAACTGACCTCAGTTCCTAAAGGCTTACCACCAACTTTATTGGAGCTTC^^ 

ATTATAATAAAATTTCAACAGTGGAACTTGAGGATTTTAAACGATACAAAGAACTACAAAGGCTGGGCCTAGGA^ 

ACAACAAAATCACAGATATCGAAAATGGGAGTCTTGCTAACATACCACGTGTGAGAGAAATACATTTGGAAAACA 

ATAAACTAAAAAAAATCCCTTCAGGATTACCAGAGTTGAAATACCTCCAGATAATCTTCCTTCATTCTAATTCAA 

TTGCAAGAGTGGGAGTAAATGACTTCTGTCCAACAGTGCCAAAGATGAAGAAATCTTTATACAGTGCAAT^^ 

TATTCAACAACCCGGTGAAATACTGGGAAATGCAACCTGCAACATTTCGTTGTGTTTTGAGCAGAATGAGTGTTC 

AGCTTGGGAACTTTGGAATGTAATAATTAGTAATTGGTAATGTCCATTTAATATAAGATTCAAAAATCCCTACA^ 

TTGGAATACTTGAACTCTATTAATAATGGTAGTATTATATATACAAGCAAATATCTATTCTCAAGTGGTAAGTCC 

ACTGACTTATTTTATGACAAGAAATTTCAACGGAATTTTGCCAAACTATTGATACATAAGGGGTTGAGAGAAACA 

AGCATCTATTGCAGTTTCCTTTTTGCGTACAAATGATCTTACATAAATCTCATGCTTGACCATTCCTTTCTTCAT 

AACAAAAAAGTAAGATATTCGGTATTTAACACTTTGTTATCAAGCACATTTTAAAAAGAACTGTACTGTAAAT 

AATGCTTGACTTAGCAAAATTTGTGCTCTTTCATTTGCTGTTAGAAAAACAGAATTAACAAAGACAGTAATGTGA 

AGAGTGCATTACACTATTCTTATTCTTTAGTAACTTGGGTAGTACTGTAATATTTTTAATCATCTTAAAGTATGA 

TTTGATATAATCTTATTGAAATTACCTTATCATGTCTTAGAGCCCGTCTTTATGTTTAAAACTAATTTCTTAAAA 

TAAAGCCTTCAGTAAATGTTCATTACCAACTTGATAAATGCTACTCATAAGAGCTGGTTTGGGGCTATAGCATAT 

GCTTTTTTTTTTTTAATTATTACCTGATTTAAAAATCTCTGTAAAAACGTGTAGTGTTTCATAAAATCTGTAACT 

CGCATTTTAATGATCCGCTATTATAAGCTTTTAATAGCATGAAAATTGTTAGGCTATATAACATTGCCACTTCAA 

CTCTAAGGAATATTTTTGAGATATCCCTTTGGAAGACCTTGCTTGGAAGAGCCTGGACACTAACAATTCTACACC 

AAATTGTCTCTTCAAATACGTATGGACTGGATAACTCTGAGAAACACATCTAGTATAACTGAATAAGCAGAGCAT 

CAAATTAAACAGACAGAAACCGAAAGCTCTATATAAATGCTCAGAGTTCTTTATGTATTTCTTATTGGCATTCAA 

CATATGTAAMTCAGAAAACAGGGAAATTTTCATTAAAAATATTGGTTTGAAAT 
FIGURE 328

MKEYVLLLFLALCSAKPFFSPSHIALKNMMLKDMEDTDDDDDDDDDDDDDEDNSLFPTREPRS 
HFFPFDLFPMCPFGCQCYSRVVHCSDLGLTSVPTNIPFDTRMLDLQNNKIKEIKENDFKGLTS 
LYGLILNNNKLTKIHPKAFLTTKKLRRLYLSHNQLSEIPLNLPKSLAELRIHENKVKKIQKDT 
FKGMNALHVLEMSANPLDNNGIEPGAFEGVTVFHIRIAEAKLTSVPKGLPPTLLELHLDYNKI 
STVELEDFKRYKELQRLGLGNNKITDIENGSLANIPRVREIHLENNKLKKIPSGLPELKYLQI 
IFLHSNSIARVGVNDFCPTVPKMKKSLYSAISLFNNPVKYWEMQPATFRCVLSRMSVQLGNFGM 

Important features:
Signal sequence.

amino acids 1-15

N-glycosylation site.

amino acids 281-285

N-myristoylation sites.

amino acids 129-135, 210-216, 214-220, 237-243, 270-276, 282-288



Leucine zipper pattern.

amino acids 154-176
FIGURE 329

GGGGTCTCCCTCAGGGCCGGGAGGCACAGCGGTCCCTGCTTGCTGAAGGGCTGGATGTACGCA 

TCCGCAGGTTCCCGCGGACTTGGGGGCGCCCGCTGAGCCCCGGCGCCCGCAGAAGACTTGTGT 

TTGCCTCCTGCAGCCTCAACCCGGAGGGCAGCGAGGGCCTACCACCATGATCACTGGTGTGTT 

CAGCATGCGCTTGTGGACCCCAGTGGGCGTCCTGACCTCGCTGGCGTACTGCCTGCACCAGCG 

GCGGGTGGCCCTGGCCGAGCTGCAGGAGGCCGATGGCCAGTGTCCGGTCGACCGCAGCCTGCT 

GAAGTTGAAAATGGTGCAGGTCGTGTTTCGACACGGGGCTCGGAGTCCTCTCAAGCCGCTCCC 

GCTGGAGGAGCAGGTAGAGTGGAACCCCCAGCTATTAGAGGTCCCACCCCAAACTCAGTTTGA 

TTACACAGTCACCAATCTAGCTGGTGGTCCGAAACCATATTCTCCTTACGACTCTCAATACCA 

TGAGACCACCCTGAAGGGGGGCATGTTTGCTGGGCAGCTGACCAAGGTGGGCATGCAGCAAAT 

GTTTGCCTTGGGAGAGAGACTGAGGAAGAACTATGTGGAAGACATTCCCTTTCTTTCACCAAC 

CTTCAACCCACAGGAGGTCTTTATTCGTTCCACTAACATTTTTCGGAATCTGGAGTCCACCCG 

TTGTTTGCTGGCTGGGCTTTTCCAGTGTCAGAAAGAAGGACCCATCATCATCCACACTGATGA 

AGCAGATTCAGAAGTCTTGTATCCCAACTACCAAAGCTGCTGGAGCCTGAGGCAGAGAACCAG 

AGGCCGGAGGCAGACTGCCTCTTTACAGCCAGGAATCTCAGAGGATTTGAAAAAGGTGAAGGA 

CAGGATGGGCATTGACAGTAGTGATAAAGTGGACTTCTTCATCCTCCTGGACAACGTGGCTGC 

CGAGCAGGCACACAACCTCCCAAGCTGCCCCATGCTGAAGAGATTTGCACGGATGATCGAACA

GAGAGCTGTGGACACATCCTTGTACATACTGCCCAAGGAAGACAGGGAAAGTCTTCAGATGGC 

AGTAGGCCCATTCCTCCACATCCTAGAGAGCAACCTGCTGAAAGCCATGGACTCTGCCACTGC 

CCCCGACAAGATCAGAAAGCTGTATCTCTATGCGGCTCATGATGTGACCTTCATACCGCTCTT 

AATGACCCTGGGGATTTTTGACCACAAATGGCCACCGTTTGCTGTTGACCTGACCATGGAACT 

TTACCAGCACCTGGAATCTAAGGAGTGGTTTGTGCAGCTCTATTACCACGGGAAGGAGCAGGT 

GCCGAGAGGTTGCCCTGATGGGCTCTGCCCGCTGGACATGTTCTTGAATGCCATGTCAGTTTA 

TACCTTAAGCCCAGAAAAATACCATGCACTCTGCTCTCAAACTCAGGTGATGGAAGTTGGAAA 

TGAAGAGTAACTGATTTATAAAAGCAGGATGTGTTGATTTTAAAATAAAGTGCCTTTATACAATG 
FIGURE 330



MITGVFSMRLWTPVGVLTSLAYCLHQRRVALAELQEADGQCPVDRSLLKLKMVQVVFRHGARSPLKPLPLEEQVE 
WNPQLLEVPPQTQFDYTVTNLAGGPKPYSPYDSQYHETTLKGGMFAGQLTKVGMQQMFALGERLRKNYVEDIPFL 
SPTFNPQEVFIRSTNIFRNLESTRCLLAGLFQCQKEGPIIIHTDEADSEVLYPNYQSCWSLRQRTRGRRQTASLQ
PGISEDLKKVKDRMGIDSSDKVDFFILLDNVAAEQAHNLPSCPMLKRFARMIEQRAVDTSLYILPKEDRESLQMA 
VGPFLHILESNLLKAMDSATAPDKIRKLYLYAAHDVTFIPLLMTLGIFDHKWPPFAVDLTMELYQHLESKEWFVQ 
LYYHGKEQVPRGCPDGLCPLDMFLNAMSVYTLSPEKYHALCSQTQVMEVGNEE 

Important features:
Signal sequence:

amino acids 1-23

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 218-222

Casein kinase II phosphorylation site.
amino acids 87-91, 104-108, 320-324

Tyrosine kinase phosphorylation site.

amino acids 280-288

N-myristoylation site.

amino acids 15-21, 117-123, 118-124, 179-185, 240-246, 387-393

Amidation site.

amino acids 216-220

Leucine zipper pattern.

amino acids 10-32

Histidine acid phosphatases phosphohistidine signature.

amino acids 50-65
FIGURE 331

CGAGGGCTTTTCCGGCTCCGGAATGGCACATGTGGGAATCCCAGTCTTGTTGGCTACAACATTTTTCCCTTTCCT 

AACAAGTTCTAACAGCTGTTCTAACAGCTAGTGATCAGGGGTTCTTCTTGCTGGAGAAGAAAGGGCTGAGGGCAG 

AGCAGGGCACTCTCACTCAGGGTGACCAGCTCCTTGCCTCTCTGTGGATAACAGAGCATGAGAAAGTGAAGAGAT 

GCAGCGGAGTGAGGTGATGGAAGTCTAAAATAGGAAGGAATTTTGTGTGCAATATCAGACTCTGGGAGCAGTTGA 

CCTGGAGAGCCTGGGGGAGGGCCTGCCTAACAAGCTTTCAAAAAACAGGAGCGACTTCCACTGGGCTGGGATAAG 

ACGTGCCGGTAGGATAGGGAAGACTGGGTTTAGTCCTAATATCAAATTGACTGGCTGGGTGAACTTCAACAGCCT

TTTAACCTCTCTGGGAGATGAAAACGATGGCTTAAGGGGCCAGAAATAGAGATGCTTTGTAAAATAAAATTTTAA 

AAAAAGCAAGTATTTTATAGCATAAAGGCTAGAGACCAAAATAGATAACAGGATTCCCTGAACATTCCTAAGAGG 

GAGAAAGTATGTTAAAAATAGAAAAACCAAAATGCAGAAGGAGGAGACTCACAGAGCTAAACCAGGATGGGGACC

CTGGGTCAGGCCAGCCTCTTTGCTCCTCCCGGAAATTATTTTTGGTCTGACCACTCTGCCTTGTGTTTTGCAGAA 

TCATGTGAGGGCCAACCGGGGAAGGTGGAGCAGATGAGCACACACAGGAGCCGTCTCCTCACCGCCGCCCCTCTC 

AGCATGGAACAGAGGCAGCCCTGGCCCCGGGCCCTGGAGGTGGACAGCCGCTCTGTGGTCCTGCTCTCAGTGGTC 

TGGGTGCTGCTGGCCCCCCCAGCAGCCGGCATGCCTCAGTTCAGCACCTTCCACTCTGAGAATCGTGACTGGACC

TTCAACCACTTGACCGTCCACCAAGGGACGGGGGCCGTCTATGTGGGGGCCATCAACCGGGTCTATAAGCTGACA 

GGCAACCTGACCATCCAGGTGGCTCATAAGACAGGGCCAGAAGAGGACAACAAGTCTCGTTACCCGCCCCTCATC 

GTGCAGCCCTGCAGCGAAGTGCTCACCCTCACCAACAATGTCAACAAGCTGCTCATCATTGACTACTCTGAGAAC 

CGCCTGCTGGCCTGTGGGAGCCTCTACCAGGGGGTCTGCAAGCTGCTGCGGCTGGATGACCTCTTCATCCTGGTG 

GAGCCATCCCACAAGAAGGAGCACTACCTGTCCAGTGTCAACAAGACGGGCACCATGTACGGGGTGATTGTGCGC 

TCTGAGGGTGAGGATGGCAAGCTCTTCATCGGCACGGCTGTGGATGGGAAGCAGGATTACTTCCCGACCCTGTCC 

AGCCGGAAGCTGCCCCGAGACCCTGAGTCCTCAGCCATGCTCGACTATGAGCTACACAGCGATTTTGTCTCCTCT 

CTCATCAAGATCCCTTCAGACACCCTGGCCCTGGTCTCCCACTTTGACATCTTCTACATCTACGGCTTTGCTAGT 

GGGGGCTTTGTCTACTTTCTCACTGTCCAGCCCGAGACCCCTGAGGGTGTGGCCATCAACTCCGCTGGAGACCTC 

TTCTACACCTCACGCATCGTGCGGCTCTGCAAGGATGACCCCAAGTTCCACTCATACGTGTCCCTGCCCTTCGGC

TGCACCCGGGCCGGGGTGGAATACCGCCTCCTGCAGGCTGCTTACCTGGCCAAGCCTGGGGACTCACTGGCCCAG 

GCCTTCAATATCACCAGCCAGGACGATGTACTCTTTGCCATCTTCTCCAAAGGGCAGAAGCAGTATCACCACCCG 

CCCGATGACTCTGCCCTGTGTGCCTTCCCTATCCGGGCCATCAACTTGCAGATCAAGGAGCGCCTGCAGTCCTGC 

TACCAGGGCGAGGGCAACCTGGAGCTCAACTGGCTGCTGGGGAAGGACGTCCAGTGCACGAAGGCGCCTGTCCCC 

ATCGATGATAACTTCTGTGGACTGGACATCAACCAGCCCCTGGGAGGCTCAACTCCAGTGGAGGGCCTGACCCTG 

TACACCACCAGCAGGGACCGCATGACCTCTGTGGCCTCCTACGTTTACAACGGCTACAGCGTGGTTTTTGTGGGG 

ACTAAGAGTGGCAAGCTGAAAAAGGTAAGAGTCTATGAGTTCAGATGCTCCAATGCCATTCACCTCCTCAGCAAA 

GAGTCCCTCTTGGAAGGTAGCTATTGGTGGAGATTTAACTATAGGCAACTTTATTTTCTTGGGGAACAAAGGm 
AATGGGGAGGTAAGAAGGGGTTAATTTTGTGACTTAGCTTCTAGCTACTTCCTCCAGCCATCAGTCATTGGGTAT 
GTAAGGAATGCAAGCGTATTTCAATATTTCCCAAACTTTAAGAAAAAACTTTAAGAAGGTACATCTGCA^ 
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FIGURE 332 



MGTLGQASLFAPPGNYFWSDHSALCFAESCEGQPGKVEQMSTHRSRLLTAAPLSMEQRQPWPR 
ALEVDSRSVVLLSVVWVLLAPPAAGMPQFSTFHSENRDWTFNHLTVHQGTGAVYVGAINRVYK
LTGNLTIQVAHKTGPEEDNKSRYPPLIVQPCSEVLTLTNNVNKLLIIDYSENRLLACGSLYQG 
VCKLLRLDDLFILVEPSHKKEHYLSSVNKTGTMYGVIVRSEGEDGKLFIGTAVDGKQDYFPTL 
SSRKLPRDPESSAMLDYELHSDFVSSLIKIPSDTLALVSHFDIFYIYGFASGGFVYFLTVQPE 
TPEGVAINSAGDLFYTSRIVRLCKDDPKFHSYVSLPFGCTRAGVEYRLLQAAYLAKPGDSLAQ 
AFNITSQDDVLFAIFSKGQKQYHHPPDDSALCAFPIRAINLQIKERLQSCYQGEGNLELNWLL 
GKDVQCTKAPVPIDDNFCGLDINQPLGGSTPVEGLTLYTTSRDRMTSVASYVYNGYSVVFVGT
KSGKLKKVRVYEFRCSNAIHLLSKESLLEGSYWWRFNYRQLYFLGEQR 

Important features: 
Signal sequence: 

amino acids 1-32 

Transmembrane domain:

amino acids 71-87 

N-glycosylation site. 

amino acids 130-134, 145-149, 217-221, 381-385 
Casein kinase II phosphorylation site.

amino acids 139-143, 229-233, 240-244, 291-295, 324-328, 383-387, 
384-388, 471-475, 481-485, 530-534 

N-myristoylation site. 

amino acids 220-226, 319-325, 353-359, 460-466, 503-509 
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FIGURE 333 

GCTGAGTCTGCTGCTCCTGCTGCTGCTGCTCCAGCCTGTAACCTGTGCCTACACCACGCCAGG 

CCCCCCCAGAGCCCTCACCACGCTGGGCGCCCCCAGAGCCCACACCATGCCGGGCACCTACGC

TCCCTCGACCACACTCAGTAGTCCCAGCACCCAGGGCCTGCAAGAGCAGGCACGGGCCCTGAT 

GCGGGACTTCCCGCTCGTGGACGGCCACAACGACCTGCCCCTGGTCCTAAGGCAGGTTTACCA 

GAAAGGGCTACAGGATGTTAACCTGCGCAATTTCAGCTACGGCCAGACCAGCCTGGACAGGCT 

TAGAGATGGCCTCGTGGGCGCCCAGTTCTGGTCAGCCTATGTGCCATGCCAGACCCAGGACCG 

GGATGCCCTGCGCCTCACCCTGGAGCAGATTGACCTCATACGCCGCATGTGTGCCTCCTATTC 

TGAGCTGGAGCTTGTGACCTCGGCTAAAGCTCTGAACGACACTCAGAAATTGGCCTGCCTCAT 

CGGTGTAGAGGGTGGCCACTCGCTGGACAATAGCCTCTCCATCTTACGTACCTTCTACATGCT 

GGGAGTGCGCTACCTGACGCTCACCCACACCTGCAACACACCCTGGGCAGAGAGCTCCGCTAA 

GGGCGTCCACTCCTTCTACAACAACATCAGCGGGCTGACTGACTTTGGTGAGAAGGTGGTGGC 

AGAAATGAACCGCCTGGGCATGATGGTAGACTTATCCCATGTCTCAGATGCTGTGGCACGGCG 

GGCCCTGGAAGTGTCACAGGCACCTGTGATCTTCTCCCACTCGGCTGCCCGGGGTGTGTGCAA 

CAGTGCTCGGAATGTTCCTGATGACATCCTGCAGCTTCTGAAGAAGAACGGTGGCGTCGTGAT 

GGTGTCTTTGTCCATGGGAGTAATACAGTGCAACCCATCAGCCAATGTGTCCACTGTGGCAGA 

TCACTTCGACCACATCAAGGCTGTCATTGGATCCAAGTTCATCGGGATTGGTGGAGATTATGA 

TGGGGCCGGCAAATTCCCTCAGGGGCTGGAAGACGTGTCCACATACCCGGTCCTGATAGAGGA 

GTTGCTGAGTCGTGGCTGGAGTGAGGAAGAGCTTCAGGGTGTCCTTCGTGGAAACCTGCTGCG 

GGTCTTCAGACAAGTGGAAAAGGTACAGGAAGAAAACAAATGGCAAAGCCCCTTGGAGGACAA 

GTTCCCGGATGAGCAGCTGAGGAGTTCCTGCCACTCCGACCTCTCACGTCTGCGTCAGAGACA 

GAGTCTGACTTCAGGCCAGGAACTCACTGAGATTCCCATACACTGGACAGCCAAGTTACCAGC 

CAAGTGGTCAGTCTCAGAGTCCTCCCCCCACATGGCCCCAGTCCTTGCAGTTGTGGCCACCTT 

CCCAGTCCTTATTCTGTGGCTCT^TGACCCAGTTAGTCCTGCCAGATGTCACTGTAGCAAGC 

CACAGACACCCCACAAAGTTCCCCTGTTGTGCAGGCACAAATATTTCCTGAAATAAATGTTTT 
GGACATAG 
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FIGURE 334 

MPGTYAPSTTLSSPSTQGLQEQARALMRDFPLVDGHNDLPLVLRQVYQKGLQDVNLRNFSYGQ
TSLDRLRDGLVGAQFWSAYVPCQTQDRDALRLTLEQIDLIRRMCASYSELELVTSAKALNDTQ
KLACLIGVEGGHSLDNSLSILRTFYMLGVRYLTLTHTCNTPWAESSAKGVHSFYNNISGLTDF
GEKVVAEMNRLGMMVDLSHVSDAVARRALEVSQAPVIFSHSAARGVCNSARNVPDDILQLLKK
NGGVVMVSLSMGVIQCNPSANVSTVADHFDHIKAVIGSKFIGIGGDYDGAGKFPQGLEDVSTY
PVLIEELLSRGWSEEELQGVLRGNLLRVFRQVEKVQEENKWQSPLEDKFPDEQLSSSCHSDLS
RLRQRQSLTSGQELTEIPIHWTAKLPAKWSVSESSPHMAPVLAVVATFPVLILWL

Important features: 
N-glycosylation sites. 

amino acids 58-62, 123-127, 182-186, 273-277 
N-myristoylation sites. 

amino acids 72-78, 133-139, 234-240, 264-270, 334-340, 389-395 



Renal dipeptidase active site. 

amino acids 134-157 
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FIGURE 335 

CCCAGAAGTTCAAGGGCCCCCGGCCTCCTGCGCTCCTGCCGCCGGGACCCTCGACCTCCTCAG 

AGCAGCCGGCTGCCGCCCCGGGAAGATGGCGAGGAGGAGCCGCCACCGCCTCCTCCTGCTGCT

GCTGCGCTACCTGGTGGTCGCCCTGGGCTATCATAAGGCCTATGGGTTTTCTGCCCCAAAAGA 

CCAACAAGTAGTCACAGCAGTAGAGTACCAAGAGGCTATTTTAGCCTGCAAAACCCCAAAGAA 

GACTGTTTCCTCCAGATTAGAGTGGAAGAAACTGGGTCGGAGTGTCTCCTTTGTCTACTATCA 

ACAGACTCTTCAAGGTGATTTTAAAAATCGAGCTGAGATGATAGATTTCAATATCCGGATCAA 

AAATGTGACAAGAAGTGATGCGGGGAAATATCGTTGTGAAGTTAGTGCCCCATCTGAGCAAGG 

CCAAAACCTGGAAGAGGATACAGTCACTCTGGAAGTATTAGTGGCTCCAGCAGTTCCATCATG 

TGAAGTACCCTCTTCTGCTCTGAGTGGAACTGTGGTAGAGCTACGATGTCAAGACAAAGAAGG 

GAATCCAGCTCCTGAATACACATGGTTTAAGGATGGCATCCGTTTGCTAGAAAATCCCAGACT 

TGGCTCCCAAAGCACCAACAGCTCATACACAATGAATACAAAAACTGGAACTCTGCAATTTAA 

TACTGTTTCCAAACTGGACACTGGAGAATATTCCTGTGAAGCCCGCAATTCTGTTGGATATCG 

CAGGTGTCCTGGGAAACGAATGCAAGTAGATGATCTCAACATAAGTGGCATCATAGCAGCCGT 

AGTAGTTGTGGCCTTAGTGATTTCCGTTTGTGGCCTTGGTGTATGCTATGCTCAGAGGAAAGG 

CTACTTTTCAAAAGAAACCTCCTTCCAGAAGAGTAATTCTTCATCTAAAGCCACGACAATGAG 

TGAAAATGTGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGAAGGCCGCGGCGGGCGGATC 

ACGAGGTCAGGAGTTCTAGACCAGTCTGGCCAATATGGTGAAACCCCATCTCTACTAAAATAC 

AAAAATTAGCTGGGCATGGTGGCATGTGCCTGCAGTTCCAGCTGCTTGGGAGACAGGAGAATC 

ACTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCTGAGATCACGCCACTGCAGTCCAGCCTGGG 

TAACAGAGCAAGATTCCATCTCAAAAAATAAAATAAATAAATAAATAAATACTGGTTTTTACC 
TGTAGAATTCTTACAATAAATATAGCTTGATATTC 
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FIGURE 336 

MARRSRHRLLLLLLRYLVVALGYHKAYGFSAPKDQQVVTAVEYQEAILACKTPKKTVSSRLEW
KKLGRSVSFVYYQQTLQGDFKNRAEMIDFNIRIKNVTRSDAGKYRCEVSAPSEQGQNLEEDTV 
TLEVLVAPAVPSCEVPSSALSGTVVELRCQDKEGNPAPEYTWFKDGIRLLENPRLGSQSTNSS 
YTMNTKTGTLQFNTVSKLDTGEYSCEARNSVGYRRCPGKRMQVDDLNISGIIAAVVVVALVIS
VCGLGVCYAQRKGYFSKETSFQKSNSSSKATTMSENVQWLTPVIPALWKAAAGGSRGQEF 

Important features: 
Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 130-144, 238-258 

N-glycosylation site.

amino acids 98-102, 187-191, 236-240, 277-281 
Casein kinase II phosphorylation site. 

amino acids 39-43, 59-63, 100-104, 149-153, 205-209, 284-288
N-myristoylation site. 

amino acids 182-188, 239-245, 255-261, 257-263, 305-311 

Amidation site.

amino acids 226-230 
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FIGURE 337 

GGAGCCGCCCTGGGTGTCAGCGGCTCGGCTCCCGCGCACGCTCCGGCCGTCGCGCAGCCTCGG 

CACCTGCAGGTCCGTGCGTCCCGCGGCTGGCGCCCCTGACTCCGTCCCGGCCAGGGAGGGCCA 

TGATTTCCCTCCCGGGGCCCCTGGTGACCAACTTGCTGCGGTTTTTGTTCCTGGGGCTGAGTG

CCCTCGCGCCCCCCTCGCGGGCCCAGCTGCAACTGCACTTGCCCGCCAACCGGTTGCAGGCGG 

TGGAGGGAGGGGAAGTGGTGCTTCCAGCGTGGTACACCTTGCACGGGGAGGTGTCTTCATCCC 

AGCCATGGGAGGTGCCCTTTGTGATGTGGTTCTTCAAACAGAAAGAAAAGGAGGATCAGGTGT 

TGTCCTACATCAATGGGGTCACAACAAGCAAACCTGGAGTATCCTTGGTCTACTCCATGCCCT 

CCCGGAACCTGTCCCTGCGGCTGGAGGGTCTCCAGGAGAAAGACTCTGGCCCCTACAGCTGCT 

CCGTGAATGTGCAAGACAAACAAGGCAAATCTAGGGGCCACAGCATCAAAACCTTAGAACTCA 

ATGTACTGGTTCCTCCAGCTCCTCCATCCTGCCGTCTCCAGGGTGTGCCCCATGTGGGGGCAA 

ACGTGACCCTGAGCTGCCAGTCTCCAAGGAGTAAGCCCGCTGTCCAATACCAGTGGGATCGGC 

AGCTTCCATCCTTCCAGACTTTCTTTGCACCAGCATTAGATGTCATCCGTGGGTCTTTAAGCC 

TCACCAACCTTTCGTCTTCCATGGCTGGAGTCTATGTCTGCAAGGCCCACAATGAGGTGGGCA 

CTGCCCAATGTAATGTGACGCTGGAAGTGAGCACAGGGCCTGGAGCTGCAGTGGTTGCTGGAG 

CTGTTGTGGGTACCCTGGTTGGACTGGGGTTGCTGGCTGGGCTGGTCCTCTTGTACCACCGCC 

GGGGCAAGGCCCTGGAGGAGCCAGCCAATGATATCAAGGAGGATGCCATTGCTCCCCGGACCC 

TGCCCTGGCCCAAGAGCTCAGACACAATCTCCAAGAATGGGACCCTTTCCTCTGTCACCTCCG 

CACGAGCCCTCCGGCCACCCCATGGCCCTCCCAGGCCTGGTGCATTGACCCCCACGCCCAGTC 

TCTCCAGCCAGGCCCTGCCCTCACCAAGACTGCCCACGACAGATGGGGCCCACCCTCAACCAA 

TATCCCCCATCCCTGGTGGGGTTTCTTCCTCTGGCTTGAGCCGCATGGGTGCTGTGCCTGTGA 

TGGTGCCTGCCCAGAGTCAAGCTGGCTCTCTGGTATGATGACCCCACCACTCATTGGCTAAAG 

GATTTGGGGTCTCTCCTTCCTATAAGGGTCACCTCTAGCACAGAGGCCTGAGTCATGGGAAAG 

AGTCACACTCCTGACCCTTAGTACTCTGCCCCCACCTCTCTTTACTGTGGGAAAACCATCTCA 

GTAAGACCTAAGTGTCCAGGAGACAGAAGGAGAAGAGGAAGTGGATCTGGAATTGGGAGGAGC 

CTCCACCCACCCCTGACTCCTCCTTATGAAGCCAGCTGCTGAAATTAGCTACTCACCAAGAGT 

GAGGGGCAGAGACTTCCAGTCACTGAGTCTCCCAGGCCCCCTTGATCTGTACCCCACCCCTAT 

CTAACACCACCCTTGGCTCCCACTCCAGCTCCCTGTATTGATATAACCTGTCAGGCTGGCTTG 

GTTAGGTTTTACTGGGGCAGAGGATAGGGAATCTCTTATTAAAACTAACATGAAATATGTGTT 

GTTTTCATTTGCAAATTTAAATAAAGATACATAATGTTTGTATGAAAAA 
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FIGURE 338 

MISLPGPLVTNLLRFLFLGLSALAPPSRAQLQLHLPANRLQAVEGGEVVLPAWYTLHGEVSSS 

QPWEVPFVMWFFKQKEKEDQVLSYINGVTTSKPGVSLVYSMPSRNLSLRLEGLQEKDSGPYSC 

SVNVQDKQGKSRGHSIKTLELNVLVPPAPPSCRLQGVPHVGANVTLSCQSPRSKPAVQYQWDR 

QLPSFQTFFAPALDVIRGSLSLTNLSSSMAGVYVCKAHNEVGTAQCNVTLEVSTGPGAAVVAG 

AVVGTLVGLGLLAGLVLLYHRRGKALEEPANDIKEDAIAPRTLPWPKSSDTISKNGTLSSVTS 

ARALRPPHGPPRPGALTPTPSLSSQALPSPRLPTTDGAHPQPISPIPGGVSSSGLSRMGAVPV 
MVPAQSQAGSLV 

Important features: 
Signal peptide: 

amino acids 1-29 

Transmembrane domain:

amino acids 245-267

N-glycosylation site.

amino acids 108-112, 169-173, 213-217, 236-240, 307-311
N-myristoylation site.

amino acids 90-96, 167-173, 220-226, 231-237, 252-258, 256-262,
262-268, 308-314, 363-369, 364-370

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 164-175 
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FIGURE 339 



GCGAGAACCTTTGCACGCGCACAAACTACGGGGACGATTTCTGATTGATTTTTGGCGCTTTCGATCCACCCTCCT 

CCCTTCTCATGGGACTTTGGGGACAAAGCGTCCCGACCGCCTCGAGCGCTCGAGCAGGGCGCTATCCAGGAGCCA 

GGACAGCGTCGGGAACCAGACCATGGCTCCTGGACCCCAAGATCCTTAAGTTCGTCGTCTTCATCGTCGCGGTTC 

TGCTGCCGGTCCGGGTTGACTCTGCCACCATCCCCCGGCAGGACGAAGTTCCCCAGCAGACAGTGGCCCCACAGC 

AACAGAGGCGCAGCCTCAAGGAGGAGGAGTGTCCAGCAGGATCTCATAGATCAGAATATACTGGAGCCTGTAACC 

CGTGCACAGAGGGTGTGGATTACACCATTGCTTCCAACAATTTGCCTTCTTGCCTGCTATGTACAGTTTGTAAAT 

CAGGTCAAACAAATAAAAGTTCCTGTACCACGACCAGAGACACCGTGTGTCAGTGTGAAAAAGGAAGCTTCCAGG

ATAAAAACTCCCCTGAGATGTGCCGGACGTGTAGAACAGGGTGTCCCAGAGGGATGGTCAAGGTCAGTAATTGTA 

CGCCCCGGAGTGACATCAAGTGCAAAAATGAATCAGCTGCCAGTTCCACTGGGAAAACCCCAGCAGCGGAGGAGA 

CAGTGACCACCATCCTGGGGATGCTTGCCTCTCCCTATCACTACCTTATCATCATAGTGGTTTTAGTCATCATTT 

TAGCTGTGGTTGTGGTTGGCTTTTCATGTCGGAAGAAATTCATTTCTTACCTCAAAGGCATCTGCTCAGGTGGTG 

GAGGAGGTCCCGAACGTGTGCACAGAGTCCTTTTCCGGCGGCGTTCATGTCCTTCACGAGTTCCTGGGGCGGAGG 

ACAATGCCCGCAACGAGACCCTGAGTAACAGATACTTGCAGCCCACCCAGGTCTCTGAGCAGGAAATCCAAGGTC 

AGGAGCTGGCAGAGCTAACAGGTGTGACTGTAGAGTCGCCAGAGGAGCCACAGCGTCTGCTGGAACAGGCAGAAG 

CTGAAGGGTGTCAGAGGAGGAGGCTGCTGGTTCCAGTGAATGACGCTGACTCCGCTGACATCAGCACCTTGCTGG 

ATGCCTCGGCAACACTGGAAGAAGGACATGCAAAGGAAACTATTCAGGACCAACTGGTGGGCTCCGAAAAGCTCT

TTTATGAAGAAGATGAGGCAGGCTCTGCTACGTCCTGCCTGTGAAAGAATCTCTTCAGGAAACCAGAGCTTCCCT 

CATTTACCTTTTCTCCTACAAAGGGAAGCAGCCTGGAAGAAACAGTCCAGTACTTGACCCATGCCCCAACAAACT 

CTACTATCCAATATGGGGCAGCTTACCAATGGTCCTAGAACTTTGTTAACGCACTTGGAGTAATTTTTATGAAAT 

ACTGCGTGTGATAAGCAAACGGGAGAAATTTATATCAGATTCTTGGCTGCATAGTTATACGATTGTGTATTAAGG 

GTCGTTTTAGGCCACATGCGGTGGCTCATGCCTGTAATCCCAGCACTTTGATAGGCTGAGGCAGGTGGATTGCTT 

GAGCTCGGGAGTTTGAGACCAGCCTCATCAACACAGTGAAACTCCATCTCAATTTAAAAAGAAAAAAAGTGGTTT 

TAGGATGTCATTCTTTGCAGTTCTTCATCATGAGACAAGTCTTTTTTTCTGCTTCTTATATTGCAAGCTCCATCT 

CTACTGGTGTGTGCATTTAATGACATCTAACTACAGATGCCGCACAGCCACAATGCTTTGCCTTATAGTTTTTTA 

ACTTTAGAACGGGATTATCTTGTTATTACCTGTATTTTCAGTTTCGGATATTTTTGACTTAATGATGAGATTATC 

AAGACGTAGCCCTATGCTAAGTCATGAGCATATGGACTTACGAGGGTTCGACTTAGAGTTTTGAGCTTTAAGATA 

GGATTATTGGGGCTTACCCCCACCTTAATTAGAGAAACATTTATATTGCTTACTACTGTAGGCTGTACATCTCTT 

TTCCGATTTTTGTATAATGATGTAAACATGGAAAAACTTTAGGAAATGCACTTATTAGGCTGTTTACATGGGTTG 

CCTGGATACAAATCAGCAGTCAAAAATGACTAAAAATATAACTAGTGACGGAGGGAGAAATCCTCCCTCTGTGGG 

AGGCACTTACTGCATTCCAGTTCTCCCTCCTGCGCCCTGAGACTGGACCAGGGTTTGATGGCTGGCAGCTTCTCA 

AGGGGCAGCTTGTCTTACTTGTTAATTTTAGAGGTATATAGCCATATTTATTTATAAATAAATATTTATTTATTT 

ATTTATAAGTAGATGTTTACATATGCCCAGGATTTTGAAGAGCCTGGTATCTTTGGGAAGCCATGTGTCTGGTTT 

GTCGTGCTGGGACAGTCATGGGACTGCATCTTCCGACTTGTCCACAGCAGATGAGGACAGTGAGAATTAAGTTAG 

ATCCGAGACTGCGAAGAGCTTCTCTTTCAAGCGCCATTACAGTTGAACGTTAGTGAATCTTGAGCCTCATTTGGG 

CTCAGGGCAGAGCAGGTGTTTATCTGCCCCGGCATCTGCCATGGCATCAAGAGGGAAGAGTGGACGGTGCTTGGG 

AATGGTGTGAAATGGTTGCCGACTCAGGCATGGATGGGCCCCTCTCGCTTCTGGTGGTCTGTGAACTGAGTCCCT 

GGGATGCCTTTTAGGGCAGAGATTCCTGAGCTGCGTTTTAGGGTACAGATTCCCTGTTTGAGGAGCTTGGCCCCT 

CTGTAAGCATCTGACTCATCTCAGAGATATCAATTCTTAAACACTGTGACAACGGGATCTAAAATGGCTGACACA 

TTTGTCCTTGTGTCACGTTCCATTATTTTATTTAAAAACCTCAGTAATCGTTTTAGCTTCTTTCCAGCAAACTCT 

TCTCCACAGTAGCCCAGTCGTGGTAGGATAAATTACGGATATAGTCATTCTAGGGGTTTCAGTCTTTTCCATCTC 

AAGGCATTGTGTGTTTTGTTCCGGGACTGGTTTGGCTGGGACAAAGTTAGAACTGCCTGAAGTTCGCACATTCAG 

ATTGTTGTGTCCATGGAGTTTTAGGAGGGGATGQCCTTTCCGGTCTTCGCACTTCCATCCTCTCCCACTTCCATC 

TGGCGTCCCACACCTTGTCCCCTGCACTTCTGGATGACACAGGGTGCTGCTGCCTCCTAGTCTTTGCCTTTGCTG 

GGCCTTCTGTGCAGGAGACTTGGTCTCAAAGCTCAGAGAGAGCCAGTCCGGTCCCAGCTCCTTTGTCCCTTCCTC 

AGAGGCCTTCCTTGAAGATGCATCTAGACTACCAGCCTTATCAGTGTTTAAGCTTATTCCTTTAACATAAGCTTC 

CTGACAACATGAAATTGTTGGGGTTTTTTGGCGTTGGTTGATTTGTTTAGGTTTTGCTTTATACCCGGGCCAAAT 

AGCACATAACACCTGGTTATATATGAAATACTCATATGTTTATGACCAAAATAAATATGAAACCTCATRTTAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 340 

MGLWGQSVPTASSARAGRYPGARTASGTRPWLLDPKILKFVVFIVAVLLPVRVDSATIPRQDEVPQQTVAPQQQR 
RSLKEEECPAGSHRSEYTGACNPCTEGVDYTIASNNLPSCLLCTVCKSGQTNKSSCTTTRDTVCQCEKGSFQDKN 
SPEMCRTCRTGCPRGMVKVSNCTPRSDIKCKNESAASSTGKTPAAEETVTTILGMLASPYHYLIIIVVLVIILAV 
VVVGFSCRKKFISYLKGICSGGGGGPERVHRVLFRRRSCPSRVPGAEDNARNETLSNRYLQPTQVSEQEIQGQEL
AELTGVTVESPEEPQRLLEQAEAEGCQRRRLLVPVNDADSADISTLLDASATLEEGHAKETIQDQLVGSEKLFYE 
EDEAGSATSCL 

Important features: 
Transmembrane domains:

amino acids 35-52, 208-230

N-glycosylation sites.

amino acids 127-131, 182-186, 277-281

Glycosaminoglycan attachment site.

amino acids 245-249

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 260-264

N-myristoylation sites.

amino acids 21-27, 86-92, 102-108, 161-167, 242-248, 270-276, 297-303, 380-386

ATP/GTP-binding site motif A (P-loop).

amino acids 185-193 



TNFR/NGFR cysteine-rich region. 

amino acids 99-139 
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FIGURE 341 



GCCTCTGAATTGTTGGGCAGTCTGGCAGTGGAGCTCTCCCCGGTCTGACAGCCACTCCAGAGG 

CCATGCTTCGTTTCTTGCCAGATTTGGCTTTCAGCTTCCTGTTAATTCTGGCTTTGGGCCAGG 

CAGTCCAATTTCAAGAATATGTCTTTCTCCAATTTCTGGGCTTAGATAAGGCGCCTTCACCCC 

AGAAGTTCCAACCTGTGCCTTATATCTTGAAGAAAATTTTCCAGGATCGCGAGGCAGCAGCGA 

CCACTGGGGTCTCCCGAGACTTATGCTACGTAAAGGAGCTGGGCGTCCGCGGGAATGTACTTC 

GCTTTCTCCCAGACCAAGGTTTCTTTCTTTACCCAAAGAAAATTTCCCAAGCTTCCTCCTGCC 

TGCAGAAGCTCCTCTACTTTAACCTGTCTGCCATCAAAGAAAGGGAACAGTTGACATTGGCCC 

AGCTGGGCCTGGACTTGGGGCCCAATTCTTACTATAACCTGGGACCAGAGCTGGAACTGGCTC 

TGTTCCTGGTTCAGGAGCCTCATGTGTGGGGCCAGACCACCCCTAAGCCAGGTAAAATGTTTG 

TGTTGCGGTCAGTCCCATGGCCACAAGGTGCTGTTCACTTCAACCTGCTGGATGTAGCTAAGG 

ATTGGAATGACAACCCCCGGAAAAATTTCGGGTTATTCCTGGAGATACTGGTCAAAGAAGATA 

GAGACTCAGGGGTGAATTTTCAGCCTGAAGACACCTGTGCCAGACTAAGATGCTCCCTTCATG 

CTTCCCTGCTGGTGGTGACTCTCAACCCTGATCAGTGCCACCCTTCTCGGAAAAGGAGAGCAG 

CCATCCCTGTCCCCAAGCTTTCTTGTAAGAACCTCTGCCACCGTCACCAGCTATTCATTAACT 

TCCGGGACCTGGGTTGGCACAAGTGGATCATTGCCCCCAAGGGGTTCATGGCAAATTACTGCC 

ATGGAGAGTGTCCCTTCTCACTGACCATCTCTCTCAACAGCTCCAATTATGCTTTCATGCAAG 

CCCTGATGCATGCCGTTGACCCAGAGATCCCCCAGGCTGTGTGTATCCCCACCAAGCTGTCTC 

CCATTTCCATGCTCTACCAGGACAATAATGACAATGTCATTCTACGACATTATGAAGACATGG 

TAGTCGATGAATGTGGGTGTGGGTAGGATGTCAGAAATGGGAATAGAAGGAGTGTTCTTAGGG

TAAATCTTTTAATAAAACTACCTATCTGGTTTATGACCACTTAGATCGAAATGTC 
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FIGURE 342 

MLRFLPDLAFSFLLILALGQAVQFQEYVFLQFLGLDKAPSPQKFQPVPYILKKIFQDREAAAT 
TGVSRDLCYVKELGVRGNVLRFLPDQGFFLYPKKISQASSCLQKLLYFNLSAIKEREQLTLAQ 
LGLDLGPNSYYNLGPELELALFLVQEPHVWGQTTPKPGKMFVLRSVPWPQGAVHFNLLDVAKD 
WNDNPRKNFGLFLEILVKEDRDSGVNFQPEDTCARLRCSLHASLLVVTLNPDQCHPSRKRRAA 
IPVPKLSCKNLCHRHQLFINFRDLGWHKWIIAPKGFMANYCHGECPFSLTISLNSSNYAFMQA 
LMHAVDPEIPQAVCIPTKLSPISMLYQDNNDNVILRHYEDMVVDECGCG

Important features: 
Signal peptide: 

amino acids 1-21

N-glycosylation sites.

amino acids 112-116, 306-310

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 96-100

N-myristoylation site.

amino acids 77-83

TGF-beta family proteins.

amino acids 264-299, 327-341, 345-364 
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FIGURE 343 



CCCACGCGTCCGGCCTTCTCTCTGGACTTTGCATTTCCATTCCTTTTCATTGACAAACTGACTTTTTTTATTTCT 

TTTTTTCCATCTCTGGGCCAGCTTGGGATCCTAGGCCGCCCTGGGAAGACATTTGTGTTTTACACACATAAGGAT 

CTGTGTTTGGGGTTTCTTCTTCCTCCCCTGACATTGGCATTGCTTAGTGGTTGTGTGGGGAGGGAGACCACGTGG 

GCTCAGTGCTTGCTTGCACTTATCTGCCTAGGTACATCGAAGTCTTTTGACCTCCATACAGTGATTATGCCTGTC 

ATCGCTGGTGGTATCCTGGCGGCCTTGCTCCTGCTGATAGTTGTCGTGCTCTGTCTTTACTTCAAAATACACAAC 

GCGCTAAAAGCTGCAAAGGAACCTGAAGCTGTGGCTGTAAAAAATCACAACCCAGACAAGGTGTGGTGGGCCAAG 

AACAGCCAGGCCAAAACCATTGCCACGGAGTCTTGTCCTGCCCTGCAGTGCTGTGAAGGATATAGAATGTGTGCC 

AGTTTTGATTCCCTGCCACCTTGCTGTTGCGACATAAATGAGGGCCTCTGAGTTAGGAAAGGCTCCCTTCTCAAA 

GCAGAGCCCTGAAGACTTCAATGATGTCAATGAGGCCACCTGTTTGTGATGTGCAGGCACAGAAGAAAGGCACAG 

CTCCCCATCAGTTTCATGGAAAATAACTCAGTGCCTGCTGGGAACCAGCTGCTGGAGATCCCTACAGAGAGCTTC 

CACTGGGGGCAACCCTTCCAGGAAGGAGTTGGGGAGAGAGAACCCTCACTGTGGGGAATGCTGATAAACCAGTCA 

CACAGCTGCTCTATTCTCACACAAATCTACCCCTTGCGTGGCTGGAACTGACGTTTCCCTGGAGGTGTCCAGAAA 

GCTGATGTAACACAGAGCCTATAAAAGCTGTCGGTCCTTAAGGCTGCCCAGCGCCTTGCCAAAATGGAGCTTGTA 

AGAAGGCTCATGCCATTGACCCTCTTAATTCTCTCCTGTTTGGCGGAGCTGACAATGGCGGAGGCTGAAGGCAAT 

GCAAGCTGCACAGTCAGTCTAGGGGGTGCCAATATGGCAGAGACCCACAAAGCCATGATCCTGCAACTCAATCCC 

AGTGAGAACTGCACCTGGACAATAGAAAGACCAGAAAACAAAAGCATCAGAATTATCTTTTCCTATGTCCAGCTT 

GATCCAGATGGAAGCTGTGAAAGTGAAAACATTAAAGTCTTTGACGGAACCTCCAGCAATGGGCCTCTGCTAGGG 

CAAGTCTGCAGTAAAAACGACTATGTTCCTGTATTTGAATCATCATCCAGTACATTGACGTTTCAAATAGTTACT 

GACTCAGCAAGAATTCAAAGAACTGTCTTTGTCTTCTACTACTTCTTCTCTCCTAACATCTCTATTCCAAACTGT 

GGCGGTTACCTGGATACCTTGGAAGGATCCTTCACCAGCCCCAATTACCCAAAGCCGCATCCTGAGCTGGCTTAT 

TGTGTGTGGCACATACAAGTGGAGAAAGATTACAAGATAAAACTAAACTTCAAAGAGATTTTCCTAGAAATAGAC 

AAACAGTGCAAATTTGATTTTCTTGCCATCTATGATGGCCCCTCCACCAACTCTGGCCTGATTGGACAAGTCTGT 

GGCCGTGTGACTCCCACCTTCGAATCGTCATCAAACTCTCTGACTGTCGTGTTGTCTACAGATTATGCCAATTCT 

TACCGGGGATTTTCTGCTTCCTACACCTCAATTTATGCAGAAAACATCAACACTACATCTTTAACTTGCTCTTCT 

GACAGGATGAGAGTTATTATAAGCAAATCCTACCTAGAGGCTTTTAACTCTAATGGGAATAACTTGCAACTAAAA 

GACCCAACTTGCAGACCAAAATTATCAAATGTTGTGGAATTTTCTGTCCCTCTTAATGGATGTGGTACAATCAGA 

AAGGTAGAAGATCAGTCAATTACTTACACCAATATAATCACCTTTTCTGCATCCTCAACTTCTGAAGTGATCACC 

CGTCAGAAACAACTCCAGATTATTGTGAAGTGTGAAATGGGACATAATTCTACAGTGGAGATAATATACATAACA 

GAAGATGATGTAATACAAAGTCAAAATGCACTGGGCAAATATAACACCAGCATGGCTCTTTTTGAATCCAATTCA 

TTTGAAAAGACTATACTTGAATCACCATATTATGTGGATTTGAACCAAACTCTTTTTGTTCAAGTTAGTCTGCAC 

ACCTCAGATCCAAATTTGGTGGTGTTTCTTGATACCTGTAGAGCCTCTCCCACCTCTGACTTTGCATCTCCAACC 

TACGACCTAATCAAGAGTGGATGTAGTCGAGATGAAACTTGTAAGGTGTATCCCTTATTTGGACACTATGGGAGA 

TTCCAGTTTAATGCCTTTAAATTCTTGAGAAGTATGAGCTCTGTGTATCTGCAGTGTAAAGTTTTGATATGTGAT 

AGCAGTGACCACCAGTCTCGCTGCAATCAAGGTTGTGTCTCCAGAAGCAAACGAGACATTTCTTCATATAAATGG 

AAAACAGATTCCATCATAGGACCCATTCGTCTGAAAAGGGATCGAAGTGCAAGTGGCAATTCAGGATTTCAGCAT 

GAAACACATGCGGAAGAAACTCCAAACCAGCCTTTCAACAGTGTGCATCTGTTTTCCTTCATGGTTCTAGCTCTG 

AATGTGGTGACTGTAGCGACAATCACAGTGAGGCATTTTGTAAATCAACGGGCAGACTACAAATACCAGAAGCTG 

CAGAACTATTAACTAACAGGTCCAACCCTAAGTGAGACATGTTTCTCCAGGATGCCAAAGGAAATGCTACCTCGT 

GGCTACACATATTATGAATAAATGAGGAAGGGCCTGAAAGTGACACACAGGCCTGCATGTAAAAAAA 




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MELVRRLMPLTLLILSCLAELTMAEAEGNASCTVSLGGANMAETHKAMILQLNPSENCTWTIE
RPENKSIRIIFSYVQLDPDGSCESENIKVFDGTSSNGPLLGQVCSKNDYVPVFESSSSTLTFQ 
IVTDSARIQRTVFVFYYFFSPNISIPNCGGYLDTLEGSFTSPNYPKPHPELAYCVWHIQVEKD
YKIKLNFKEIFLEIDKQCKFDFLAIYDGPSTNSGLIGQVCGRVTPTFESSSNSLTVVLSTDYA 
NSYRGFSASYTSIYAENINTTSLTCSSDRMRVIISKSYLEAFNSNGNNLQLKDPTCRPKLSNV
VEFSVPLNGCGTIRKVEDQSITYTNIITFSASSTSEVITRQKQLQIIVKCEMGHNSTVEIIYI
TEDDVIQSQNALGKYNTSMALFESNSFEKTILESPYYVDLNQTLFVQVSLHTSDPNLVVFLDT 
CRASPTSDFASPTYDLIKSGCSRDETCKVYPLFGHYGRFQFNAFKFLRSMSSVYLQCKVLICD 
SSDHQSRCNQGCVSRSKRDISSYKWKTDSIIGPIRLKRDRSASGNSGFQHETHAEETPNQPFN 
SVHLFSFMVLALNVVTVATITVRHFVNQRADYKYQKLQNY

Important features: 
Signal sequence: 

amino acids 1-24 

Transmembrane domain: 

amino acids 571-586 

N-glycosylation site. 

amino acids 29-33, 57-61, 67-71, 148-152, 271-275, 370-374,
394-398, 419-423 

Casein kinase II phosphorylation site. 

amino acids 22-26, 108-112, 289-293, 348-352, 371-375, 379-383, 
408-412, 463-467, 520-524, 556-560 

Tyrosine kinase phosphorylation site.

amino acids 172-180, 407-415, 407-416, 519-528 

N-myristoylation site. 

amino acids 28-34, 38-44, 83-89, 95-101, 104-110, 226-232 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 7-18 
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FIGURE 345 

TGGGGGCCCCCCAGGCTCGCGCGTGGAGCGAAGCAGCATGGGCAGTCGGTGCGCGCTGGCCCTGGCGGTGCTCTC 

ggccttgctgtgtcaggtctggagctctggggtgttcgaactgaagctgcaggagttcgtScaagSgg^S 

GCTGGGGAACCGCAATTGCTGCCGCGGGGGCGCGGGGCCACCGCCGTGCGCCTGCCG^ 

cctcaagcactaccaggccagcgtgtcccccgagccgccctgcacctacggcagcgSgScScStS^^ 

^S^^^^^^^^^^^<^^^^°CCC^^C<^<^CGGGGGCGCCGACT.CCGC^^^^^ 

cttcacctggccgggcaccttctctctgattattgaagctctccacacagattctcctgatgacct 

aaacccagaaagactcatcagccgcctggccacccagaggcacctgacggtggSgS 

gcacagcagcggccgcacggacctcaagtactcctaccgcttcgtgtgtgacgaacaSactSSg^^ 

ctccgttttctgccgtccccgggacgatgccttcggco^cttcacctSgScSS 

ccctggctggaaagggccctactgcacagagccgatctgcctgcctggatgtgatgagcaSat^ 

CTGTTCTAATGGTGCCAAGTGTGTGGACCTCGGTGATGCCTACCTGTGCCGCTGCCAGGCCGGCTTCTCGGGr^^ 

cgacttctcctgcacctgcccgcctggctacacgggcaggaactgcagtgcccccgtcagcagctgcSgSc^ 
accctgccacaatggggccacctgccacgagaggggccaccgctatgtgtgcgagtgtgScSggcScSg^^ 
^^^^^^^[^^^^^tt^ctgctccccgagctgcccccgggcccagcggtgg?^^^ 

^S^S^S^^^^^c^ttcccctgggtggccgtgtgcgccggggtcatccttgtcctcatgctgctgct^Sc^^ 

^n^J»I^f^^^^'''^^'^''*='=^'^^^'^^°^T°C^<^^GCACCGGCCCCCAGCCGACCCCTS^ 

gaccatgaacaacctggccaactgccagcgtgagaaggacatctcagtcagcatcatcggggccacgSS^^ 

r^r?»n^S'^°'^°*'^^°^^^^^^^^^G«^^CCACAGCGCCGACAAGAAT^^^ 

AAGAAAAAGGCCGGACTCGGGCTGTTCAACTTCAAAAGACACCAAGTACCAGTCGGTGTACGTCATATCCGAG^^^ 
GAAGGATGAGTGCGTCATAGCAACTGAGGTGTAiy^TGGAAGTGAGATGGCAAGACTrc^^^ 
nr^^Jn»^^^''''''''''^^^^^^°^'^'^CTGCTGAAGAGGAGGGAGGCCTCGTGS^^^ 
^™^^^^^^^^^^*^°^'^^^^'^^CTGAGGTCCTCGACGCCTGCCGACAGCCTGTCGCGGCCCG^ 

tgcggo^ctgccttccgtgacgtcgccgttgcactatggacagttgctcttaagagaatatatatSSg^ 

gaactgaattacgcataagaagcatgcactgcctgagtgtatattttgSttcttatg^^ 

attagaaacacaaacactgcctttattgtcctttttgatacgaagatgtgctttttcSStggIS 

^^^^"^^''''^^'''''^^^tttttcatgatatctgtaaagcttgagtS^^ 

™JJ™^^^*^°^^'^^^^^'^^*^°gcacttcgggtctatgtgactatatttttttgtatm 

ttatggaatattgtgcaaatgttatttgagttttttactgttttgttaatgaagaaattccwSaaa^ 
ttccaaaataaattttatgaatgao^aaaaaaaw^^ 
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FIGURE 346 



MGSRCALALAVLSALLCQVWSSGVFELKLQEFVNKKGLLGNRNCCRGGAGPPPCACRTFFRVC 
LKHYQASVSPEPPCTYGSAVTPVLGVDSFSLPDGGGADSAFSNPIRFPFGFTWPGTFSLIIEA 
LHTDSPDDLATENPERLISRLATQRHLTVGEEWSQDLHSSGRTDLKYSYRFVCDEHYYGEGCS 
VFCRPRDDAFGHFTCGERGEKVCNPGWKGPYCTEPICLPGCDEQHGFCDKPGECKCRVGWQGR 
YCDECIRYPGCLHGTCQQPWQCNCQEGWGGLFCNQDLNYCTHHKPCKNGATCTNTGQGSYTCS 
CRPGYTGATCELGIDECDPSPCKNGGSCTDLENSYSCTCPPGFYGKICELSAMTCADGPCFNG 
GRCSDSPDGGYSCRCPVGYSGFNCEKKIDYCSSSPCSNGAKCVDLGDAYLCRCQAGFSGRHCD 
DNVDDCASSPCANGGTCRDGVNDFSCTCPPGYTGRNCSAPVSRCEHAPCHNGATCHERGHRYV 
CECARGYGGPNCQFLLPELPPGPAVVDLTEKLEGQGGPFPWVAVCAGVILVLMLLLGCAAWV 
CVRLRLQKHRPPADPCRGETETMNNLANCQREKDISVSIIGATQIKNTNKKADFHGDHSADKN 
GFKARYPAVDYNLVQDLKGDDTAVRDAHSKRDTKCQPQGSSGEEKGTPTTLRGGEASERKRPD 
SGCSTSKDTKYQSVYVISEEKDECVIATEV 

Important features: 
Signal sequence:

Amino acids 1-21 

Transmembrane domain: 

Amino acids 546-566 

N-glycosylation site: 

Amino acids 477-481 

cAMP- and cGMP-dependent protein kinase phosphorylation site: 

Amino acids 660-664 

Tyrosine kinase phosphorylation sites: 

Amino acids 176-185; 252-261

N-myristoylation sites:

Amino acids 2-8; 37-43; 40-46; 98-104; 99-105; 262-268; 281-287;
282-288; 301-307; 310-316; 328-334; 340-344; 378-384; 387-393; 512-518;
676-682; 683-689; 695-701

Aspartic acid and asparagine hydroxylation sites: 

Amino acids 343-355; 420-432; 458-470 

Prokaryotic membrane lipoprotein lipid attachment site: 

Amino acids 552-563 

EGF-like domain cysteine pattern signature:

Amino acids 243-255; 274-286; 314-326; 352-364; 391-403; 429-441;
467-479; 505-517
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FIGURE 347 



CCCACGCGTCCGCACCTCGGCCCCGGGCTCCGAAGCGGCTCGGGGGCGCCCTTTCGGTCAACA 

TCGTAGTCCACCCCCTCCCCATCCCCAGCCCCCGGGGATTCAGGCTCGCCAGCGCCCAGCCAG 

GGAGCCGGCCGGGAAGCGCGATGGGGGCCCCAGCCGCCTCGCTCCTGCTCCTGCTCCTGCTGT

TCGCCTGCTGCTGGGCGCCCGGCGGGGCCAACCTCTCCCAGGACGACAGCCAGCCCTGGACAT 

CTGATGAAACAGTGGTGGCTGGTGGCACCGTGGTGCTCAAGTGCCAAGTGAAAGATCACGAGG 

ACTCATCCCTGCAATGGTCTAACCCTGCTCAGCAGACTCTCTACTTTGGGGAGAAGAGAGCCC 

TTCGAGATAATCGAATTCAGCTGGTTACCTCTACGCCCCACGAGCTCAGCATCAGCATCAGCA 

ATGTGGCCCTGGCAGACGAGGGCGAGTACACCTGCTCAATCTTCACTATGCCTGTGCGAACTG 

CCAAGTCCCTCGTCACTGTGCTAGGAATTCCACAGAAGCCCATCATCACTGGTTATAAATCTT 

CATTACGGGAAAAAGACACAGCCACCCTAAACTGTCAGTCTTCTGGGAGCAAGCCTGCAGCCC 

GGCTCACCTGGAGAAAGGGTGACCAAGAACTCCACGGAGAACCAACCCGCATACAGGAAGATC 

CCAATGGTAAAACCTTCACTGTCAGCAGCTCGGTGACATTCCAGGTTACCCGGGAGGATGATG 

GGGCGAGCATCGTGTGCTCTGTGAACCATGAATCTCTAAAGGGAGCTGACAGATCCACCTCTC 

AACGCATTGAAGTTTTATACACACCAACTGCGATGATTAGGCCAGACCCTCCCCATCCTCGTG 

AGGGCCAGAAGCTGTTGCTACAGTGTGAGGGTCGCGGCAATCCAGTCCCCCAGCAGTACCTAT 

GGGAGAAGGAGGGCAGTGTGCCACCCCTGAAGATGACCCAGGAGAGTGCCCTGATCTTCCCTT 

TCCTCAACAAGAGTGACAGTGGCACCTACGGCTGCACAGCCACCAGCAACATGGGCAGCTACA 

AGGCCTACTACACCCTCAATGTTAATGACCCCAGTCCGGTGCCCTCCTCCTCCAGCACCTACC 

ACGCCATCATCGGTGGGATCGTGGCTTTCATTGTCTTCCTGCTGCTCATCATGCTCATCTTCC 

TTGGCCACTACTTGATCCGGCACAAAGGAACCTACCTGACACATGAGGCAAAAGGCTCCGACG 

ATGCTCCAGACGCGGACACGGCCATCATCAATGCAGAAGGCGGGCAGTCAGGAGGGGACGACA 

AGAAGGAATATTTCATCTAGAGGCGCCTGCCCACTTCCTGCGCCCCCCAGGGGCCCTGTGGGG 

ACTGCTGGGGCCGTCACCAACCCGGACTTGTACAGAGCAACCGCAGGGCCGCCCCTCCCGCTT 

GCTCCCCAGCCCACCCACCCCCCTGTACAGAATGTCTGCTTTGGGTGCGGTTTTGTACTCGGT 

TTGGAATGGGGAGGGAGGAGGGCGGGGGGAGGGGAGGGTTGCCCTCAGCCCTTTCCGTGGCTT 

CTCTGCATTTGGGTTATTATTATTTTTGTAACAATCCCAAATCAAATCTGTCTCCAGGCTGGA 

GAGGCAGGAGCCCTGGGGTGAGAAAAGCAAAAAACAAACAAAAAACA 
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FIGURE 348 



MGAPAASLLLLLLLFACCWAPGGANLSQDDSQPWTSDETVVAGGTVVLKCQVKDHEDSSLQWS 
NPAQQTLYFGEKRALRDNRIQLVTSTPHELSISISNVALADEGEYTCSIFTMPVRTAKSLVTV 
LGIPQKPIITGYKSSLREKDTATLNCQSSGSKPAARLTWRKGDQELHGEPTRIQEDPNGKTFT 
VSSSVTFQVTREDDGASIVCSVNHESLKGADRSTSQRIEVLYTPTAMIRPDPPHPREGQKLLL 
HCEGRGNPVPQQYLWEKEGSVPPLKMTQESALIFPFLNKSDSGTYGCTATSNMGSYKAYYTLN 
VNDPSPVPSSSSTYHAIIGGIVAFIVFLLLIMLIFLGHYLIRHKGTYLTHEAKGSDDAPDADT 
AIINAEGGQSGGDDKKEYFI

Important features:
Signal sequence: 

amino acids 1-20 

Transmembrane domain: 

amino acids 331-352

N-glycosylation site. 

amino acids 25-29, 290-294 

Casein kinase II phosphorylation site. 

amino acids 27-31, 35-39, 89-93, 141-145, 199-203, 388-392 
N-myristoylation site. 

amino acids 2-8, 23-29, 156-162, 218-224, 295-301, 298-304, 
306-310, 334-340, 360-364, 385-389, 386-390 

Prokaryotic membrane lipoprotein lipid attachment site.

amino acids 7-18 
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ACTTGCCATCACCTGTTGCCAGTGTGGAAAAATTCTCCCTGTTGAATTTTTTGCACATGGAGGACAGCAGCAAAG 

AGGGCAACACAGGCTGATAAGACCAGAGACAGCAGGGAGATTATTTTACCATACGCCCTCAGGACGTTCCCTCTA 

GCTGGAGTTCTGGACTTCAACAGAACCCCATCCAGTCATTTTGATTTTGCTGTTTATTTTTTTTTTCTTTTTCTT 

TTTCCCACCACATTGTATTTTATTTCCGTACTTCAGAAATGGGCCTACAGACCACAAAGTGGCCCAGCCATGGGG

CTTTTTTCCTGAAGTCTTGGCTTATCATTTCCCTGGGGCTCTACTCACAGGTGTCCAAACTCCTGGCCTGCCCTA 

GTGTGTGCCGCTGCGACAGGAACTTTGTCTACTGTAATGAGCGAAGCTTGACCTCAGTGCCTCTTGGGATCCCGG 

AGGGCGTAACCGTACTCTACCTCCACAACAACCAAATTAATAATGCTGGATTTCCTGCAGAACTGCACAATGTAC 

AGTCGGTGCACACGGTCTACCTGTATGGCAACCAACTGGACGAATTCCCCATGAACCTTCCCAAGAATGTCAGAG 

TTCTCCATTTGCAGGAAAACAATATTCAGACCATTTCACGGGCTGCTCTTGCCCAGCTCTTGAAGCTTGAAGAGC 

TGCACCTGGATGACAACTCCATATCCACAGTGGGGGTGGAAGACGGGGCCTTCCGGGAGGCTATTAGCCTCAAAT 

TGTTGTTTTTGTCTAAGAATCACCTGAGCAGTGTGCCTGTTGGGCTTCCTGTGGACTTGCAAGAGCTGAGAGTGG

ATGAAAATCGAATTGCTGTCATATCCGACATGGCCTTCCAGAATCTCACGAGCTTGGAGCGTCTTATTGTGGACG

GGAACCTCCTGACCAACAAGGGTATCGCCGAGGGCACCTTCAGCCATCTCACCAAGCTCAAGGAATTTTCAATTG

TACGTAATTCGCTGTCCCACCCTCCTCCCGATCTCCCAGGTACGCATCTGATCAGGCTCTATTTGCAGGACAACC 

AGATAAACCACATTCCTTTGACAGCCTTCTCAAATCTGCGTAAGCTGGAACGGCTGGATATATCCAACAACCAAC 

TGCGGATGCTGACTCAAGGGGTTTTTGATAATCTCTCCAACCTGAAGCAGCTCACTGCTCGGAATAACCCTTGGT 

TTTGTGACTGCAGTATTAAATGGGTCACAGAATGGCTCAAATATATCCCTTCATCTCTCAACGTGCGGGGTTTCA 

TGTGCCAAGGTCCTGAACAAGTCCGGGGGATGGCCGTCAGGGAATTAAATATGAATCTTTTGTCCTGTCCCACCA 

CGACCCCCGGCCTGCCTCTCTTCACCCCAGCCCCAAGTACAGCTTCTCCGACCACTCAGCCTCCCACCCTCTCTA 

TTCCAAACCCTAGCAGAAGCTACACGCCTCCAACTCCTACCACATCGAAACTTCCCACGATTCCTGACTGGGATG 

GCAGAGAAAGAGTGACCCCACCTATTTCTGAACGGATCCAGCTCTCTATCCATTTTGTGAATGATACTTCCATTC 

AAGTCAGCTGGCTCTCTCTCTTCACCGTGATGGCATACAAACTCACATGGGTGAAAATGGGCCACAGTTTAGTAG 

GGGGCATCGTTCAGGAGCGCATAGTCAGCGGTGAGAAGCAACACCTGAGCCTGGTTAACTTAGAGCCCCGATCCA 

CCTATCGGATTTGTTTAGTGCCACTGGATGCTTTTAACTACCGCGCGGTAGAAGACACCATTTGTTCAGAGGCCA 

CCACCCATGCCTCCTATCTGAACAACGGCAGCAACACAGCGTCCAGCCATGAGCAGACGACGTCCCACAGCATGG 

GCTCCCCCTTTCTGCTGGCGGGCTTGATCGGGGGCGCGGTGATATTTGTGCTGGTGGTCTTGCTCAGCGTCTTTT 

GCTGGCATATGCACAAAAAGGGGCGCTACACCTCCCAGAAGTGGAAATACAACCGGGGCCGGCGGAAAGATGATT 

ATTGCGAGGCAGGCACCAAGAAGGACAACTCCATCCTGGAGATGACAGAAACCAGTTTTCAGATCGTCTCCTTAA 

ATAACGATCAACTCCTTAAAGGAGATTTCAGACTGCAGCCCATTTACACCCCAAATGGGGGCATTAATTACACAG 

ACTGCCATATCCCCAACAACATGCGATACTGCAACAGCAGCGTGCCAGACCTGGAGCACTGCCATACGTGACAGC 

CAGAGGCCCAGCGTTATCAAGGCGGACAATTAGACTCTTGAGAACACACTCGTGTGTGCACATAAAGACACGCAG 

ATTACATTTGATAAATGTTACACAGATGCATTTGTGCATTTGAATACTCTGTAATTTATACGGTGTACTATATAA 

TGGGATTTAAAAAAAGTGCTATCTTTTCTATTTCAAGTTAATTACAAACAGTTTTGTAACTCTTTGCTTTTTAAA 
TCTT 
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FIGURE 350 



MGLQTTKWPSHGAFFLKSWLIISLGLYSQVSKLLACPSVCRCDRNFVYCNERSLTSVPLGIPE 
GVTVLYLHNNQINNAGFPAELHNVQSVHTVYLYGNQLDEFPMNLPKNVRVLHLQENNIQTISR 
AALAQLLKLEELHLDDNSISTVGVEDGAFREAISLKLLFLSKNHLSSVPVGLPVDLQELRVDE 
NRIAVISDMAFQNLTSLERLIVDGNLLTNKGIAEGTFSHLTKLKEFSIVRNSLSHPPPDLPGT
HLIRLYLQDNQINHIPLTAFSNLRKLERLDISNNQLRMLTQGVFDNLSNLKQLTARNNPWFCD 
CSIKWVTEWLKYIPSSLNVRGFMCQGPEQVRGMAVRELNMNLLSCPTTTPGLPLFTPAPSTAS 
PTTQPPTLSIPNPSRSYTPPTPTTSKLPTIPDWDGRERVTPPISERIQLSIHFVNDTSIQVSW 
LSLFTVMAYKLTWVKMGHSLVGGIVQERIVSGEKQHLSLVNLEPRSTYRICLVPLDAFNYRAV 
EDTICSEATTHASYLNNGSNTASSHEQTTSHSMGSPFLLAGLIGGAVIFVLVVLLSVFCWHMH 
KKGRYTSQKWKYNRGRRKDDYCEAGTKKDNSILEMTETSFQIVSLNNDQLLKGDFRLQPIYTP 
NGGINYTDCHIPNNMRYCNSSVPDLEHCHT 

Important features:
Signal peptide:
amino acids 1-42
Transmembrane domain:

amino acids 542-561

N-glycosylation site.

amino acids 202-206, 298-302, 433-437, 521-525, 635-639, 649-653
Casein kinase II phosphorylation site.

amino acids 204-208, 407-411, 527-531, 593-597, 598-602, 651-655
Tyrosine kinase phosphorylation site.

amino acids 319-328
N-myristoylation site.

amino acids 2-8, 60-66, 149-155, 213-219, 220-226, 294-300,
522-528, 545-551, 633-639
Amidation site.

amino acids 581-585

Leucine zipper pattern. 

amino acids 164-186 

Phospholipase A2 aspartic acid active site. 

amino acids 39-50 
AGCCGACGCTGCTCAAGCTGCAACTCTGTTGCAGTTGGCAGTTCTTTTCGGTTTCCCTCCTGCTGTTTGGGGGCA
TGAAAGGGCTTCGCCGCCGGGAGTAAAAGAAGGAATTGACCGGGCAGCGCGAGGGAGGAGCGCGCACGCGACCGC 

gagggcgggcgtgcaccctcggctggaagtttgtgccgggccccgagcgcgcgccggctgggSttcgSS 

GACCTAGGCCGCTGGACCGCGATGAGCGCGCCGAGCCTCCGTGCGCGCGCCGCGGGGTTGGGGCTGCTGCTGTGC 

gcggtgctggggcgcgctggccggtccgacagcggcggtcgcggggaactcgggcagccctctggStaSgcc 

cccgagccactcccgtcctgggtcgctcggctggacttaagtcacaacagattatctttcatcaaggSg^ 

atgagccaccttcaaagccttcgagaagtgaaactgaacaacaatgaattggagaSattcc^ 

gtctcggcaaatattacacttctctccttggctggaaacaggattgttgaaatSSctgSc^ 

tttcagtcccttgaaactttggaccttagcagcaacaatatttoigagctccaaactgStS^ 

^3™™^*=^'^^^^^^^^^^°^cc^a^cacatcaatggaacctgggtattttgacaaS 

^IS^.^^^^^^^^^^^^^^^^'^^t^cagctatcccacccaagatgtttaaactgccccSct^^ 

Sr^In^^™^^^''^'^'^''^^™^'^^^^^^GCTTTTTGGGGGCTGAGCAACATGGAA^^ 
o^^f^'^'^^^^^^^^^^^'^^^^^^^^'^'I'ACCAAAGGCTGGCTTTACGGCTTGCTGATGCTGCAGGAACTTCAT 
fr™^^^''''''^'^^^^^^^^^<^'=^^"^^^'r^CC'rGGGAGTTCTGCCAGAAS^^^ 
ACTTTCAATCACTTATCAAGGTTAGATGATTCAAGCTTCCTTGGCCTAAGCTTACTAAATACACTGCACA^^^ 

aacaacagagtcagctacattgctgattgtgccttccgggggctttcStaSSSSSg^g^ 

aatgaaatttcctggactattgaagacatgaatggtgctttctctgggcttgacaaactSgcSSgat^ 

caaggaaatcggatccgttctattactaaaaaagccttcactggtttggatgcattgSgcatct^^ 

^^^^^^^^''''''^''''^^^^^^^''^^^tgcattttcacaaatgaagaaactgc^ 

tcaagccttttgtgcgattgccagctaaaatggctcccacagtgggtggcggaaaacaacttocS^^ 

^^^S^TTTTCCCAAACCCCAGATCACGGTTCAGCCAGAAACACAGTCGGCAATAAAAGGTTCCM^^ 

ATCTGCTCAGCTGCCAGCAGCAGTGATTCCCCAATGACTTTTGCTTGGAAAAAAGACAATGAACTACTC^^ 

GCTGAAATGGAAAATTATGCACACCTCCGGGCCCAAGGTGGCGAGGTGATGGAGTATACCAC^^^ 

AAAGCCAAGCTTACAGTAAATATGCTTCCCTCATTOVCCAAGACCCCCATGGATCTCACCATCCGAGCTGG^ 

atggcacgcttggagtgtgctgctgtggggcacccagccccccagatagcctggS^^ 

r^rr^r^$J^^:^°^^'^^^^"'^^^^°T^^TS'=CCGAGGATGACGTGTT 

rT^^^^I™^^^'''''''^^''^^''^^^^^^^^^^T<^CAGGAAGTATTTCAGCAAATGC^^^ 

CTAGAAACACCATCATTTTTGCGGCCACTGTTGGACCGAACTGTAACCAAGGGAGAAACAGCCGTCCTACAGTGC 

ATTGCTGGAGGAAGCCCTCCCCCTAAACTGAACTGGACCAAAGATGATAGCCCATTGGTGGTAA^ 

^^^^^^''^^^''^^^''^^^^^'^CTGATTATTGTGGACTCAGATGTCAGTGMGCTGGG^ 

ATGTCTAACACCCTTGGCACTGAGAGAGGAAACGTGCGCCTCAGTGTGATCCCCACTCCAACCTGCGACTC^ 

^^^^^^^^^c^c^^'^c^ttagacgatgacggatgggccactgtgggtgtcgtStS^SS 

gtcgtgggcacgtcactcgtgtgggtggtcatcatataccacacaaggcggaggaatgaaStogSgcatScc 

aacacagatgagaccaacttgccagcagatattcctagttatttgtcatctcagggaacgttagc^^ 

gatgggtacgtgtcttcagaaagtggaagccaccaccagtttgtcacatcttcmgtgS^^ 

^S^S^'^^^^^^^^^t'^^^^cctgccatattgacaatagcagtgaagctgatgtggaagctISagm^ 

ctttgtccgtttttgggatccacaggccctatgtatttgaagggaaatgtgtatggctSSc^^^ 

tatcatacaggttgcagtcctgaccou^gaacagttttaatggaccactatgagcccacttac^^ 

gagtgctacco^tgttctcatccttgagaagaatcctgcgaacggagcttcagStS^ 

SoS^'^'^^'^^'^'^^^^^^^'^TTACTCTCACAATGAAGGACCTGGAATGAAAAATCT^^ 
J^3^Io^r^''^°^°^^'"CCAGAGCCAGCGTCGGTTGCCTCGAGTAATTCT^ 

gctctcaggagacctcacctagatgcctattcaagctttggacagccatcagattgtcagccaaga^ 

gaaaatcacatttgtacctttaaacagactttagaaaactacaggactccaaattttcagtcttatgactSac 
acasmactgaatgagaccaaaggaaaagcttaacatactacctcaagtgaacttttatttaaaagagagagaat 
cttatgttttttaaatggagttatgaattttaaaaggataaaaatgctttatttataSStgaaSSS?^^ 
aaaaagttatgaaaatttttatactgggaatgatgctcatataagaatacctttttaSc^^^ 

niJ^I£^''''^^^^^C^^^C'=TAAATTAATGATATAAATCATGATTATTTTATGTA^ 

tt^ctttttatggaaaatgagttactaaagcattttaaataatacctgccttgtaccatt^ 

ACTTCATTATATTTTGCACATTATATTTAATAAAATGTGTCAATTTGAAAAAAAAAAAAAAwSJ 
MSAPSLRARAAGLGLLLCAVLGRAGRSDSGGRGELGQPSGVAAERPCPTTCRCLGDLLDCSRKRLARLPEPLPSW
VARLDLSHNRLSFIKASSMSHLQSLREVKLNNNELETIPNLGPVSANITLLSLAGNRIVEILPEHLKEFQSLETL 
DLSSNNISELQTAFPALQLKYLYLNSNRVTSMEPGYFDNLANTI.LVLKLNRNRISAIPPKMFKLPQLQHLELNRN 
KIKNVDGLTFQGLGALKSLKMQRNGVTKLMDGAFWGLSNMEILQLDHNNLTEITKGWLYGLLMLQELHLSQNAIN
RISPDAWEFCQKLSELDLTFNHLSRLDDSSFLGLSLLNTLHIGNNRVSYIADCAFRGLSSLKTLDLKNNEISWTI 
EDMNGAFSGLDKLRRLILQGNRIRSITKKAFTGLDALEHLDLSDNAIMSLQGNAFSQMKKLQQLHLNTSSLLCDC 
QLKWLPQWVAENNFQSFVNASCAHPQLLKGRSIFAVSPDGFVCDDFPKPQITVQPETQSAIKGSNLSFICSAASS 
SDSPMTFAWKKDNELLHDAEMENYAHLRAQGGEVMEYTTILRLREVEFASEGKYQCVISNHFGSSYSVKAKLTVN 
MLPSFTKTPMDLTIRAGAMARLECAAVGHPAPQIAWQKDGGTDFPAARERRMHVMPEDDVFFIVDVKIEDIGVYS 
CTAQNSAGSISANATLTVLETPSFLRPLLDRTVTKGETAVLQCIAGGSPPPKLNWTKDDSPLVVTERHFFAAGNQ 
LLIIVDSDVSDAGKYTCEMSNTLGTERGNVRLSVIPTPTCDSPQMTAPSLDDDGWATVGWIIAVVCCWGTSLV 
WWIIYHTRRRNEDCSITNTDETNLPADIPSYLSSQGTLADRQDGYVSSESGSHHQFVTSSGAGFFLPQHDSSGT
CHIDNSSEADVEAATDLFLCPFLGSTGPMYLKGNVYGSDPFETYHTGCSPDPRTVLMDHYEPSYIKKKECYPCSH 
PSEESCERSFSNISWPSHVRKLLNTSYSHNEGPGMKNLCLNKSSLDFSANPEPASVASSNSFMGTFGBCALRRPHL 
DAYSSFGQPSDCQPRAFYLKAHSSPDLDSGSEEDGKERTDFQEENHICTFKQTLENYRTPNFQSYDLDT 

Important features:
Signal sequence:
amino acids 1-27

Transmembrane domain:
amino acids 808-828

N-glycosylation site.
amino acids 122-126, 156-160, 274-278, 442-446, 469-473, 515-519,
688-692, 729-733, 905-909, 987-991, 999-1003, 1016-1020

Glycosaminoglycan attachment site.
amino acids 886-890

Casein kinase II phosphorylation site.
amino acids 99-103, 180-184, 263-267, 314-318, 324-328, 374-378,
383-387, 407-411, 524-528, 608-612, 692-696, 709-713, 731-735,
799-803, 843-847, 863-867, 907-911, 1003-1007, 1018-1022,
1073-1077, 1079-1083, 1081-1085

Tyrosine kinase phosphorylation site.
amino acids 667-675

N-myristoylation site.
amino acids 14-20, 36-42, 239-245, 257-263, 380-386, 427-433,
513-519, 588-594, 672-678, 683-687, 774-780, 933-939

Leucine zipper pattern.
amino acids 58-80, 65-87
GGGGGTTAGGGAGGAAGGAATCCACCCCCACCCCCCCAAACCCTTTTCTTCTCCTTTCCTGGCTTCGGACATTGG

AGCACTAAATGAACTTGAATTGTGTCTGTGGCGAGCAGGATGGTCGCTGTTACTTTGTGATGAGATCGGGGATGA 

ATTGCTCGCTTTAAAAATGCTGCTTTGGATTCTGTTGCTGGAGACGTCTCTTTGTTTTGCCGCTGGAAACGTTAC

AGGGGACGTTTGCAAAGAGAAGATCTGTTCCTGCAATGAGATAGAAGGGGACCTACACGTAGACTGTGAAAAAAA

GGGCTTCACAAGTCTGCAGCGTTTCACTGCCCCGACTTCCCAGTTTTACCATTTATTTCTGCATGGCAATTCCCT 

CACTCGACTTTTCCCTAATGAGTTCGCTAACTTTTATAATGCGGTTAGTTTGCACATGGAAAACAATGGCTTGCA 

TGAAATCGTTCCGGGGGCTTTTCTGGGGCTGCAGCTGGTGAAAAGGCTGCACATCAACAACAACAAGATCAAGTC 

TTTTCGAAAGCAGACTTTTCTGGGGCTGGACGATCTGGAATATCTCCAGGCTGATTTTAATTTATTACGAGATAT

AGACCCGGGGGCCTTCCAGGACTTGAACAAGCTGGAGGTGCTCATTTTAAATGACAATCTCATCAGCACCCTACC

TGCCAACGTGTTCCAGTATGTGCCCATCACCCACCTCGACCTCCGGGGTAACAGGCTGAAAACGCTGCCCTATGA 

GGAGGTCl-rGGAGCAAATCCCTGGTATTGCGGAGATCCTGCTAGAGGATAACCCTTGGGACTGCACCTGTGATCT 

GCTCTCCCTGAAAGAATGGCTGGAAAACATTCCCAAGAATGCCCTGATCGGCCGAGTGGTCTGCGAAGCCCCCAC 

CAGACTGCAGGGTAAAGACCTCAATGAAACCACCGAACAGGACTTGTGTCCTTTGAAAAACCGAGTGGATTCTAG 

TCTCCCGGCGCCCCCTGCCCAAGAAGAGACCTTTGCTCCTGGACCCCTGCCAACTCCTTTCAAGACAAATGGGCA 

AGAGGATCATGCCACACCAGGGTCTGCTCCAAACGGAGGTACAAAGATCCCAGGCAACTGGCAGATCAAAATCAG

ACCCACAGCAGCGATAGCGACGGGTAGCTCCAGGAACAAACCCTTAGCTAACAGTTTACCCTGCCCTGGGGGCTG 

CAGCTGCGACCACATCCCAGGGTCGGGTTTAAAGATGAACTGCAACAACAGGAACGTGAGCAGCTTGGCTGATTT 

GAAGCCCAAGCTCTCTAACGTGCAGGAGCTTTTCCTACGAGATAACAAGATCCACAGCATCCGAAAATCGCACTT 

TGTGGATTACAAGAACCTCATTCTGTTGGATCTGGGCAACAATAACATCGCTACTGTAGAGAACAACACTTTCAA

GAACCTTTTGGACCTCAGGTGGCTATACATGGATAGCAATTACCTGGACACGCTGTCCCGGGAGAAATTCGCGGG 

GCTGCAAAACCTAGAGTACCTGAACGTGGAGTACAACGCTATCCAGCTCATCCTCCCGGGCACTTTCAATGCCAT 

GCCCAAACTGAGGATCCTCATTCTCAACAACAACCTGCTGAGGTCCCTGCCTGTGGACGTGTTCGCTGGGGTCTC 

GCTCTCTAAACTCAGCCTGCACAACAATTACTTCATGTACCTCCCGGTGGCAGGGGTGCTGGACCAGTTAACCTC

CATCATCCAGATAGACCTCCACGGAAACCCCTGGGAGTGCTCCTGCACAATTGTGCCTTTCAAGCAGTGGGCAGA 

ACGCTTGGGTTCCGAAGTGCTGATGAGCGACCTCAAGTGTGAGACGCCGGTGAACTTCTTTAGAAAGGATTTCAT 

GCTCCTCTCCAATGACGAGATCTGCCCTCAGCTGTACGCTAGGATCTCGCCCACGTTAACTTCGCACAGTAAAAA 

CAGCACTGGGTTGGCGGAGACCGGGACGCACTCCAACTCCTACCTAGACACCAGCAGGGTGTCCATCTCGGTGTT 

GGTCCCGGGACTGCTGCTGGTGTTTGTCACCTCCGCCTTCACCGTGGTGGGCATGCTCGTGTTTATCCTGAGGAA 

CCGAAAGCGGTCCAAGAGACGAGATGCCAACTCCTCCGCGTCCGAGATTAATTCCCTACAGACAGTCTGTGACTC 

TTCCTACTGGCACAATGGGCCTTACAACGCAGATGGGGCCCACAGAGTGTATGACTGTGGCTCTCACTCGCTCTC 

AGACTAAGACCCCAACCCCAATAGGGGAGGGCAGAGGGAAGGCGATACATCCTTCCCCACCGCAGGCACCCCGGG 

GGCTGGAGGGGCGTGTACCCAAATCCCCGCGCCATCAGCCTGGATGGGCATAAGTAGATAAATAACTGTGAGCTC 

GCACAACCGAAAGGGCCTGACCCCTTACTTAGCTCCCTCCTTGAAACAAAGAGCAGACTGTGGAGAGCTGGGAGA 

GCGCAGCCAGCTCGCTCTTTGCTGAGAGCCCCTTTTGACAGAAAGCCO^GCACGACCCTGCTGGAAGAACTGACA 

GTGCCCTCGCCCTCGGCCCCGGGGCCTGTGGGGTTGGATGCCGCGGTTCTATACATATATACATATATCCACATC 

TATATAGAGAGATAGATATCTATTTTTCCCCTGTGGATTAGCCCCGTGATGGCTCCCTGTTGGCTACGCAGGGAT 

GGGCAGTTGCACGAAGGCATGAATGTATTGTAAATAAGTAACTTTGACTTCTGAC 
MLLWILLLETSLCFAAGNVTGDVCKEKICSCNEIEGDLHVDCEKKGFTSLQRFTAPTSQFYHL

FLHGNSLTRLFPNEFANFYNAVSLHMENNGLHEIVPGAFLGLQLVKRLHINNNKIKSFRKQTF 

LGLDDLEYLQADFNLLRDIDPGAFQDLNKLEVLILNDNLISTLPANVFQYVPITHLDLRGNRL

KTLPYEEVLEQIPGIAEILLEDNPWDCTCDLLSLKEWLENIPKNALIGRVVCEAPTRLQGKDL 

NETTEQDLCPLKNRVDSSLPAPPAQEETFAPGPLPTPFKTNGQEDHATPGSAPNGGTKIPGNW 

QIKIRPTAAIATGSSRNKPLANSLPCPGGCSCDHIPGSGLKMNCNNRNVSSLADLKPKLSNVQ 

ELFLRDNKIHSIRKSHFVDYKNLILLDLGNNNIATVENNTFKNLLDLRWLYMDSNYLDTLSRE 

KFAGLQNLEYLNVEYNAIQLILPGTFNAMPKLRILILNNNLLRSLPVDVFAGVSLSKLSLHNN 

YFMYLPVAGVLDQLTSIIQIDLHGNPWECSCTIVPFKQWAERLGSEVLMSDLKCETPVNFFRK 

DFMLLSNDEICPQLYARISPTLTSHSKNSTGLAETGTHSNSYLDTSRVSISVLVPGLLLVFVT 

SAFTVVGMLVFILRNRKRSKRRDANSSASEINSLQTVCDSSYWHNGPYNADGAHRVYDCGSHS
LSD 

Important features: 

Signal sequence: 

amino acids 1-15 

Transmembrane domain: 

amino acids 618-638 

N-glycosylation site.

amino acids 18-22, 253-257, 363-367, 416-420, 595-599, 655-659 

cAMP- and cGMP-dependent protein kinase phosphorylation site.
amino acids 122-126, 646-650 

Casein kinase II phosphorylation site. 

amino acids 30-34, 180-184, 222-226, 256-260, 366-370, 573-577, 
608-612, 657-661, 666-670, 693-697 

N-myristoylation site. 

amino acids 17-23, 67-73, 100-106, 302-308, 328-334, 343-349, 
354-360, 465-471, 493-499, 598-604, 603-609 



Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 337-348 
AGTCGACTGCGTCCCCTGTACCCGGCGCCAGCTGTGTTCCTGACCCCAGAATAACTCAGGGCTGCACCGGGCCTG

GCAGCGCTCCGCACACATTTCCTGTCGCGGCCTAAGGGAAACTGTTGGCCGCTGGGCCCGCGGGGGGATTCTTGG 

CAGTTGGGGGGTCCGTCGGGAGCGAGGGGGGAGGGGAAGGGAGGGGGAACCGGGTTGGGGAAGCCAGCTGTAGAG

GGCGGTGACCGCGCTCCAGACACAGCTCTGCGTCCTCGAGCGGGACAGATCCAAGTTGGGAGCAGCTCTGCGTGC 

GGGGCCTCAGAGAATGAGGCCGGCGTTCGCCCTGTGCCTCCTCTGGCAGGCGCTCTGGCCCGGGCCGGGCGGCGG 

CGAACACCCCACTGCCGACCGTGCTGGCTGCTCGGCCTCGGGGGCCTGCTACAGCCTGCACCACGCTACCATGAA 

GCGGCAGGCGGCCGAGGAGGCCTGCATCCTGCGAGGTGGGGCGCTCAGCACCGTGCGTGCGGGCGCCGAGCTGCG 

CGCTGTGCTCGCGCTCCTGCGGGCAGGCCCAGGGCCCGGAGGGGGCTCCAAAGACCTGCTGTTCTGGGTCGCACT 

GGAGCGCAGGCGTTCCCACTGCACCCTGGAGAACGAGCCTTTGCGGGGTTTCTCCTGGCTGTCCTCCGACCCCGG 

CGGTCTCGAAAGCGACACGCTGCAGTGGGTGGAGGAGCCCCAACGCTCCTGCACCGCGCGGAGATGCGCGGTACT 

CCAGGCCACCGGTGGGGTCGAGCCCGCAGGCTGGAAGGAGATGCGATGCCACCTGCGCGCCAACGGCTACCTGTG 

CAAGTACCAGTTTGAGGTCTTGTGTCCTGCGCCGCGCCCCGGGGCCGCCTCTAACTTGAGCTATCGCGCGCCCTT 

CCAGCTGCACAGCGCCGCTCTGGACTTCAGTCCACCTGGGACCGAGGTGAGTGCGCTCTGCCGGGGACAGCTCCC

GATCTCAGTTACTTGCATCGCGGACGAAATCGGCGCTCGCTGGGACAAACTCTCGGGCGATGTGTTGTGTCCCTG 

CCCCGGGAGGTACCTCCGTGCTGGCAAATGCGCAGAGCTCCCTAACTGCCTAGACGACTTGGGAGGCTTTGCCTG 

CGAATGTGCTACGGGCTTCGAGCTGGGGAAGGACGGCCGCTCTTGTGTGACCAGTGGGGAAGGACAGCCGACCCT 

TGGGGGGACCGGGGTGCCCACCAGGCGCCCGCCGGCCACTGCAACCAGCCCCGTGCCGCAGAGAACATGGCCAAT 

CAGGGTCGACGAGAAGCTGGGAGAGACACCACTTGTCCCTGAACAAGACAATTCAGTAACATCTATTCCTGAGAT

TCCTCGATGGGGATCACAGAGCACGATGTCTACCCTTCAAATGTCCCTTCAAGCCGAGTCAAAGGCCACTATCAC 

CCCATCAGGGAGCGTGATTTCCAAGTTTAATTCTACGACTTCCTCTGCCACTCCTCAGGCTTTCGACTCCTCCTC 

TGCCGTGGTCTTCATATTTGTGAGCACAGCAGTAGTAGTGTTGGTGATCTTGACCATGACAGTACTGGGGCTTGT 

CAAGCTCTGCTTTCACGAAAGCCCCTCTTCCCAGCCAAGGAAGGAGTCTATGGGCCCGCCGGGCCTGGAGAGTGA 

TCCTGAGCCCGCTGCTTTGGGCTCCAGTTCTGCACATTGCACAAACAATGGGGTGAAAGTCGGGGACTGTGATCT

GCGGGACAGAGCAGAGGGTGCCTTGCTGGCGGAGTCCCCTCTTGGCTCTAGTGATGCAaaSGGAAACAGGGGACA 

TGGGCACrCCTGTGAACAGTTTTTCACTTTTGATGAAACGGGGAACCAAGAGGAACTTACTTGTGTAACTGACAA- 

TTTCTGCAGAAATCCCCCTTCCTCTAAATTCCCTTTACTCCACTGAGGAGCTAAATCAGAACTGCACACTCCTTC 

CCTGATGATAGAGGAAGTGGAAGTGCCTTTAGGATGGTGATACTGGGGGACCGGGTAGTGCTGGGGAGAGATATT 

TTCTTATGTTTATTCGGAGAATTTGGAGAAGTGATTGAACTTTTCAAGACATTGGAAACAAATAGAACACAATAT 

AATTTACATTAAAAAATAATTTCTACCAAAATGGAAAGGAAATGTTCTATGTTGTTCAGGCTAGGAGTATATTGG 

TTCGAAATCCCAGGGAAAAAAATAAAAATAAAAAATTAAAGGATTGTTGAT 
FIGURE 356

MRPAFALCLLWQALWPGPGGGEHPTADRAGCSASGACYSLHHATMKRQAAEEACILRGGALST 
VRAGAELRAVLALLRAGPGPGGGSKDLLFWVALERRRSHCTLENEPLRGFSWLSSDPGGLESD
TLQWVEEPQRSCTARRCAVLQATGGVEPAGWKEMRCHLRANGYLCKYQFEVLCPAPRPGAASN 
LSYRAPFQLHSAALDFSPPGTEVSALCRGQLPISVTCIADEIGARWDKLSGDVLCPCPGRYLR 
AGKCAELPNCLDDLGGFACECATGFELGKDGRSCVTSGEGQPTLGGTGVPTRRPPATATSPVP 
QRTWPIRVDEKLGETPLVPEQDNSVTSIPEIPRWGSQSTMSTLQMSLQAESKATITPSGSVIS 
KFNSTTSSATPQAFDSSSAVVFIFVSTAVVVLVILTMTVLGLVKLCFHESPSSQPRKESMGPP
GLESDPEPAALGSSSAHCTNNGVKVGDCDLRDRAEGALLAESPLGSSDA 

Important features:
Signal sequence:
amino acids 1-16

Transmembrane domain:
amino acids 399-418

N-glycosylation site.
amino acids 189-193, 381-385

Glycosaminoglycan attachment site.
amino acids 289-293

cAMP- and cGMP-dependent protein kinase phosphorylation site.
amino acids 98-102, 434-438

Casein kinase II phosphorylation site.
amino acids 275-279, 288-292, 342-346, 445-449

N-myristoylation site.
amino acids 30-36, 35-41, 58-64, 59-65, 121-127, 151-157,
185-191, 209-215, 267-273, 350-356, 374-380, 453-459, 463-469,
477-483

Aspartic acid and asparagine hydroxylation site.
amino acids 262-274
FIGURE 357

CCCATCTCAAGCTGATCTTGGCACCTCTCATGCTCTGCTCTCTTCAACCAGACCTCTACATTCCATTTTGGAAGA 

AGACTAAAAATGGTGTTTCCAATGTGGACACTGAAGAGACAAATTCTTATCCTTTTTAACATAATCCTAATTTCC 

AAACTCCTTGGGGCTAGATGGTTTCCTAAAACTCTGCCCTGTGATGTCACTCTGGATGTTCCAAAGAACCATGTG 

ATCGTGGACTGCACAGACAAGCATTTGACAGAAATTCCTGGAGGTATTCCCACGAACACCACGAACCTCACCCTC

ACCATTAACCACATACCAGACATCTCCCCAGCGTCCTTTCACAGACTGGACO^TCTGGTAGAGM^ 

TGCAACTGTGTACCTATTCCACTGGGGTCAAAAAACAACATGTGOVTCAAGAGGCTGCAGATTAAACCCAG^^ 

TTTAGTGGACTCACTTATTTAAAATCCCTTTACCTGGATGGAAACCAGCTAcS^^ 

CCTAGCTTACAGCTTCTCAGCCTTGAGGCCAACAACATCTTTTCCATCAGAAAAGAGAATCT^^ 

AACATAGAAATACTCTACCTGGGCCAAAACTGTTATTATCGAAATCCTTGTTATGTTTCATATTCAATAGAGAAA 

^^^^^^''^'^^^''^^'^'■'^'^^ctctccctgaaagataacaatgtS^^^^ 

ctcaaccaattacaaattcttgacctaagtggaaattgccctcgttgttataatgccccatttccttgtgcgccg 
^^^^^^^'^^^^'^^^^^^^^^^'^ccct'St^tgcttttgatgcgctgacagaattaaaagttttacgtcta 
^^^^^'^^^^'^'^^^^^'^^^^'^^^'^^^^c^tggtttaagaacatcaacaaactccaggaactggatctgtcc 
caaaacttcttggccaaagaaattggggatgctaaatttctgcattttctccccagcctcatccaattggatctg 

TCTTTCAATTTTGAACTTCAGGTCTATCGTGCATCTATGAATCTATCACAAGCATTTTCTTCACTGAAAAGCCTr 

aaaattctgcggatcagaggatatgtctttaaagagttgaaaagctttaacctctcgccattacataatcttcaa 

ctgaaagtcatagatctttcagtgaataaaatatcaccttcaggagattcaagtgaagttggcttctgctcaaat 
»^^™?'^^^'^'^^°^^^^^'^*^^^^<=^a^^tcctggaacaattacattatttcagatatgataagtatg 

^^™^2^^5^^^^^^^^^'^^'^°^^'^CT'rTCATGTCTGTTAATGAAAGCTGCTACA^^^ 

TTGGATCTAAGTAAAAATAGTATATTTTTTGTCAAGTCCTCTGATTTTCAGCATCTTTCTTTCCTCAAATGCCTC: 

™!S3o^^*^^'^^^"^<^^^'^^'fC'f'r^TGGCAGTGAATTCCAACCTTTAGCAG^ 

™^^^S™^^^^'^'^^"^^™^tcc^ttcaacagcatttgaagagcttcacaaactggaagttctgg^^ 
^^™^*^^^^^'^*^^^^^^^^A'^ct<^ttcctccaccagcaggaccatggagagtgagtctcttaga 

n^o^™r^°^^^'^^^^^'^^^'^'^°T'^''^TATGGAGAGAAGGTGATAACAGATACTTACAATTA 

aatctgctaaaattagaggaattagacatctctaaaaattccctaagtttcttgccttctggagtttttgatggt 

ATGCCTCCAAATCTAAAGAATCTCTCTTTGGCCAAAAATGGGCTCAAATCTTTCAGTTGGAAGAAACTCCAGTGT 

Ar^^^n«^^^'^^^^^**=^'^^^^^'^^^^^<=^CT'2^CCACTGTCCCTGAGAGATTATCCAACTGTTCC 

AGAAGCCTCAAGAATCTGATTCTTAAGAATAATCAAATCAGGAGTCTGACGAAGTATTTTCTACAAGATGCCTTC 

^noHS^^o^^^^^'^'^^'^^'^^^'^^^^^^^TCCAGATGATCCAAAAGACCAGCTTCCCAGAAAATGTCC^ 

^S^^^I'^^^^^^'^'^^^^^^^'^^^^^^'^^T'^^'^'^TTCTGTGCACCTGTGATGCTGTGTGGTTTGTCTGGTGG 

GTTAACCATACGGAGGTGACTATTCCTTACCTGGCCACAGATGTGACTTGTGTGGGGCCAGGAGCACACAAGGGC • 

caaagtgtgatctccctggatctgtacacctgtgagttagatctgactaacctgattctg^Sct^ 

TCTGTATCTCTCTTTCTCATGGTGATGATGACAGCAAGTCACCTCTATTTCTGGGATGTGTGGTATATTTACCAT 
TTCTGTAAGGCCAAGATAAAGGGGTATCAGCGTCTAATATCACCAGACTGTTGCTATGATGCTTTkTTGTCT^^ 
GACACTAAAGACCCAGCTGTGACCGAGTGGGTTTTGGCTGAGCTGGTGGCCAAACTGGAAGACCCAAGAGAGAA^ 

cattttaatttatgtctcgaggaaagggactggttaccagggcagccagttctggaaS^ 

CAGCTTAGCAAAAAGACAGTGTTTGTGATGACAGACAAGTATGCAAAGACTGAAAATTTTAAGATAGCATTTTAC 

ttgtcccatcagaggctovtggatgaaaaagttgatgtgattatcttgatatttcttgagaagccctttcagaag 

TCCAAGTTCCTCCAGCTCCGGAAAAGGCTCTGTGGGAGTTCTGTCCTTGAGTGGCCAACAAACCCGCAAGCTCAC 

ccatacttctggcagtgtctaaagaacgccctggccacagacaatcatgtggcctatagtcaggtgttcaaggaa 

ACGGTCSaSCCCTTCTTTGCAAAACACAACTGCCTAGTTTACCAAGGAGAGGCCTGGC 
FIGURE 358



MVFPMWTLKRQILILFNIILISKLLGARWFPKTLPCDVTLDVPKNHVIVDCTDKHLTEIPGGI
PTNTTNLTLTINHIPDISPASFHRLDHLVEIDFRCNCVPIPLGSKNNMCIKRLQIKPRSFSGL
TYLKSLYLDGNQLLEIPQGLPPSLQLLSLEANNIFSIRKENLTELANIEILYLGQNCYYRNPC
YVSYSIEKDAFLNLTKLKVLSLKDNNVTAVPTVLPSTLTELYLYNNMIAKIQEDDFNNLNQLQ
ILDLSGNCPRCYNAPFPCAPCKNNSPLQIPVNAFDALTELKVLRLHSNSLQHVPPRWFKNINK
LQELDLSQNFLAKEIGDAKFLHFLPSLIQLDLSFNFELQVYRASMNLSQAFSSLKSLKILRIR
GYVFKELKSFNLSPLHNLQNLEVLDLGTNFIKIANLSMFKQFKRLKVIDLSVNKISPSGDSSE
VGFCSNARTSVESYEPQVLEQLHYFRYDKYARSCRFKNKEASFMSVNESCYKYGQTLDLSKNS
IFFVKSSDFQHLSFLKCLNLSGNLISQTLNGSEFQPLAELRYLDFSNNRLDLLHSTAFEELHK
LEVLDISSNSHYFQSEGITHMLNFTKNLKVLQKLMMNDNDISSSTSRTMESESLRTLEFRGNH
LDVLWREGDNRYLQLFKNLLKLEELDISKNSLSFLPSGVFDGMPPNLKNLSLAKNGLKSFSWK
KLQCLKNLETLDLSHNQLTTVPERLSNCSRSLKNLILKNNQIRSLTKYFLQDAFQLRYLDLSS
NKIQMIQKTSFPENVLNNLKMLLLHHNRFLCTCDAVWFVWWVNHTEVTIPYLATDVTCVGPGA
HKGQSVISLDLYTCELDLTNLILFSLSISVSLFLMVMMTASHLYFWDVWYIYHFCKAKIKGYQ
RLISPDCCYDAFIVYDTKDPAVTEWVLAELVAKLEDPREKHFNLCLEERDWLPGQPVLENLSQ
SIQLSKKTVFVMTDKYAKTENFKIAFYLSHQRLMDEKVDVIILIFLEKPFQKSKFLQLRKRLC
GSSVLEWPTNPQAHPYFWQCLKNALATDNHVAYSQVFKETV

Important features: 
Signal sequence: 

amino acids 1-26 

Transmembrane domain: 

amino acids 840-860 
FIGURE 359

GACGGCTGGCCACCATGCACGGCTCCTGCAGTTTCCTGATGCTTCTGCTGCCGCTACTGCTAC 

TGCTGGTGGCCACCACAGGCCCCGTTGGAGCCCTCACAGATGAGGAGAAACGTTTGATGGTGG 

AGCTGCACAACCTCTACCGGGCCCAGGTATCCCCGACGGCCTCAGACATGCTGCACATGAGAT 

GGGACGAGGAGCTGGCCGCCTTCGCCAAGGCCTACGCACGGCAGTGCGTGTGGGGCCACAACA 

AGGAGCGCGGGCGCCGCGGCGAGAATCTGTTCGCCATCACAGACGAGGGCATGGACGTGCCGC 

TGGCCATGGAGGAGTGGCACCACGAGCGTGAGCACTACAACCTCAGCGCCGCCACCTGCAGCC 

CAGGCCAGATGTGCGGCCACTACACGCAGGTGGTATGGGCCAAGACAGAGAGGATCGGCTGTG 

GTTCCCACTTCTGTGAGAAGCTCCAGGGTGTTGAGGAGACCAACATCGAATTACTGGTGTGCA 

ACTATGAGCCTCCGGGGAACGTGAAGGGGAAACGGCCCTACCAGGAGGGGACTCCGTGCTCCC 

AATGTCCCTCTGGCTACCACTGCAAGAACTCCCTCTGTGAACCCATCGGAAGCCCGGAAGATG 

CTCAGGATTTGCCTTACCTGGTAACTGAGGCCCCATCCTTCCGGGCGACTGAAGCATCAGACT 

CTAGGAAAATGGGTACTCCTTCTTCCCTAGCAACGGGGATTCCGGCTTTCTTGGTAACAGAGG 

TCTCAGGCTCCCTGGCAACCAAGGCTCTGCCTGCTGTGGAAACCCAGGCCCCAACTTCCTTAG 

CAACGAAAGACCCGCCCTCCATGGCAACAGAGGCTCCACCTTGCGTAACAACTGAGGTCCCTT 

CCATTTTGGCAGCTCACAGCCTGCCCTCCTTGGATGAGGAGCCAGTTACCTTCCCCAAATCGA 

CCCATGTTCCTATCCCAAAATCAGCAGACAAAGTGACAGACAAAACAAAAGTGCCCTCTAGGA 

GCCCAGAGAACTCTCTGGACCCCAAGATGTCCCTGACAGGGGCAAGGGAACTCCTACCCCATG 

CCCAGGAGGAGGCTGAGGCTGAGGCTGAGTTGCCTCCTTCCAGTGAGGTCTTGGCCTCAGTTT 

TTCCAGCCCAGGACAAGCCAGGTGAGCTGCAGGCCACACTGGACCACACGGGGCACACCTCCT 

CCAAGTCCCTGCCCAATTTCCCCAATACCTCTGCCACCGCTAATGCCACGGGTGGGCGTGCCC 

TGGCTCTGCAGTCGTCCTTGCCAGGTGCAGAGGGCCCTGACAAGCCTAGCGTTGTGTCAGGGC 

TGAACTCGGGCCCTGGTCATGTGTGGGGCCCTCTCCTGGGACTACTGCTCCTGCCTCCTCTGG 

TGTTGGCTGGAATCTTCTGAATGGGATACCACTCAAAGGGTGAAGAGGTCAGCTGTCCTCCTG 

TCATCTTCCCCACCCTGTCCCCAGCCCCTAAACAAGATACTTCTTGGTTAAGGCCCTCCGGAA 

GGGAAAGGCTACGGGGCATGTGCCTCATCACACCATCCATCCTGGAGGCACAAGGCCTGGCTG 

GCTGCGAGCTCAGGAGGCCGCCTGAGGACTGCACACCGGGCCCACACCTCTCCTGCCCCTCCC 

TCCTGAGTCCTGGGGGTGGGAGGATTTGAGGGAGCTCACTGCCTACCTGGCCTGGGGCTGTCT 

GCCCACACAGCATGTGCGCTCTCCCTGAGTGCCTGTGTAGCTGGGGATGGGGATTCCTAGGGG 

CAGATGAAGGACAAGCCCCACTGGAGTGGGGTTCTTTGAGTGGGGGAGGCAGGGACGAGGGAA 

GGAAAGTAACTCCTGACTCTCCAATAAAAACCTGTCCAACCTGTGAAA 
FIGURE 360

MHGSCSFLMLLLPLLLLLVATTGPVGALTDEEKRLMVELHNLYRAQVSPTASDMLHMRWDEEL 
AAFAKAYARQCVWGHNKERGRRGENLFAITDEGMDVPLAMEEWHHEREHYNLSAATCSPGQMC 
GHYTQVVWAKTERIGCGSHFCEKLQGVEETNIELLVCNYEPPGNVKGKRPYQEGTPCSQCPSG 
YHCKNSLCEPIGSPEDAQDLPYLVTEAPSFRATEASDSRKMGTPSSLATGIPAFLVTEVSGSL 
ATKALPAVETQAPTSLATKDPPSMATEAPPCVTTEVPSILAAHSLPSLDEEPVTFPKSTHVPI 
PKSADKVTDKTKVPSRSPENSLDPKMSLTGARELLPHAQEEAEAEAELPPSSEVLASVFPAQD 
KPGELQATLDHTGHTSSKSLPNFPNTSATANATGGRALALQSSLPGAEGPDKPSVVSGLNSGP 
GHVWGPLLGLLLLPPLVLAGIF 

Important features: 
Signal sequence: 

amino acids 1-22 

N-glycosylation site. 

amino acids 114-118, 403-407, 409-413 

Glycosaminoglycan attachment site. 

amino acids 439-443 

Casein kinase II phosphorylation site. 

amino acids 29-33, 50-54, 156-160, 195-199, 202-206, 299-303

N-myristoylation site.

amino acids 123-129, 143-149, 152-158, 169-175, 180-186, 231-237, 250-256 

Amidation site.

amino acids 82-86, 172-176 

Peroxidases proximal heme-ligand signature. 

amino acids 287-298 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 signature 1. 

amino acids 127-138 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 signature 2. 

amino acids 160-172 




FIGURE 361



GACTAGTTCTCTTGGAGTCTGGGAGGAGGAAAGCGGAGCCGGCAGGGAGCGAACCAGGACTGG 
GGTGACGGCAGGGCAGGGGGCGCCTGGCCGGGGAGAAGCGCGGGGGCTGGAGCACCACCAACT 
GGAGGGTCCGGAGTAGCGAGCGCCCCGAAGGAGGCCATCGGGGAGCCGGGAGGGGGGACTGCG 
AGAGGACCCCGGCGTCCGGGCTCCCGGTGCCAGCGCTATGAGGCCACTCCTCGTCCTGCTGCT 
CCTGGGCCTGGCGGCCGGCTCGCCCCCACTGGACGACAACAAGATCCCCAGCCTCTGCCCGGG 
GCACCCCGGCCTTCCAGGCACGCCGGGCCACCATGGCAGCCAGGGCTTGCCGGGCCGCGATGG 
CCGCGACGGCCGCGACGGCGCGCCCGGGGCTCCGGGAGAGAAAGGCGAGGGCGGGAGGCCGGG 
ACTGCCGGGACCTCGAGGGGACCCCGGGCCGCGAGGAGAGGCGGGACCCGCGGGGCCCACCGG 
GCCTGCCGGGGAGTGCTCGGTGCCTCCGCGATCCGCCTTCAGCGCCAAGCGCTCCGAGAGCCG 
GGTGCCTCCGCCGTCTGACGCACCCTTGCCCTTCGACCGCGTGCTGGTGAACGAGCAGGGACA 
TTACGACGCCGTCACCGGCAAGTTCACCTGCCAGGTGCCTGGGGTCTACTACTTCGCCGTCCA 
TGCCACCGTCTACCGGGCCAGCCTGCAGTTTGATCTGGTGAAGAATGGCGAATCCATTGCCTC 
TTTCTTCCAGTTTTTCGGGGGGTGGCCCAAGCCAGCCTCGCTCTCGGGGGGGGCCATGGTGAG 
GCTGGAGCCTGAGGACCAAGTGTGGGTGCAGGTGGGTGTGGGTGACTACATTGGCATCTATGC 
CAGCATCAAGACAGACAGCACCTTCTCCGGATTTCTGGTGTACTCCGACTGGCACAGCTCCCC 
AGTCTTTGCTSaSTGCCCACTGCAAAGTGAGCTCATGCTCTCACTCCTAGAAGGAGGGTGTGA 
GGCTGACAACCAGGTCATCCAGGAGGGCTGGCCCCCCTGGAATATTGTGAATGACTAGGGAGG 
TGGGGTAGAGCACTCTCCGTCCTGCTGCTGGCAAGGAATGGGAACAGTGGCTGTCTGCGATCA 
GGTCTGGCAGCATGGGGCAGTGGCTGGATTTCTGCCCAAGACCAGAGGAGTGTGCTGTGCTGG 
CAAGTGTAAGTCCCCCAGTTGCTCTGGTCCAGGAGCCCACGGTGGGGTGCTCTCTTCCTGGTC 
CTCTGCTTCTCTGGATCCTCCCCACCCCCTCCTGCTCCTGGGGCCGGCCCTTTTCTCAGAGAT 
CACTCAATAAACCTAAGAACCCTCATAAAAAAAAAAAAAAAAAAAAAAAAAA/^ 
FIGURE 362



MRPLLVLLLLGLAAGSPPLDDNKIPSLCPGHPGLPGTPGHHGSQGLPGRDGRDGRDGAPGAPG 
EKGEGGRPGLPGPRGDPGPRGEAGPAGPTGPAGECSVPPRSAFSAKRSESRVPPPSDAPLPFD 
RVLVNEQGHYDAVTGKFTCQVPGVYYFAVHATVYRASLQFDLVKNGESIASFFQFFGGWPKPA 
SLSGGAMVRLEPEDQVWVQVGVGDYIGIYASIKTDSTFSGFLVYSDWHSSPVFA 

Important features: 
Signal sequence. 

amino acids 1-15 

N-myristoylation sites. 

amino acids 11-17, 68-74, 216-222 

Cell attachment sequence. 

amino acids 77-80 
FIGURE 363

GGAGAGCGGAGCGAAGCTGGATAACAGGGGACCGATGATGTGGCGACCATCAGTTCTGCTGCT 

TCTGTTGCTACTGAGGCACGGGGCCCAGGGGAAGCCATCCCCAGACGCAGGCCCTCATGGCCA 

GGGGAGGGTGCACCAGGCGGCCCCCCTGAGCGACGCTCCCCATGATGACGCCCACGGGAACTT 

CCAGTACGACCATGAGGCTTTCCTGGGACGGGAAGTGGCCAAGGAATTCGACCAACTCACCCC 

AGAGGAAAGCCAGGCCCGTCTGGGGCGGATCGTGGACCGCATGGACCGCGCGGGGGACGGCGA 

CGGCTGGGTGTCGCTGGCCGAGCTTCGCGCGTGGATCGCGCACACGCAGCAGCGGCACATACG 

GGACTCGGTGAGCGCGGCCTGGGACACGTACGACACGGACCGCGACGGGCGTGTGGGTTGGGA 

GGAGCTGCGCAACGCCACCTATGGCCACTACGCGCCCGGTGAAGAATTTCATGACGTGGAGGA 

TGCAGAGACCTACAAAAAGATGCTGGCTCGGGACGAGCGGCGTTTCCGGGTGGCCGACCAGGA 

TGGGGACTCGATGGCCACTCGAGAGGAGCTGACAGCCTTCCTGCACCCCGAGGAGTTCCCTCA 

CATGCGGGACATCGTGATTGCTGAAACCCTGGAGGACCTGGACAGAAACAAAGATGGCTATGT 

CCAGGTGGAGGAGTACATCGCGGATCTGTACTCAGCCGAGCCTGGGGAGGAGGAGCCGGCGTG 

GGTGCAGACGGAGAGGCAGCAGTTCCGGGACTTCCGGGATCTGAACAAGGATGGGCACCTGGA 

TGGGAGTGAGGTGGGCCACTGGGTGCTGCCCCCTGCCCAGGACCAGCCCCTGGTGGAAGCCAA 

CCACCTGCTGCACGAGAGCGACACGGACAAGGATGGGCGGCTGAGCAAAGCGGAAATCCTGGG 

TAATTGGAACATGTTTGTGGGCAGTCAGGCCACCAACTATGGCGAGGACCTGACCCGGCACCA 

CGATGAGCTGTGAGCACCGCGCACCTGCCACAGCCTCAGAGGCCCGCACAATGACCGGAGGAG 

GGGCCGCTGTGGTCTGGCCCCCTCCCTGTCCAGGCCCCGCAGGAGGCAGATGCAGTCCCAGGC 

ATCCTCCTGCCCCTGGGCTCTCAGGGACCCCCTGGGTCGGCTTCTGTCCCTGTCACACCCCCA 

ACCCCAGGGAGGGGCTGTCATAGTCCCAGAGGATAAGCAATACCTATTTCTGACTGAGTCTCC 

CAGCCCAGACCCAGGGACCCTTGGCCCCAAGCTCAGCTCTAAGAACCGCCCCAACCCCTCCAG 

CTCCAAATCTGAGCCTCCACCACATAGACTGAAACTCCCCTGGCCCCAGCCCTCTCCTGCCTG 

GCCTGGCCTGGGACACCTCCTCTCTGCCAGGAGGCAATAAAAGCCAGCGCCGGGACCTTGAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 364



MMWRPSVLLLLLLLRHGAQGKPSPDAGPHGQGRVHQAAPLSDAPHDDAHGNFQYDHEAFLGRE 
VAKEFDQLTPEESQARLGRIVDRMDRAGDGDGWVSLAELRAWIAHTQQRHIRDSVSAAWDTYD 
TDRDGRVGWEELRNATYGHYAPGEEFHDVEDAETYKKMLARDERRFRVADQDGDSMATREELT 
AFLHPEEFPHMRDIVIAETLEDLDRNKDGYVQVEEYIADLYSAEPGEEEPAWVQTERQQFRDF 
RDLNKDGHLDGSEVGHWVLPPAQDQPLVEANHLLHESDTDKDGRLSKAEILGNWNMFVGSQAT 
NYGEDLTRHHDEL 

Important features:
Signal sequence: 

amino acids 1-20 

N-glycosylation site. 

amino acids 140-144

Casein kinase II phosphorylation site. 

amino acids 72-76, 98-102, 127-131, 184-188, 208-212, 289-293, 
291-295, 298-302 

N-myristoylation site. 

amino acids 263-269, 311-317 

Endoplasmic reticulum targeting sequence. 

amino acids 325-330 
FIGURE 365

GTCTGTTCCCAGGAGTCCTTCGGCGGCTGTTGTGTCAGTGGCCTGATCGCGATGGGGACAAAG 

GCGCAAGTCGAGAGGAAACTGTTGTGCCTCTTCATATTGGCGATCCTGTTGTGCTCCCTGGCA 

TTGGGCAGTGTTACAGTGCACTCTTCTGAACCTGAAGTCAGAATTCCTGAGAATAATCCTGTG 

AAGTTGTCCTGTGCCTACTCGGGCTTTTCTTCTCCCCGTGTGGAGTGGAAGTTTGACCAAGGA 

GACACCACCAGACTCGTTTGCTATAATAACAAGATCACAGCTTCCTATGAGGACCGGGTGACC 

TTCTTGCCAACTGGTATCACCTTCAAGTCCGTGACACGGGAAGACACTGGGACATACACTTGT 

ATGGTCTCTGAGGAAGGCGGCAACAGCTATGGGGAGGTCAAGGTCAAGCTCATCGTGCTTGTG 

CCTCCATCCAAGCCTACAGTTAACATCCCCTCCTCTGCCACCATTGGGAACCGGGCAGTGCTG 

ACATGCTCAGAACAAGATGGTTCCCCACCTTCTGAATACACCTGGTTCAAAGATGGGATAGTG 

ATGCCTACGAATCCCAAAAGCACCCGTGCCTTCAGCAACTCTTCCTATGTCCTGAATCCCACA 

ACAGGAGAGCTGGTCTTTGATCCCCTGTCAGCCTCTGATACTGGAGAATACAGCTGTGAGGCA 

CGGAATGGGTATGGGACACCCATGACTTCAAATGCTGTGCGCATGGAAGCTGTGGAGCGGAAT 

GTGGGGGTCATCGTGGCAGCCGTCCTTGTAACCCTGATTCTCCTGGGAATCTTGGTTTTTGGC 

ATCTGGTTTGCCTATAGCCGAGGCCACTTTGACAGAACAAAGAAAGGGACTTCGAGTAAGAAG

GTGATTTACAGCCAGCCTAGTGCCCGAAGTGAAGGAGAATTCAAACAGACCTCGTCATTCCTG 

GTGIS&GCCTGGTCGGCTCACCGCCTATCATCTGCATTTGCCTTACTCAGGTGCTACCGGACT 

CTGGCCCCTGATGTCTGTAGTTTCACAGGATGCCTTATTTGTCTTCTACACCCCACAGGGCCC 

CCTACTTCTTCGGATGTGTTTTTAATAATGTCAGCTATGTGCCCCATCCTCCTTCATGCCCTC 

CCTCCCTTTCCTACCACTGCTGAGTGGCCTGGAACTTGTTTAAAGTGTTTATTCCCCATTTCT 

TTGAGGGATCAGGAAGGAATCCTGGGTATGCCATTGACTTCCCTTCTAAGTAGACAGCAAAAA 

TGGCGGGGGTCGCAGGAATCTGCACTCAACTGCCCACCTGGCTGGCAGGGATCTTTGAATAGG 

TATCTTGAGCTTGGTTCTGGGCTCTTTCCTTGTGTACTGACGACCAGGGCCAGCTGTTCTAGA 

GCGGGAATTAGAGGCTAGAGCGGCTGAAATGGTTGTTTGGTGATGACACTGGGGTCCTTCCAT 

CTCTGGGGCCCACTCTCTTCTGTCTTCCCATGGGAAGTGCCACTGGGATCCCTCTGCCCTGTC 

CTCCTGAATACAAGCTGACTGACATTGACTGTGTCTGTGGAAAATGGGAGCTCTTGTTGTGGA 

GAGCATAGTAAATTTTCAGAGAACTTGAAGCCAAAAGGATTTAAAACCGCTGCTCTAAAGAAA 

AGAAAACTGGAGGCTGGGCGCAGTGGCTCACGCCTGTAATCCCAGAGGCTGAGGCAGGCGGAT 

CACCTGAGGTCGGGAGTTCGGGATCAGCCTGACCAACATGGAGAAACCCTACTGGAAATACAA 

AGTTAGCCAGGCATGGTGGTGCATGCCTGTAGTCCCAGCTGCTCAGGAGCCTGGCAACAAGAG 
CAAAACTCCAGCTCAAAAAAAAAAAAAAAA 
FIGURE 366

MGTKAQVERKLLCLFILAILLCSLALGSVTVHSSEPEVRIPENNPVKLSCAYSGFSSPRVEWK 
FDQGDTTRLVCYNNKITASYEDRVTFLPTGITFKSVTREDTGTYTCMVSEEGGNSYGEVKVKL 
IVLVPPSKPTVNIPSSATIGNRAVLTCSEQDGSPPSEYTWFKDGIVMPTNPKSTRAFSNSSYV 
LNPTTGELVFDPLSASDTGEYSCEARNGYGTPMTSNAVRMEAVERNVGVIVAAVLVTLILLGI 
LVFGIWFAYSRGHFDRTKKGTSSKKVIYSQPSARSEGEFKQTSSFLV 

Important features:
Signal sequence: 

amino acids 1-27 

Transmembrane domain: 

amino acids 238-255 

N-glycosylation site. 

amino acids 185-189 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 270-274 

Casein kinase II phosphorylation site.

amino acids 34-38, 82-86, 100-104, 118-122, 152-156, 154-158, 
193-197, 203-207, 287-291 



N-myristoylation site.

amino acids 105-111, 116-122, 158-164, 219-225, 237-243, 256-262
FIGURE 367



GGGGAGAGGAATTGACCATGTAAAAGGAGACTTTTTTTTTTGGTGGTGGTGGCTGTTGGGTGCCTTGCAAAAATG 
AAGGATGCAGGACGCAGCTTTCTCCTGGAACCGAACGCAATGGATAAACTGATTGTGCAAGAGAGAAGGAAGAAC 
GAAGCTTTTTCTTGTGAGCCCTGGATCTTAACACAAATGTGTATATGTGCACACAGGGAGCATTCAAGAATGAAA 
TAAACCAGAGTTAGACCCGCGGGGGTTGGTGTGTTCTGACATAAATAAATAATCTTAAAGCAGCTGTTCCCCTCC 
CCACCCCCAAAAAAAAGGATGATTGGAAATGAAGAACCGAGGATTCACAAAGAAAAAAGTATGTTCATTTTTCTC 
TATAAAGGAGAAAGTGAGCCAAGGAGATATTTTTGGAATGAAAAGTTTGGGGCTTTTTTAGTAAAGTAAAGAACT 
GGTGTGGTGGTGTTTTCCTTTCTTTTTGAATTTCCCACAAGAGGAGAGGAAATTAATAATACATCTGCAAAGAAA 
TTTCAGAGAAGAAAAGTTGACCGCGGCAGATTGAGGCATTGATTGGGGGAGAGAAACCAGCAGAGCACAGTTGGA 
TTTGTGCCTATGTTGACTAAAATTGACGGATAATTGCAGTTGGATTTTTCTTCATCAACCTCCTTTTTTTTAAAT 
TTTTATTCCTTTTGGTATCAAGATCATGCGTTTTCTCTTGTTCTTAACCACCTGGATTTCCATCTGGATGTTGCT 
GTGATCAGTCTGAAATACAACTGTTTGAATTCCAGAAGGACCAACACCAGATAAATTATGAATGTTGAACAAGAT
GACCTTACATCCACAGCAGATAATGATAGGTCCTAGGTTTAACAGGGCCCTATTTGACCCCCTGCTTGTGGTGCT 
GCTGGCTCTTCAACTTCTTGTGGTGGCTGGTCTGGTGCGGGCTCAGACCTGCCCTTCTGTGTGCTCCTGCAGCAA 
CCAGTTCAGCAAGGTGATTTGTGTTCGGAAAAACCTGCGTGAGGTTCCGGATGGCATCTCCACCAACACACGGCT 
GCTGAACCTCCATGAGAACCAAATCCAGATCATCAAAGTGAACAGCTTCAAGCACTTGAGGCACTTGGAAATCCT 
ACAGTTGAGTAGGAACCATATCAGAACCATTGAAATTGGGGCTTTCAATGGTCTGGCGAACCTCAACACTCTGGA 
ACTCTTTGACAATCGTCTTACTACCATCCCGAATGGAGCTTTTGTATACTTGTCTAAACTGAAGGAGCTCTGGTT 
GCGAAACAACCCCATTGAAAGCATCCCTTCTTATGCTTTTAACAGAATTCCTTCTTTGCGCCGACTAGACTTAGG 
GGAATTGAAAAGACTTTCATACATCTCAGAAGGTGCCTTTGAAGGTCTGTCCAACTTGAGGTATTTGAACCTTGC 
CATGTGCAACCTTCGGGAAATCCCTAACCTCACACCGCTCATAAAACTAGATGAGCTGGATCTTTCTGGGAATCA 
TTTATCTGCCATCAGGCCTGGCTCTTTCCAGGGTTTGATGCACCTTCAAAAACTGTGGATGATACAGTCCCAGAT 
TCAAGTGATTGAACGGAATGCCTTTGACAACCTTCAGTCACTAGTGGAGATCAACCTGGCACACAATAATCTAAC 
ATTACTGCCTCATGACCTCTTCACTCCCTTGCATCATCTAGAGCGGATACATTTACATCACAACCCTTGGAACTG 
TAACTGTGACATACTGTGGCTCAGCTGGTGGATAAAAGACATGGCCCCCTCGAACACAGCTTGTTGTGCCCGGTG 
TAACACTCCTCCCAATCTAAAGGGGAGGTACATTGGAGAGCTCGACCAGAATTACTTCACATGCTATGCTCCGGT 
GATTGTGGAGCCCCCTGCAGACCTCAATGTCACTGAAGGCATGGCAGCTGAGCTGAAATGTCGGGCCTCCACATC 
CCTGACATCTGTATCTTGGATTACTCCAAATGGAACAGTCATGACACATGGGGCGTACAAAGTGCGGATAGCTGT
GCTCAGTGATGGTACGTTAAATTTCACAAATGTAACTGTGCAAGATACAGGCATGTACACATGTATGGTGAGTAA 
TTCCGTTGGGAATACTACTGCTTCAGCCACCCTGAATGTTACTGCAGCAACCACTACTCCTTTCTCTTACTTTTC 
AACCGTCACAGTAGAGACTATGGAACCGTCTCAGGATGAGGCACGGACCACAGATAACAATGTGGGTCCCACTCC 
AGTGGTCGACTGGGAGACCACCAATGTGACCACCTCTCTCACACCACAGAGCACAAGGTCGACAGAGAAAACCTT 
CACCATCCCAGTGACTGATATAAACAGTGGGATCCCAGGAATTGATGAGGTCATGAAGACTACCAAAATCATCAT 
TGGGTGTTTTGTGGCCATCACACTCATGGCTGCAGTGATGCTGGTCATTTTCTACAAGATGAGGAAGCAGCACCA 
TCGGCAAAACCATCACGCCCCAACAAGGACTGTTGAAATTATTAATGTGGATGATGAGATTACGGGAGACACACC 
CATGGAAAGCCACCTGCCCATGCCTGCTATCGAGCATGAGCACCTAAATCACTATAACTCATACAAATCTCCCTT 
CAACCACACAACAACAGTTAACACAATAAATTCAATACACAGTTCAGTGCATGAACCGTTATTGATCCGAATGAA 
CTCTAAAGACAATGTACAAGAGACTCAAATCTAAAACATTTACAGAGTTACAAAAAACAAACAATCAAAAAAAAA 
GACAGTTTATTAAAAATGACACAAATGACTGGGCTAAATCTACTGTTTCAAAAAAGTGTCTTTACAAAAAAACAA 
AAAAGAAAAGAAATTTATTTATTAAAAATTCTATTGTGATCTAAAGCAGACAAAAA 





FIGURE 368 



MLNKMTLHPQQIMIGPRFNRALFDPLLVVLLALQLLVVAGLVRAQTCPSVCSCSNQFSKVICVRKNLREVPDGIS
TNTRLLNLHENQIQIIKVNSFKHLRHLEILQLSRNHIRTIEIGAFNGLANLNTLELFDNRLTTIPNGAFVYLSKL 
KELWLRNNPIESIPSYAFNRIPSLRRLDLGELKRLSYISEGAFEGLSNLRYLNLAMCNLREIPNLTPLIKLDELD 
LSGNHLSAIRPGSFQGLMHLQKLWMIQSQIQVIERNAFDNLQSLVEINLAHNNLTLLPHDLFTPLHHLERIHLHH 
NPWNCNCDILWLSWWIKDMAPSNTACCARCNTPPNLKGRYIGELDQNYFTCYAPVIVEPPADLNVTEGMAAELKC 
RASTSLTSVSWITPNGTVMTHGAYKVRIAVLSDGTLNFTNVTVQDTGMYTCMVSNSVGNTTASATLNVTAATTTP 
FSYFSTVTVETMEPSQDEARTTDNNVGPTPVVDWETTNVTTSLTPQSTRSTEKTFTIPVTDINSGIPGIDEVMKT 
TKIIIGCFVAITLMAAVMLVIFYKMRKQHHRQNHHAPTRTVEIINVDDEITGDTPMESHLPMPAIEHEHLNHYNS 
YKSPFNHTTTVNTINSIHSSVHEPLLIRMNSKDNVQETQI

Important features: 
Signal sequence: 
amino acids 1-44 

Transmembrane domain:

amino acids 523-543

N-glycosylation site.

amino acids 278-282, 364-368, 390-394, 412-416, 415-419, 434-438, 442-446,
488-492, 606-610

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 183-187

Casein kinase II phosphorylation site.

amino acids 268-272, 417-421, 465-469, 579-583, 620-624

N-myristoylation site.

amino acids 40-46, 73-79, 118-124, 191-197, 228-234, 237-243, 391-397,
422-428, 433-439, 531-537
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FIGURE 369 

CAAAACTTGCGTCGCGGAGAGCGCCCAGCTTGACTTGAATGGAAGGAGCCCGAGCCCGCGGAGCGCAGCTGAGAC 

TGGGGGAGCGCGTTCGGCCTGTGGGGCGCCGCTCGGCGCCGGGGCGCAGCAGGGAAGGGGAAGCTGTGGTCTGCC 

CTGCTCCACGAGGCGCCACTGGTGTGAACCGGGAGAGCCCCTGGGTGGTCCCGTCCCCTATCCCTCCTTTATATA 

GAAACCTTCCACACTGGGAAGGCAGCGGCGAGGCAGGAGGGCTCATGGTGAGCAAGGAGGCCGGCTGATCTGCAG 

GCGCACAGCATTCCGAGTTTACAGATTTTTACAGATACCAAATGGAAGGCGAGGAGGCAGAACAGCCTGCCTGGT

TCCATCAGCCCTGGCGCCCAGGCGCATCTGACTCGGCACCCCCTGCAGGCACCATGGCCCAGAGCCGGGTGCTGC 

TGCTCCTGCTGCTGCTGCCGCCACAGCTGCACCTGGGACCTGTGCTTGCCGTGAGGGCCCCAGGATTTGGCCGAA 

GTGGCGGCCACAGCCTGAGCCCCGAAGAGAACGAATTTGCGGAGGAGGAGCCGGTGCTGGTACTGAGCCCTGAGG 

AGCCCGGGCCTGGCCCAGCCGCGGTCAGCTGCCCCCGAGACTGTGCCTGTTCCCAGGAGGGCGTCGTGGACTGTG 

GCGGTATTGACCTGCGTGAGTTCCCGGGGGACCTGCCTGAGCACACCAACCACCTATCTCTGCAGAACAACCAGC 

TGGAAAAGATCTACCCTGAGGAGCTCTCCCGGCTGCACCGGCTGGAGACACTGAACCTGCAAAACAACCGCCTGA 

CTTCCCGAGGGCTCCCAGAGAAGGCGTTTGAGCATCTGACCAACCTCAATTACCTGTACTTGGCCAATAACAAGC 

TGACCTTGGCACCCCGCTTCCTGCCAAACGCCCTGATCAGTGTGGACTTTGCTGCCAACTATCTCACCAAGATCT 

ATGGGCTCACCTTTGGCCAGAAGCCAAACTTGAGGTCTGTGTACCTGCACAACAACAAGCTGGCAGACGCCGGGC 

TGCCGGACAACATGTTCAACGGCTCCAGCAACGTCGAGGTCCTCATCCTGTCCAGCAACTTCCTGCGCCACGTGC 

CCAAGCACCTGCCGCCTGCCCTGTACAAGCTGCACCTCAAGAACAACAAGCTGGAGAAGATCCCCCCGGGGGCCT 

TCAGCGAGCTGAGCAGCCTGCGCGAGCTATACCTGCAGAACAACTACCTGACTGACGAGGGCCTGGACAACGAGA 

CCTTCTGGAAGCTCTCCAGCCTGGAGTACCTGGATCTGTCCAGCAACAACCTGTCTCGGGTCCCAGCTGGGCTGC 

CGCGCAGCCTGGTGCTGCTGCACTTGGAGAAGAACGCCATCCGGAGCGTGGACGCGAATGTGCTGACCCCCATCC 

GCAGCCTGGAGTACCTGCTGCTGCACAGCAACCAGCTGCGGGAGCAGGGCATCCACCCACTGGCCTTCCAGGGCC 

TCAAGCGGTTGCACACGGTGCACCTGTACAACAACGCGCTGGAGCGCGTGCCCAGTGGCCTGCCTCGCCGCGTGC 

GCACCCTCATGATCCTGCACAACCAGATCACAGGCATTGGCCGCGAAGACTTTGCCACCACCTACTTCCTGGAGG 

AGCTCAACCTCAGCTACAACCGCATCACCAGCCCACAGGTGCACCGCGACGCCTTCCGCAAGCTGCGCCTGCTGC 

GCTCGCTGGACCTGTCGGGCAACCGGCTGCACACGCTGCCACCTGGGCTGCCTCGAAATGTCCATGTGCTGAAGG 

TCAAGCGCAATGAGCTGGCTGCCTTGGCACGAGGGGCGCTGGCGGGCATGGCTCAGCTGCGTGAGCTGTACCTCA 

CCAGCAACCGACTGCGCAGCCGAGCCCTGGGCCCCCGTGCCTGGGTGGACCTCGCCCATCTGCAGCTGCTGGACA 

TCGCCGGGAATCAGCTCACAGAGATCCCCGAGGGGCTCCCCGAGTCACTTGAGTACCTGTACCTGCAGAACAACA

AGATTAGTGCGGTGCCCGCCAATGCCTTCGACTCCACGCCCAACCTCAAGGGGATCTTTCTCAGGTTTAACAAGC 

TGGCTGTGGGCTCCGTGGTGGACAGTGCCTTCCGGAGGCTGAAGCACCTGCAGGTCTTGGACATTGAAGGCAACT 

TAGAGTTTGGTGACATTTCCAAGGACCGTGGCCGCTTGGGGAAGGAAAAGGAGGAGGAGGAAGAGGAGGAGGAGG 

AGGAAGAGGAAACAAGATAGTGACAAGGTGATGCAGATGTGACCTAGGATGATGGACCGCCGGACTCTTTTCTGC 

AGCACACGCCTGTGTGCTGTGAGCCCCCCACTCTGCCGTGCTCACACAGACACACCCAGCTGCACACATGAGGCA 

TCCCACATGACACGGGCTGACACAGTCTCATATCCCCACCCCTTCCCACGGCGTGTCCCACGGCCAGACACATGC 

ACACACATCACACCCTCAAACACCCAGCTCAGCCACACACAACTACCCTCCAAACCACCACAGTCTCTGTCACAC 

CCCCACTACCGCTGCCACGCCCTCTGAATCATGCAGGGAAGGGTCTGCCCCTGCCCTGGCACACACAGGCACCCA 

TTCCCTCCCCCTGCTGACATGTGTATGCGTATGCATACACACCACACACACACACATGCACAAGTCATGTGCGAA 

CAGCCCTCCAAAGCCTATGCCACAGACAGCTCTTGCCCCAGCCAGAATCAGCCATAGCAGCTCGCCGTCTGCCCT 

GTCCATCTGTCCGTCCGTTCCCTGGAGAAGACACS^GGGTATCCATGCTCTGTGGCCAGGTGCCTGCCACCCTCT 

GGAACTCACAAAAGCTGGCTTTTATTCCTTTCCCATCCTATGGGGACAGGAGCCTTCAGGACTGCTGGCCTGGCC 

TGGCCCACCCTGCTCCTCCAGGTGCTGGGCAGTCACTCTGCTAAGAGTCCCTCCCTGCCACGCCCTGGCAGGACA 

CAGGCACTTTTCCAATGGGCAAGCCCAGTGGAGGCAGGATGGGAGAGCCCCCTGGGTGCTGCTGGGGCCTTGGGG 

CAGGAGTGAAGCAGAGGTGATGGGGCTGGGCTGAGCCAGGGAGGAAGGACCCAGCTGCACCTAGGAGACACCTTT 

GTTCTTCAGGCCTGTGGGGGAAGTTCCGGGTGCCTTTATTTTTTATTCTTTTCTAAGGAAAAAAATGATAAAAAT 

CTCAAAGCTGATTTTTCTTGTTATAGAAAAACTAATATAAAAGCATTATCCCTATCCCTGCAAAAAAAAAA 
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FIGURE 370 

MEGEEAEQPAWFHQPWRPGASDSAPPAGTMAQSRVLLLLLLLPPQLHLGPVLAVRAPGFGRSG 

GHSLSPEENEFAEEEPVLVLSPEEPGPGPAAVSCPRDCACSQEGVVDCGGIDLREFPGDLPEH

TNHLSLQNNQLEKIYPEELSRLHRLETLNLQNNRLTSRGLPEKAFEHLTNLNYLYLANNKLTL 

APRFLPNALISVDFAANYLTKIYGLTFGQKPNLRSVYLHNNKLADAGLPDNMFNGSSNVEVLI

LSSNFLRHVPKHLPPALYKLHLKNNKLEKIPPGAFSELSSLRELYLQNNYLTDEGLDNETFWK 

LSSLEYLDLSSNNLSRVPAGLPRSLVLLHLEKNAIRSVDANVLTPIRSLEYLLLHSNQLREQG 

IHPLAFQGLKRLHTVHLYNNALERVPSGLPRRVRTLMILHNQITGIGREDFATTYFLEELNLS 

YNRITSPQVHRDAFRKLRLLRSLDLSGNRLHTLPPGLPRNVHVLKVKRNELAALARGALAGMA 

QLRELYLTSNRLRSRALGPRAWVDLAHLQLLDIAGNQLTEIPEGLPESLEYLYLQNNKISAVP 

ANAFDSTPNLKGIFLRFNKLAVGSVVDSAFRRLKHLQVLDIEGNLEFGDISKDRGRLGKEKEE 
EEEEEEEEEETR 

Important features: 
Signal sequence:
amino acids 1-48 

N-glycosylation site. 

amino acids 243-247, 310-314, 328-332, 439-443 
Casein kinase II phosphorylation site. 

amino acids 68-72, 84-88, 246-250, 292-296, 317-321, 591-595 
N-myristoylation site.

amino acids 19-25, 107-113, 213-219, 217-223, 236-242, 335-341, 
477-483, 498-502, 539-545, 548-554 

Leucine zipper pattern. 

amino acids 116-138, 251-273, 258-280, 322-344, 464-486, 471-493, 
535-557 
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CACTTTCTCCCTCTCTTCCTTTACTTTCGAGAAACCGCGCTTCCGCTTCTGGTCGCAGAGACCTCGGAGACCGCG 
S^^^^S^^'^^^^^^'^^^^C'^'^TGGGTGGGGGGGACCTGTGGCTGCTCGTACCGCCcSSS^^^ 

tgccgtcctccggaagaccttttcccctgctctgtttccttcaccgagtctgtgcatcgccSSSScSg 

gaggaggcttggccggcgggagatgctctaggggcggcgcgggaggagcggccggcgSacgg^^^ 

^^^^'''^''^''^''^^^^^^'^'^^ctcttgctggcgtactgcctgctccttgcctttg^^ 

gagtcgtgtgccccatgtccagggggaacagcaggagtgggaggggactgaggagctgccgtcgcctSg^^^ 
^^^^''^'^''''^'^'''^^^^'^^^'^tacaggcccagtcaggaccaggggct^^^^ 
gcgctgctgtgaccccggtacctccatgtacccggcgaccgccgtgccccagatcaacatcactatS^^ 
ggagaagggtgaccgcggagatcgaggcctccaagggaaatatggcaaaacaggctcagcagSgc^^^ 

r^$^^^^'^^^®^*^*«^<5°«CTCCATGGGGGCCCCTGGGGAGCGGTGcS^^ 

ggtgggccggaagaagcccatgcacagcaaccactactaccagacggtgatcttcgacacggS^ 
ctacgaccacttcaacatgttcaccggcaagttctactgctacgtgccSgSS^ 

r^^r^Jf^^'^^^'^'^^^^^^^^^^C'^^'^^C^TCATGAAGAACGAGGAGGAGG^^^^ 

r^^^r^^^^^^^''^^'''''''^''''^^^''^''^^^^^'=^^^^^C'^^<^^<^'^CCTACATCACC 
GGTCAAGCACGCCACCGAGCCCTMCTGGCCGGCCACCTCCTTTCCTCTCGCCACCTTCCACCCCTGCGCTGTGT 

tgaccccaccgcctcttccccgatccctggactccgactccctggctttgSttSctSSSS^ 

acagaaagccaaagcgatcggtgctcccagatcccgcagcctctggagagagcSSSSStgaaSS?Sg 

ggcggggcacccgcgagaaccctctgggaccttccgcggccctctctgcacacatcctcSctg 

cgagacgcgggtggcggcagggcgtcccagggtgcggcaccgcggctcSctccttgS^S^J5S 

ctaaaggtctcaaaaggagcaaagtaaaccgtggaggacaaagaaaagggttSatto^ 

cctgctggctcccaagagagaggccttttcagttgagactctgcttaagagaagatccamgtS^ 

gtcaggggaggggccgggggcaggaaactacctctggcttaattcttttaagcScSag^ 

ataggtggaccctgacatccctgtggccttgcccaagggctctgctggtctttcStcSSgS 

^S^2S^S^i^?^°'^^^^'^^^''^°^^^^*^^CCCCTGGGTCCCCCAGGCCTGCAGATGTTT^ 

^^™^*^^^^^^'^^^'^'^^'^*^"^<^'='^^'fGCTCCACCCCTGTGCCACCCCAGAGCCCTGGGGSTGCT^^ 

^S^™SSf'°°^''''^°°^'''^^^°'^^CC<^CCTCCCACACAAATCAGCCCCAGAAGGCCcS^^ 

CTCTTTTTTATAAAACACCTCAAGCAGCACTGCAGTCTCCCATCTCCTCGTGGGCTAAGCATCAcS^^ 

?^^^SJ^r^^^'^^^^^^^^^<^<^T'^^^<=C'^<^'^CCCCTTCTGCCCCCACTGcSTCMC^^^ 

r^S™^^*^^°^*^^^^'^"^'^^°=CTTCAGAGCAGGGGAGAGCTGGAAGGGGCTAGAAASTCcScT^ 

GTTTCTCAGGCTCCTGTGAGCCTCAGTCCTGAGACCAGAGTCAAGAGGAAGTACACGTCCCAATCACC^^^ 

ggattcactctcaggagctgggtggcaggagaggcaatagcccctgtggcaattgSggacc^^^ 
ttgcggtgtctccacggtgctctcgccctgcccatggccaccccagactctgatctccagS^^ 
IS^^S''''''^''''''''^^^^^^'''^^'^'^^ccagggtcactcttgctacccgctgggccSSaa 
ttccttccccccatcccccacctggttttgactaatcctgcttccctctctgggcctSctSgggm^ 

cagagcgccacactcgctgcttaagctcccccagctctttccagaaaacattaaactcagaattgtgt^^^ 
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FIGURE 372 



MGSRGQGLLLAYCLLLAFASGLVLSRVPHVQGEQQEWEGTEELPSPPDHAERAEEQHEKYRPS 
QDQGLPASRCLRCCDPGTSMYPATAVPQINITILKGEKGDRGDRGLQGKYGKTGSAGARGHTG 
PKGQKGSMGAPGERCKSHYAAFSVGRKKPMHSNHYYQTVIFDTEFVNLYDHFNMFTGKFYCYV 
PGLYFFSLNVHTWNQKETYLHIMKNEEEWILFAQVGDRSIMQSQSLMLELREQDQVWVRLYK 
GERENAIFSEELDTYITFSGYLVKHATEP 

Important features: 
Signal sequence. 

amino acids 1-25 

N-glycosylation site.

amino acids 93-97

N-myristoylation sites.

amino acids 7-13, 21-27, 67-73, 117-123, 129-135

Amidation site.
amino acids 150-154

Cell attachment sequence.

amino acids 104-107 
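
The N-glycosylation site listed above for FIGURE 372 can be cross-checked against the sequence with a short scan. The following is a minimal sketch only. It assumes the site was predicted with the PROSITE-style consensus N-{P}-[ST]-{P}; the source does not state which method was used. The function name `find_n_glyc_sites` and the test fragment are illustrative, not part of the original disclosure.

```python
import re

# Assumed consensus (PROSITE-style): Asn, any residue except Pro,
# Ser or Thr, any residue except Pro. The lookahead lets overlapping
# matches be reported.
N_GLYC = re.compile(r"(?=N[^P][ST][^P])")


def find_n_glyc_sites(seq):
    """Return 1-based start positions of N-{P}-[ST]-{P} matches in seq."""
    return [m.start() + 1 for m in N_GLYC.finditer(seq.upper())]


# Residues 90-101 of the FIGURE 372 polypeptide (hypothetical test fragment).
fragment = "PQINITILKGEK"
print(find_n_glyc_sites(fragment))  # position within the fragment, 1-based
```

Because the fragment starts at residue 90, adding an offset of 89 to the reported position gives the asparagine at residue 93, which agrees with the start of the listed site.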
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CGGAGTGGTGCGCCAACGTGAGAGGAAACCCGTGCGCGGCTGCGCTTTCCTGTCCCCAAGCCG 

TTCTAGACGCGGGAAAAATGCTTTCTGAAAGCAGCTCCTTTTTGAAGGGTGTGATGCTTGGAA

GCATTTTCTGTGCTTTGATCACTATGCTAGGACACATTAGGATTGGTCATGGAAATAGAATGC 

ACCACCATGAGCATCATCACCTACAAGCTCCTAACAAAGAAGATATCTTGAAAATTTCAGAGG 

ATGAGCGCATGGAGCTCAGTAAGAGCTTTCGAGTATACTGTATTATCCTTGTAAAACCCAAAG 

ATGTGAGTCTTTGGGCTGCAGTAAAGGAGACTTGGACCAAACACTGTGACAAAGCAGAGTTCT 

TCAGTTCTGAAAATGTTAAAGTGTTTGAGTCAATTAATATGGACACAAATGACATGTGGTTAA 

TGATGAGAAAAGCTTACAAATACGCCTTTGATAAGTATAGAGACCAATACAACTGGTTCTTCC 

TTGCACGCCCCACTACGTTTGCTATCATTGAAAACCTAAAGTATTTTTTGTTAAAAAAGGATC 

CATCACAGCCTTTCTATCTAGGCCACACTATAAAATCTGGAGACCTTGAATATGTGGGTATGG 

AAGGAGGAATTGTCTTAAGTGTAGAATCAATGAAAAGACTTAACAGCCTTCTCAATATCCCAG 

AAAAGTGTCCTGAACAGGGAGGGATGATTTGGAAGATATCTGAAGATAAACAGCTAGCAGTTT 

GCCTGAAATATGCTGGAGTATTTGCAGAAAATGCAGAAGATGCTGATGGAAAAGATGTATTTA 

ATACCAAATCTGTTGGGCTTTCTATTAAAGAGGCAATGACTTATCACCCCAACCAGGTAGTAG 

AAGGCTGTTGTTCAGATATGGCTGTTACTTTTAATGGACTGACTCCAAATCAGATGCATGTGA 

TGATGTATGGGGTATACCGCCTTAGGGCATTTGGGCATATTTTCAATGATGCATTGGTTTTCT 

TACCTCCAAATGGTTCTGACAATGACS^GAAGTGGTAGAAAAGCGTGAATATGATCTTTGTA 

TAGGACGTGTGTTGTCATTATTTGTAGTAGTAACTACATATCCAATACAGCTGTATGTTTCTT 

TTTCTTTTCTAATTTGGTGGCACTGGTATAACCACACATTAAAGTCAGTAGTACATTTTTAAA 

TGAGGGTGGTTTTTTTCTTTAAAACACATGAACATTGTAAATGTGTTGGAAAGAAGTGTTTTA 

AGAATAATAATTTTGCAAATAAACTATTAATAAATATTATATGTGATAAATTCTAAATTATGA 

ACATTAGAAATCTGTGGGGCACATATTTTTGCTGATTGGTTAAAAAATTTTAACAGGTCTTTA 

GCGTTCTAAGATATGCAAATGATATCTCTAGTTGTGAATTTGTGATTAAAGTAAAACTTTTAG 

CTGTGTGTTCCCTTTACTTCTAATACTGATTTATGTTCTAAGCCTCCCCAAGTTCCAATGGAT 

TTGCCTTCTCAAAATGTACAACTAAGCAACTAAAGAAAATTAAAGTGAAAGTTGAAAAAT 
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FIGURE 374 

MLSESSSFLKGVMLGSIFCALITMLGHIRIGHGNRMHHHEHHHLQAPNKEDILKISEDERMELSKSFRVYCIILV 
KPKDVSLWAAVKETWTKHCDKAEFFSSENVKVFESINMDTNDMWLMMRKAYKYAFDKYRDQYNWFFLARPTTFAI 
IENLKYFLLKKDPSQPFYLGHTIKSGDLEYVGMEGGIVLSVESMKRLNSLLNIPEKCPEQGGMIWKISEDKQLAV
CLKYAGVFAENAEDADGKDVFNTKSVGLSIKEAMTYHPNQVVEGCCSDMAVTFNGLTPNQMHVMMYGVYRLRAFG 
HIFNDALVFLPPNGSDND 

Important features:
Signal sequence: 
amino acids 1-33 

N-glycosylation site. 

amino acids 121-125, 342-346 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 319-323, 464-468 

Casein kinase II phosphorylation site. 

amino acids 64-132, 150-154, 322-326, 331-335, 368-372, 385-389, 399-403, 
409-413, 473-477, 729-733, 748-752

Tyrosine kinase phosphorylation site. 

amino acids 736-743 

N-myristoylation site.

amino acids 19-25, 23-29, 136-142, 397-403, 441-447, 544-550, 558-564, 
651-657, 657-663, 672-672 

Prokaryotic membrane lipoprotein lipid attachment site.
amino acids 14-25 



Cell attachment sequence. 

amino acids 247-250 




FIGURE 375 

GTTGTGTCCTTCAGCAAAACAGTGGATTTAAATCTCCTTGCACAAGCTTGAGAGCAACACAAT 
CTATCAGGAAAGAAAGT^GAAAAAAACCGAACCTGACAAAAAAGAAGAAAAAGAAGAAGAAA 
AAAAATCATGAAAACCATCCAGCCAAAAATGCACAATTCTATCTCTTGGGCAATCTTCACGGG 
GCTGGCTGCTCTGTGTCTCTTCCAAGGAGTGCCCGTGCGCAGCGGAGATGCCACCTTCCCCAA 
AGCTATGGACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAGGTGCACTATTGACAA 
CCGGGTCACCCGGGTGGCCTGGCTAAACCGCAGCACCATCCTCTATGCTGGGAATGACAAGTG 
GTGCCTGGATCCTCGCGTGGTCCTTCTGAGCAACACCCAAACGCAGTACAGCATCGAGATCCA 
GAACGTGGATGTGTATGACGAGGGCCCTTACACCTGCTCGGTGCAGACAGACAACCACCCAAA 
GACCTCTAGGGTCCACCTCATTGTGCAAGTATCTCCCAAAATTGTAGAGATTTCTTCAGATAT 
CTCCATTAATGAAGGGAACAATATTAGCCTCACCTGCATAGCAACTGGTAGACCAGAGCCTAC 
GGTTACTTGGAGACACATCTCTCCCAAAGCGGTTGGCTTTGTGAGTGAAGACGT^TACTTGGA 
AATTCAGGGCATCACCCGGGAGCAGTCAGGGGACTACGAGTGCAGTGCCTCCAATGACGTGGC 
CGCGCCCGTGGTACGGAGAGTAAAGGTCACCGTGAACTATCCACCATACATTTCAGAAGCCAA 
GGGTACAGGTGTCCCCGTGGGACAAAAGGGGACACTGCAGTGTGAAGCCTCAGCAGTCCCCTC 
AGCAGAATTCCAGTGGTACAAGGATGACAAAAGACTGATTG/y^GGAAAGAAAGGGGTGAAAGT 
GGAAAACAGACCTTTCCTCTCAAAACTCATCTTCTTCAATGTCTCTGAACATGACTATGGGAA 
CTACACTTGCGTGGCCTCCAACAAGCTGGGCCACACCAATGCCAGCATCATGCTATTTGGTCC 
AGGCGCCGTCAGCGAGGTGAGCAACGGCACGTCGAGGAGGGCAGGCTGCGTCTGGCTGCTGCC 
TCTTCTGGTCTTGCACCTGCTTCTCAAATTTTGATGTGAGTGCCACTTCCCCACCCGGGAAAG 
GCTGCCGCCACCACCACCACCAACACAACAGCAATGGCAACACCGACAGCAACCAATCAGATA 
TATACAAATGAAATTAGAAGAAACACAGCCTCATGGGAGAGAAATTTGAGGGAGGGG7y\CAAA 
GAATACTTTGGGGGGAAAAGAGTTTTAAAAAAGAAATTGAAAATTGCCTTGCAGATATTTAGG 
TACAATGGAGTTTTCTTTTCCCAAACGGGAAGAACACAGCACACCCGGCTTGGACCCACTGCA 
AGCTGCATCGTGCAACCTCTTTGGTGCCAGTGTGGGCAAGGGCTCAGCCTCTCTGCCCACAGA 
GTGCCCCCACGTGGAACATTCTGGAGCTGGCCATCCCAAATTC7U\TCAGTCCATAGAGACGAA 
CAGAATGAGACCTTCCGGCCCAAGCGTGGCGCTGCGGGCACTTTGGTAGACTGTGCCACCACG 
GCGTGTGTTGTGAAACGTGAAATAAAAAGAGCAAAAAAAAA 
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FIGURE 376 

MKTIQPKMHNSISWAIFTGLAALCLFQGVPVRSGDATFPKAMDNVTVRQGESATLRCTIDNRV
TRVAWLNRSTILYAGNDKWCLDPRVVLLSNTQTQYSIEIQNVDVYDEGPYTCSVQTDNHPKTS
RVHLIVQVSPKIVEISSDISINEGNNISLTCIATGRPEPTVTWRHISPKAVGFVSEDEYLEIQ 
GITREQSGDYECSASNDVAAPVVRRVKVTVNYPPYISEAKGTGVPVGQKGTLQCEASAVPSAE 
FQWYKDDKRLIEGKKGVKVENRPFLSKLIFFNVSEHDYGNYTCVASNKLGHTNASIMLFGPGA 
VSEVSNGTSRRAGCVWLLPLLVLHLLLKF 

Important features: 
Signal peptide: 

amino acids 1-28 
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FIGURE 377 

CTTCTTTGAAAAGGATTATCACCTGATCAGGTTCTCTCTGCATTTGCCCCTTTAGATTGTGAA 

ATGTGGCTCAAGGTCTTCACAACTTTCCTTTCCTTTGCAACAGGTGCTTGCTCGGGGCTGAAG 

GTGACAGTGCCATCACACACTGTCCATGGCGTCAGAGGTCAGGCCCTCTACCTACCCGTCCAC 

TATGGCTTCCACACTCCAGCATCAGACATCCAGATCATATGGCTATTTGAGAGACCCCACACA 

ATGCCCAAATACTTACTGGGCTCTGTGAATAAGTCTGTGGTTCCTGACTTGGAATACCAACAC 

AAGTTCACCATGATGCCACCCAATGCATCTCTGCTTATCAACCCACTGCAGTTCCCTGATGAA 

GGCAATTACATCGTGAAGGTCAACATTCAGGGAAATGGAACTCTATCTGCCAGTCAGAAGATA 

CAAGTCACGGTTGATGATCCTGTCACAAAGCCAGTGGTGCAGATTCATCCTCCCTCTGGGGCT 

GTGGAGTATGTGGGGAACATGACCCTGACATGCCATGTGGAAGGGGGCACTCGGCTAGCTTAC 

CAATGGCTAAAAAATGGGAGACCTGTCCACACCAGCTCCACCTACTCCTTTTCTCCCCAAAAC 

AATACCCTTCATATTGCTCCAGTAACCAAGGAAGACATTGGGAATTACAGCTGCCTGGTGAGG 

AACCCTGTCAGTGAAATGGAAAGTGATATCATTATGCCCATCATATATTATGGACCTTATGGA 

CTTCAAGTGAATTCTGATAAAGGGCTAAAAGTAGGGGAAGTGTTTACTGTTGACCTTGGAGAG 

GCCATCCTATTTGATTGTTCTGCTGATTCTCATCCCCCCAACACCTACTCCTGGATTAGGAGG 

ACTGACAATACTACATATATCATTAAGCATGGGCCTCGCTTAGAAGTTGCATCTGAGAAAGTA 

GCCCAGAAGACAATGGACTATGTGTGCTGTGCTTACAACAACATAACCGGCAGGCAAGATGAA 

SSS^^'^'^^^^^^^^^'^^^'^^^^'^^^^^'^A^'^A^^TGGAGAAGCTTGCACAGAAAGGAAAATCA 

TTGTCACCTTTAGCAAGTATAACTGGAATATCACTATTTTTGATTATATCCATGTGTCTTCTC 

TTCCTATGGAAAAAATATCAACCCTACAAAGTTATAAAACAGAAACTAGAAGGCAGGCCAGAA 

ACAGAATACAGGAAAGCTCAAACATTTTCAGGCCATGAAGATGCTCTGGATGACTTCGGAATA 

TATGAATTTGTTGCTTTTCCAGATGTTTCTGGTGTTTCCAGGATTCCAAGCAGGTCTGTTCCA 

GCCTCTGATTGTGTATCGGGGCAAGATTTGCACAGTACAGTGTATGAAGTTATTCAGCACATC 

CCTGCCCAGCAGCAAGACCATCCAGAGTGAACTTTCATGGGCTAAACAGTACATTCGAGTGAA 

ATTCTGAAGAAACATTTTAAGGAAAAACAGTGGAAAAGTATATTAATCTGGAATCAGTGAAGA 

AACCAGGACCAACACCTCTTACTCATTATTCCTTTACATGCAGAATAGAGGCATTTATGCAAA 

TTGAACTGCAGGTTTTTCAGCATATACACAATGTCTTGTGCAACAGAAAAACATGTTGGGGAA 

ATATTCCTCAGTGGAGAGTCGTTCTCATGCTGACGGGGAGAACGAAAGTGACAGGGGTTTCCT 

CATAAGTTTTGTATGAAATATCTCTACAAACCTCAATTAGTTCTACTCTACACTTTCACTATC 

ATCAACACTGAGACTATCCTGTCTCACCTACAAATGTGGAAACTTTACATTGTTCGATTTTTC 

AGCAGACTTTGTTTTATTAAATTTTTATTAGTGTTAAGAATGCTAAATTTATGTTTCAATTTT 

ATTTCCAAATTTCTATCTTGTTATTTGTACAACAAAGTAATAAGGATGGTTGTCACAAAAACA 

AAACTATGCCTTCTCTTTTTTTTCAATCACCAGTAGTATTTTTGAGAAGACTTGTGAACACTT 

?iS™I°^^^^'^^^^"^^'^'^^'^TT'r'rATTTTTTTCAAGGAAAGATGGATTCAAATAAATT 
ATTCTGTTTTTGCTTTTAAAAAAAAAAAAAA 
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FIGURE 378 



MWLKVFTTFLSFATGACSGLKVTVPSHTVHGVRGQALYLPVHYGFHTPASDIQIIWLFERPHTMPKYLLGSVNKS 
VVPDLEYQHKFTMMPPNASLLINPLQFPDEGNYIVKVNIQGNGTLSASQKIQVTVDDPVTKPVVQIHPPSGAVEY
VGNMTLTCHVEGGTRLAYQWLKNGRPVHTSSTYSFSPQNNTLHIAPVTKEDIGNYSCLVRNPVSEMESDIIMPII 
YYGPYGLQVNSDKGLKVGEVFTVDLGEAILFDCSADSHPPNTYSWIRRTDNTTYIIKHGPRLEVASEKVAQKTMD 
YVCCAYNNITGRQDETHFTVIITSVGLEKLAQKGKSLSPLASITGISLFLIISMCLLFLWKKYQPYKVIKQKLEG 
RPETEYRKAQTFSGHEDALDDFGIYEFVAFPDVSGVSRIPSRSVPASDCVSGQDLHSTVYEVIQHIPAQQQDHPE 

Important features: 
Signal sequence: 

amino acids 1-18 

Transmembrane domain: 

amino acids 341-359 

N-glycosylation site.

amino acids 73-77, 92-96, 117-121, 153-157, 189-193, 204-208, 276-280, 308-312
Casein kinase II phosphorylation site.

amino acids 129-133, 198-202, 214-218, 388-392, 426-430, 433-437

Tyrosine kinase phosphorylation site.

amino acids 272-280

N-myristoylation site.

amino acids 15-21, 19-25, 118-124, 163-167, 203-209, 231-237, 239-245

Prokaryotic membrane lipoprotein lipid attachment site.

amino acids 7-18 
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FIGURE 379

ATAGTAGAAGAATGTCTCTGAAATTACTGGATGAGTTTCAGTCATACTTTCACATGGGCACAA 

TTTCACATTCAAGCTCCTTATCCTAGGCTAATTTTATATTATGTTAAATCACTTGTTTTTGTT 

CTCACGGCTTCCTGCCTGCTATAGGCATAATTACGAGGAAGCAGAACTTCTCCAGAAGCAAGC 

GCACATGCGTTCCAAAATAAGAGCAAATTCGCTCTAAACACAGGAAAAGACCTGAAGCTTTAA 

TTAAGGGGTTACATCCAACCCCAGAGCGCTTTTGTGGGCACTGATTGCTCCAGCTTCTGCGTC 

ACTGCGCGAGGGAAGAGGGAAGAGGATCCAGGCGTTAGACATGTATAGACACAAAAACAGCTG 

GAGATTGGGCTTAAAATACCCACCAAGCTCCAAAGAAGAGACCCAAGTCCCCAAAACATTGAT 

TTCAGGGCTGCCAGGAAGGAAGAGCAGCAGCAGGGTGGGAGAGAAGCTCCAGTCAGCCCACAA 

GATGCCATTGTCCCCCGGCCTCCTGCTGCTGCTGCTCTCCGGGGCCACGGCCACCGCTGCCCT 

GCCCCTGGAGGGTGGCCCCACCGGCCGAGACAGCGAGCATATGCAGGAAGCGGCAGGAATAAG 

GAAAAGCAGCCTCCTGACTTTCCTCGCTTGGTGGTTTGAGTGGACCTCCCAGGCCAGTGCCGG 

GCCCCTCATAGGAGAGGAAGCTCGGGAGGTGGCCAGGCGGCAGGAAGGCGCACCCCCCCAGCA 

ATCCGCGCGCCGGGACAGAATGCCCTGCAGGAACTTCTTCTGGAAGACCTTCTCCTCCTGCAA 
ATAG 
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FIGURE 380 



MYRHKNSWRLGLKYPPSSKEETQVPKTLISGLPGRKSSSRVGEKLQSAHKMPLSPGLLLLLLS 
GATATAALPLEGGPTGRDSEHMQEAAGIRKSSLLTFLAWWFEWTSQASAGPLIGEEAREVARR 
QEGAPPQQSARRDRMPCRNFFWKTFSSCK 

Important features:
Transmembrane domain:

amino acids 51-69

cAMP- and cGMP-dependent protein kinase phosphorylation sites.

amino acids 35-39, 92-96

N-myristoylation sites.

amino acids 64-70, 75-81, 90-96

Amidation site.

amino acids 33-37 
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FIGURE 381

GGCGCCGGTGCACCGGGCGGGCTGAGCGCCTCCTGCGGCCCGGCCTGCGCGCCCCGGCCCGCC 
GCGCCGCCCACGCCCCAACCCCGGCCCGCGCCCCCTAGCCCCCGCCCGGGCCCGCGCCCGCGC 
CCGCGCCCAGGTGAGCGCTCCGCCCGCCGCGAGGCCCCGCCCCGGCCCGCCCCCGCCCCGCCC 
CGGCCGGCGGGGGAACCGGGCGGATTCCTCGCGCGTCAAACCACCTGATCCCATAAAACATTC 
ATCCTCCCGGCGGCCCGCGCTGCGAGCGCCCCGCCAGTCCGCGCCGCCGCCGCCCTCGCCCTG 

CGCCGAGCCTCGTCCCGCGGCCGGGCCGGGGCCGGGCCGTAGCGGCGGCGCCTGGATGCGGAC 

CCGGCCGCGGGGAGACGGGCGCCCGCCCCGAAACGACTTTCAGTCCCCGACGCGCCCCGCCCA 

ACCCCTACGATGAAGAGGGCGTCCGCTGGAGGGAGCCGGCTGCTGGCATGGGTGCTGTGGCTG 

^JSJS^^^^^'^^^'^^^^^^^^'^^^'^^^^^^GGTGCCTGCGTATGCTACAATGAGCCCAAGGTG 

JSS?S5^^^'^^^^^^^^^^^^^^^^'^^^A^<^CTGTGCCCGTGGGCATCCCTGCTGCCAGCCAG 

CGCATCTTCCTGCACGGCAACCGCATCTCGCATGTGCCAGCTGCCAGCTTCCGTGCCTGCCGC 

AACCTCACCATCCTGTGGCTGCACTCGAATGTGCTGGCCCGAATTGATGCGGCTGCCTTCACT 

GGCCTGGCCCTCCTGGAGCAGCTGGACCTCAGCGATAATGCACAGCTCCGGTCTGTGGACCCT 

SSS5^^^'^^'^^^*^^^^^^^^^^^^^'^^^^C^<=<5CTGCACCTGGACCGCTGCGGCCTGCAGGAG 

S^SSJ^^^^^^^'^^'^'^^^^^^^^^'^^^^'^^CCCTGCAGTACCTCTACCTGCAGGACAACGCG 

CTGCAGGCACTGCCTGATGACACCTTCCGCGACCTGGGCAACCTCACACACCTCTTCCTGCAC 

SS^S^^^^-^^'^^'^^^^^^^'^^^^^^^^CGCGCCTTCCGTGGGCTGCACAGCCTCGACCGTCTC 

CTACTGCACCAGAACCGCGTGGCCCATGTGCACCCGCATGCCTTCCGTGACCTTGGCCGCCTC 

SJSJSS'^^'^^'^^'^^'^'^'^^^^'^^^'^^^^'^C^GCGCTGCCCACTGAGGCCCTGGCCCCCCTG 

CGTGCCCTGCAGTACCTGAGGCTCAACGACAACCCCTGGGTGTGTGACTGCCGGGCACGCCCA 

So™^^^'^^^^"^^^^^^^'^'^^^^^^^C^<^CTCCTCCGAGGTGCCCTGCAGCCTCCCGCAA 

CGCCTGGCTGGCCGTGACCTCAAACGCCTAGCTGCCAATGACCTGCAGGGCTGCGCTGTGGCC 

ACCGGCCCTTACCATCCCATCTGGACCGGCAGGGCCACCGATGAGGAGCCGCTGGGGCTTCCC 

AAGTGCTGCCAGCCAGATGCCGCTGACAAGGCCTCAGTACTGGAGCCTGGAAGACCAGCTTCG 

oSJSSS^^^^'^^'^^^^^^^^^^^'^^^^^CCCGGTGACAGCCCGCCGGGCAACGGCTCTGGC 

CCACGGCACATCAATGACTCACCCTTTGGGACTCTGCCTGGCTCTGCTGAGCCCCCGCTCACT 

GCAGTGCGGCCCGAGGGCTCCGAGCCACCAGGGTTCCCCACCTCGGGCCCTCGCCGGAGGCCA 

SSHJSr^^^^^^^^^^^^^^^^^^A^C^CTGCCGTCTGGGCCAGGCAGGCAGCGGGGGT 

GGCGGGACTGGTGACTCAGAAGGCTCAGGTGCCCTACCCAGCCTCACCTGCAGCCTCACCCCC 

CTGGGCCTGGCGCTGGTGCTGTGGACAGTGCTTGGGCCCTGCTGACCCCCAGCGGACACAAGA 

GCGTGCTCAGCAGCCAGGTGTGTGTACATACGGGGTCTCTCTCCACGCCGCCAAGCCAGCCGG 

GCGGCCGACCCGTGGGGCAGGCCAGGCCAGGTCCTCCCTGATGGACGCCTGCCGCCCGCCACC 

S^S'^^^^^^^^^'^^^'^^'^'^^^^^^^^^TCGGCGGCAGCGTTTGTTCCAGAACGCCGCCTC 

CCACCCAGATCGCGGTATATAGAGATATGCATTTTATTTTACTTGTGTAAAAATATCGGACGA 
CGTGGAATAAAGAGCTCTTTTCTTAAAAAAA 
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FIGURE 382 



MKRASAGGSRLLAWVLWLQAWQVAAPCPGACVCYNEPKVTTSCPQQGLQAVPVGIPAASQRIF 
LHGNRISHVPAASFRACRNLTILWLHSNVLARIDAAAFTGLALLEQLDLSDNAQLRSVDPATF 
HGLGRLHTLHLDRCGLQELGPGLFRGLAALQYLYLQDNALQALPDDTFRDLGNLTHLFLHGNR 
ISSVPERAFRGLHSLDRLLLHQNRVAHVHPHAFRDLGRLMTLYLFANNLSALPTEALAPLRAL 
QYLRLNDNPWVCDCRARPLWAWLQKFRGSSSEVPCSLPQRLAGRDLKRLAANDLQGCAVATGP
YHPIWTGRATDEEPLGLPKCCQPDAADKASVLEPGRPASAGNALKGRVPPGDSPPGNGSGPRH 
INDSPFGTLPGSAEPPLTAVRPEGSEPPGFPTSGPRRRPGCSRKNRTRSHCRLGQAGSGGGGT 
GDSEGSGALPSLTCSLTPLGLALVLWTVLGPC 

Important features: 
Signal peptide: 

amino acids 1-26 

Leucine zipper pattern. 

amino acids 135-156 

Glycosaminoglycan attachment site. 

amino acids 436-439 

N-glycosylation site.

amino acids 82-85, 179-183, 237-240, 372-375 and 423-426

VWFC domain 

amino acids 411-425 
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FIGURE 383 

TTCGTGACCCTTGAGAAAAGAGTTGGTGGTAAATGTGCCACGTCTTCTAAGAAGGGGGAGTCCTGAACTTGTCTG 

AAGCCCTTGTCCGTAAGCCTTGAACTACGTTCTTAAATCTATGAAGTCGAGGGACCTTTCGCTGCTTTTGTAGGG

ACTTCTTTCCTTGCTTCAGCAACATGAGGCTTTTCTTGTGGAACGCGGTCTTGACTCTGTTCGTCACTTCTTTGA

TTGGGGCTTTGATCCCTGAACCAGAAGTGAAAATTGAAGTTCTCCAGAAGCCATTCATCTGCCATCGCAAGACCA 

AAGGAGGGGATTTGATGTTGGTCCACTATGAAGGCTACTTAGAAAAGGACGGCTCCTTATTTCACTCCACTCACA 

AACATAACAATGGTCAGCCCATTTGGTTTACCCTGGGCATCCTGGAGGCTCTCAAAGGTTGGGACCAGGGCTTGA 

AAGGAATGTGTGTAGGAGAGAAGAGAAAGCTCATCATTCCTCCTGCTCTGGGCTATGGAAAAGAAGGAAAAGGTA 

AAATTCCCCCAGAAAGTACACTGATATTTAATATTGATCTCCTGGAGATTCGAAATGGACCAAGATCCCATGAAT 

CATTCCAAGAAATGGATCTTAATGATGACTGGAAACTCTCTAAAGATGAGGTTAAAGCATATTTAAAGAAGGAGT

TTGAAAAACATGGTGCGGTGGTGAATGAAAGTCATCATGATGCTTTGGTGGAGGATATTTTTGATAAAGAAGATG 

AAGACAAAGATGGGTTTATATCTGCCAGAGAATTTACATATAAACACGATGAGTTATAGAGATACATCTACCCTT 

TTAATATAGCACTCATCTTTCAAGAGAGGGCAGTCATCTTTAAAGAACATTTTATTTTTATACAATGTTCTTTCT 

TGCTTTGTTTTTTATTTTTATATATTTTTTCTGACTCCTATTTAAAGAACCCCTTAGGTTTCTAAGTACCCATTT 

CTTTCTGATAAGTTATTGGGAAGAAAAAGCTAATTGGTCTTTGAATAGAAGACTTCTGGACAATTTTTCACTTTC 

ACAGATATGAAGCTTTGTTTTACTTTCTCACTTATAAATTTAAAATGTTGCAACTGGGAATATACCACGACATGA 

GACCAGGTTATAGCACAAATTAGCACCCTATATTTCTGCTTCCCTCTATTTTCTCCAAGTTAGAGGTCAACATTT 

GAAAAGCCTTTTGCAATAGCCCAAGGCTTGCTATTTTCATGTTATAATGAAATAGTTTATGTGTAACTGGCTCTG 

AGTCTCTGCTTGAGGACCAGAGGAAAATGGTTGTTGGACCTGACTTGTTAATGGCTACTGCTTTACTAAGGAGAT 

GTGCAATGCTGAAGTTAGAAACAAGGTTAATAGCCAGGCATGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGA 

GGCTGAGGCGGGCGGATCACCTGAGGTTGGGAGTTCGAGACCAGCCTGACCAACACGGAGAAACCCTATCTCTAC 

TAAAAATACAAAGTAGCCCGGCGTGGTGATGCGTGCCTGTAATCCCAGCTACCCAGGAAGGCTGAGGCGGCAGAA 

TCACTTGAACCCGAGGCCGAGGTTGCGGTAAGCCGAGATCACCTNCAGCCTGGACACTCTGTCTCGAAAAAAGAA 

AAGAACACGGTTAATACCATATNAATATGTATGCATTGAGACATGCTACCTAGGACTTAAGCTGATGAAGCTTGG 

CTCCTAGTGATTGGTGGCCTATTATGATAAATAGGACAAATCATTTATGTGTGAGTTTCTTTGTAATAAAATGTA 

TCAATATGTTATAGATGAGGTAGAAAGTTATATTTATATTCAATATTTACTTCTTAAGGCTAGCGGAATATCCTT 

CCTGGTTCTTTAATGGGTAGTCTATAGTATATTATACTACAATAACATTGTATCATAAGATAAAGTAGTAAACCA 

GTCTACATTTTCCCATTTCTGTCTCATCAAAAACTGAAGTTAGCTGGGTGTGGTGGCTCATGCCTGTAATCCCAG 

CACTTTGGGGGCCAAGGAGGGTGGATCACTTGAGATCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCT 

TGTCTCTACTAAAAATACAAAAATTAGCCAGGCGTGGTGGTGCACACCTGTAGTCCCAGCTACTCGGGAGGCTGA 

GACAGGAGATTTGCTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCCAAGATTGTGCCACTGCACTCCAGCCTGGG 

TGACAGAGCAAGACTCCATCTCAAAAAAAAAAAAAAGAAGCAGACCTACAGCAGCTACTATTGAATAAATACCTA 
TCCTGGATTTT 
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FIGURE 384 

MRLFLWNAVLTLFVTSLIGALIPEPEVKIEVLQKPFICHRKTKGGDLMLVHYEGYLEKDGSLF 
HSTHKHNNGQPIWFTLGILEALKGWDQGLKGMCVGEKRKLIIPPALGYGKEGKGKIPPESTLI 
FNIDLLEIRNGPRSHESFQEMDLNDDWKLSKDEVKAYLKKEFEKHGAVVNESHHDALVEDIFD 
KEDEDKDGFISAREFTYKHDEL 

Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 176-179 

Casein kinase II phosphorylation site. 

amino acids 143-146, 156-159, 178-181 and 200-203 

Endoplasmic reticulum targeting sequence. 

amino acids 208-211

FKBP-type peptidyl-prolyl cis-trans isomerase
amino acids 78-114 and 118-131

EF-hand calcium-binding domain

amino acids 191-203, 184-203 and 140-159

S-100/ICaBP type calcium binding domain

amino acids 183-203 



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FIGURE 385 



CTCCCACGGTGTCCAGCGCCCAGAATGCGGCTTCTGGTCCTGCTATGGGGTTGCCTGCTGCTC 

CCAGGTTATGAAGCCCTGGAGGGCCCAGAGGAAATCAGCGGGTTCGAAGGGGACACTGTGTCC 

CTGCAGTGCACCTACAGGGAAGAGCTGAGGGACCACCGGAAGTACTGGTGCAGGAAGGGTGGG 

ATCCTCTTCTCTCGCTGCTCTGGCACCATCTATGCAGAAGAAGAAGGCCAGGAGACAATGAAG 

GGCAGGGTGTCCATCCGTGACAGCCGCCAGGAGCTCTCGCTCATTGTGACCCTGTGGAACCTC 

ACCCTGCAAGACGCTGGGGAGTACTGGTGTGGGGTCGAAAAACGGGGCCCCGATGAGTCTTTA 

CTGATCTCTCTGTTCGTCTTTCCAGGACCCTGCTGTCCTCCCTCCCCTTCTCCCACCTTCCAG 

CCTCTGGCTACAACACGCCTGCAGCCCAAGGCAAAAGCTCAGCAAACCCAGCCCCCAGGATTG 

ACTTCTCCTGGGCTCTACCCGGCAGCCACCACAGCCAAGCAGGGGAAGACAGGGGCTGAGGCC 

CCTCCATTGCCAGGGACTTCCCAGTACGGGCACGAAAGGACTTCTCAGTACACAGGAACCTCT 

CCTCACCCAGCGACCTCTCCTCCTGCAGGGAGCTCCCGCCCCCCCATGCAGCTGGACTCCACC 

TCAGCAGAGGACACCAGTCCAGCTCTCAGCAGTGGCAGCTCTAAGCCCAGGGTGTCCATCCCG 

ATGGTCCGCATACTGGCCCCAGTCCTGGTGCTGCTGAGCCTTCTGTCAGCCGCAGGCCTGATC 

GCCTTCTGCAGCCACCTGCTCCTGTGGAGAAAGGAAGCTCAACAGGCCACGGAGACACAGAGG 

AACGAGAAGTTCTGGCTCTCACGCTTGACTGCGGAGGAAAAGGAAGCCCCTTCCCAGGCCCCT 

GAGGGGGACGTGATCTCGATGCCTCCCCTCCACACATCTGAGGAGGAGCTGGGCTTCTCGAAG 

TTTGTCTCAGCGTAGGGCAGGAGGCCCTCCTGGCCAGGCCAGCAGTGAAGCAGTATGGCTGGC 

TGGATCAGCACCGATTCCCGAAAGCTTTCCACCTCAGCCTCAGAGTCCAGCTGCCCGGACTCC 

AGGGCTCTCCCCACCCTCCCCAGGCTCTCCTCTTGCATGTTCCAGCCTGACCTAGAAGCGTTT 

GTCAGCCCTGGAGCCCAGAGCGGTGGCCTTGCTCTTCCGGCTGGAGACTGGGACATCCCTGAT 

AGGTTCACATCCCTGGGCAGAGTACCAGGCTGCTGACCCTCAGCAGGGCCAGACAAGGCTCAG 

TGGATCTGGTCTGAGTTTCAATCTGCCAGGAACTCCTGGGCCTCATGCCCAGTGTCGGACCCT 

GCCTTCCTCCCACTCCAGACCCCACCTTGTCTTCCCTCCCTGGCGTCCTCAGACTTAGTCCCA 

CGGTCTCCTGCATCAGCTGGTGATGAAGAGGAGCATGCTGGGGTGAGACTGGGATTCTGGCTT 

CTCTTTGAACCACCTGCATCCAGCCCTTCAGGAAGCCTGTGAAAAACGTGATTCCTGGCCCCA 

CCAAGACCCACCAAAACCATCTCTGGGCTTGGTGCAGGACTCTGAATTCTAACAATGCCCAGT 

GACTGTCGCACTTGAGTTTGAGGGCCAGTGGGCCTGATGAACGCTCACACCCCTTCAGCTTAG 

AGTCTGCATTTGGGCTGTGACGTCTCCACCTGCCCCAATAGATCTGCTCTGTCTGCGACACCA 

GATCCACGTGGGGACTCCCCTGAGGCCTGCTAAGTCCAGGCCTTGGTCAGGTCAGGTGCACAT 

TGCAGGATAAGCCCAGGACCGGCACAGAAGTGGTTGCCTTTNCCATTTGCCCTCCCTGGNCCA 

TGCCTTCTTGCCTTTGGAAT^AAATGATGAAGAAAACCTTGGCTCCTTCCTTGTCTGGAAAGGG 

TTACTTGCCTATGGGTTCTGGTGGCTAGAGAGAAAAGTAGAAAACCAGAGTGCACGTAGGTGT 

CTAACACAGAGGAGAGTAGGAACAGGGCGGATACCTGAAGGTGACTCCGAGTCCAGCCCCCTG 

GAGAAGGGGTCGGGGGTGGTGGTAAAGTAGCACAACTACTATTTTTTTTCTTTTTCCATTATT 

ATTGTTTTTTAAGACAGAATCTCGTGCTGCTGCCCAGGCTGGAGTGCAGTGGCACGATCTGCA 

AACTCCGCCTCCTGGGTTCAAGTGATTCTTCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAG 

GCACGCACCACCACACCTGGCTAATTTTTGTACTTTTAGTAGAGATGGGGTTTCACCATGTTG 

GCCAGGCTGGTCTTGAACTCCTGACCTCAAATGAGCCTCCTGCTTCAGTCTCCCAAATTGCCG 

GGATTACAGGCATGAGCCACTGTGTCTGGCCCTATTTCCTTTAAAAAGTGAAATTAAGAGTTG 

TTCAGTATGCAAAACTTGGAAAGATGGAGGAGAAAAAGAAAAGGAAGAAAAAAATGTCACCCA 

TAGTCTCACCAGAGACTATCATTATTTCGTTTTGTTGTACTTCCTTCCACTCTTTTCTTCTTC 

ACATAATTTGCCGGTGTTCTTTTTACAGAGCAATTATCTTGTATATACAACTTTGTATCCTGC 

CTTTTCCACCTTATCGTTCCATCACTTTATTCCAGCACTTCTCTGTGTTTTACAGACCTTTTT 

ATAAATAAAATGTTCATCAGCTGCATAAAAAAAAAAAAAA 






FIGURE 386 



MRLLVLLWGCLLLPGYEALEGPEEISGFEGDTVSLQCTYREELRDHRKYWCRKGGILFSRCSG 

TIYAEEEGQETMKGRVSIRDSRQELSLIVTLWNLTLQDAGEYWCGVEKRGPDESLLISLFVFP 

GPCCPPSPSPTFQPLATTRLQPKAKAQQTQPPGLTSPGLYPAATTAKQGKTGAEAPPLPGTSQ 

YGHERTSQYTGTSPHPATSPPAGSSRPPMQLDSTSAEDTSPALSSGSSKPRVSIPMVRILAPV 

LVLLSLLSAAGLIAFCSHLLLWRKEAQQATETQRNEKFWLSRLTAEEKEAPSQAPEGDVISMP 
PLHTSEEELGFSKFVSA 

Important features: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 248-269 

N-glycosylation site.

amino acids 96-99

Fibrinogen beta and gamma chains C-terminal domain.
amino acids 104-113

Ig-like V-type domain:

amino acids 13-128 
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FIGURE 387 

GCGCCGGGAGCCCATCTGCCCCCAGGGGCACGGGGCGCGGGGCCGGCTCCCGCCCGGCACATG 

GCTGCAGCCACCTCGCGCGCACCCCGAGGCGCCGCGCCCAGCTCGCCCGAGGTCCGTCGGAGG 

CGCCCGGCCGCCCCGGAGCCAAGCAGCAACTGAGCGGGGAAGCGCCCGCGTCCGGGGATCGGG 

ATGTCCCTCCTCCTTCTCCTCTTGCTAGTTTCGTACTATGTTGGAACCTTGGGGACTCACACT

GAGATCAAGAGAGTGGCAGAGGAAAAGGTCACTTTGCCCTGCCACCATCAACTGGGGCTTCCA 

GAAAAAGACACTCTGGATATTGAATGGCTGCTCACCGATAATGAAGGGAACCAAAAAGTGGTG 

ATCACTTACTCCAGTCGTCATGTCTACAATAACTTGACTGAGGAACAGAAGGGCCGAGTGGCC 

TTTGCTTCCAATTTCCTGGCAGGAGATGCCTCCTTGCAGATTGAACCTCTGAAGCCCAGTGAT 

GAGGGCCGGTACACCTGTAAGGTTAAGAATTCAGGGCGCTACGTGTGGAGCCATGTCATCTTA 

AAAGTCTTAGTGAGACCATCCAAGCCCAAGTGTGAGTTGGAAGGAGAGCTGACAGAAGGAAGT 

GACCTGACTTTGCAGTGTGAGTCATCCTCTGGCACAGAGCCCATTGTGTATTACTGGCAGCGA 

ATCCGAGAGAAAGAGGGAGAGGATGAACGTCTGCCTCCCAAATCTAGGATTGACTACAACCAC 

CCTGGACGAGTTCTGCTGCAGAATCTTACCATGTCCTACTCTGGACTGTACCAGTGCACAGCA 

GGCAACGAAGCTGGGAAGGAAAGCTGTGTGGTGCGAGTAACTGTACAGTATGTACAAAGCATC 

GGCATGGTTGCAGGAGCAGTGACAGGCATAGTGGCTGGAGCCCTGCTGATTTTCCTCTTGGTG 

TGGCTGCTAATCCGAAGGAAAGACAAAGAAAGATATGAGGAAGAAGAGAGACCTAATGAAATT 

CGAGAAGATGCTGAAGCTCCAAAAGCCCGTCTTGTGAAACCCAGCTCCTCTTCCTCAGGCTCT 

CGGAGCTCACGCTCTGGTTCTTCCTCCACTCGCTCCACAGCAAATAGTGCCTCACGCAGCCAG 

CGGACACTGTCAACTGACGCAGCACCCCAGCCAGGGCTGGCCACCCAGGCATACAGCCTAGTG 

GGGCCAGAGGTGAGAGGTTCTGAACCAAAGAAAGTCCACCATGCTAATCTGACCAAAGCAGAA 

ACCACACCCAGCATGATCCCCAGCCAGAGCAGAGCCTTCCAAACGGTCTGAATTACAATGGAC 

TTGACTCCCACGCTTTCCTAGGAGTCAGGGTCTTTGGACTCTTCTCGTCATTGGAGCTCAAGT 

CACCAGCCACACAACCAGATGAGAGGTCATCTAAGTAGCAGTGAGCATTGCACGGAACAGATT 

CAGATGAGCATTTTCCTTATACAATACCAAACAAGCAAAAGGATGTAAGCTGATTCATCTGTA 

/yy^GGCATCTTATTGTGCCTTTAGACCAGAGTAAGGGAAAGCAGGAGTCCAAATCTATTTGT 

TGACCAGGACCTGTGGTGAGAAGGTTGGGGAAAGGTGAGGTGAATATACCTAAAACTTTTAAT 

GTGGGATATTTTGTATCAGTGCTTTGATTCACAATTTTCAAGAGGAAATGGGATGCTGTTTGT 

AAATTTTCTATGCATTTCTGCAAACTTATTGGATTATTAGTTATTCAGACAGTCAAGCAGAAC 

CCACAGCCTTATTACACCTGTCTACACCATGTACTGAGCTAACCACTTCTAAGAAACTCCAAA 

AAAGGAAACATGTGTCTTCTATTCTGACTTAACTTCATTTGTCATAAGGTTTGGAt'ATTAATT 

TCAAGGGGAGTTGAAATAGTGGGAGATGGAGAAGAGTGAATGAGTTTCTCCCACTCTATACTA 

ATCTCACTATTTGTATTGAGCCCAAAATAACTATGAAAGGAGACAAAAATTTGTGACAAAGGA 

TTGTGAAGAGCTTTCCATCTTCATGATGTTATGAGGATTGTTGACAAACATTAGAAATATATA 

ATGGAGCAATTGTGGATTTCCCCTCAAATCAGATGCCTCTAAGGACTTTCCTGCTAGATATTT 

CTGGAAGGAGAAAATACAACATGTCATTTATCAACGTCCTTAGAAAGAATTCTTCTAGAGAAA 

AAGGGATCTAGGAATGCTGAAAGATTACCCAACATACCATTATAGTCTCTTCTTTCTGAGAAA 

ATGTGAAACCAGAATTGCAAGACTGGGTGGACTAGAAAGGGAGATTAGATCAGTTTTCTCTTA 

ATATGTCAAGGAAGGTAGCCGGGCATGGTGCCAGGCACCTGTAGGAAAATCCAGCAGGTGGAG 

GTTGCAGTGAGCCGAGATTATGCCATTGCACTCCAGCCTGGGTGACAGAGCGGGACTCCGTCTC 
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FIGURE 388 

MSLLLLLLLVSYYVGTLGTHTEIKRVAEEKVTLPCHHQLGLPEKDTLDIEWLLTDNEGNQKVV
ITYSSRHVYNNLTEEQKGRVAFASNFLAGDASLQIEPLKPSDEGRYTCKVKNSGRYVWSHVIL 
KVLVRPSKPKCELEGELTEGSDLTLQCESSSGTEPIVYYWQRIREKEGEDERLPPKSRIDYNH 
PGRVLLQNLTMSYSGLYQCTAGNEAGKESCVVRVTVQYVQSIGMVAGAVTGIVAGALLIFLLV 

WLLIRRKDKERYEEEERPNEIREDAEAPKARLVKPSSSSSGSRSSRSGSSSTRSTANSASRSQ
RTLSTDAAPQPGLATQAYSLVGPEVRGSEPKKVHHANLTKAETTPSMIPSQSRAFQTV

Important features:
Signal sequence:
amino acids 1-16 



Transmembrane domain: 

amino acids 232-251 
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FIGURE 389 

GCGGCACCTGGAAGATGCGCCCATTGGCTGGTGGCCTGCTCAAGGTGGTGTTCGTGGTCTTCG

CCTCCTTGTGTGCCTGGTATTCGGGGTACCTGCTCGCAGAGCTCATTCCAGATGCACCCCTGT

CCAGTGCTGCCTATAGCATCCGCAGCATCGGGGAGAGGCCTGTCCTCAAAGCTCCAGTCCCCA 

AAAGGCAAAAATGTGACCACTGGACTCCCTGCCCATCTGACACCTATGCCTACAGGTTACTCA 

GCGGAGGTGGCAGAAGCAAGTACGCCAAAATCTGCTTTGAGGATAACCTACTTATGGGAGAAC 

AGCTGGGAAATGTTGCCAGAGGAATAAACATTGCCATTGTCAACTATGTAACTGGGAATGTGA 

CAGCAACACGATGTTTTGATATGTATGAAGGCGATAACTCTGGACCGATGACAAAGTTTATTC 

AGAGTGCTGCTCCAAAATCCCTGCTCTTCATGGTGACCTATGACGACGGAAGCACAAGACTGA 

ATAACGATGCCAAGAATGCCATAGAAGCACTTGGAAGTAAAGAAATCAGGAACATGAAATTCA 

GGTCTAGCTGGGTATTTATTGCAGCAAAAGGCTTGGAACTCCCTTCCGAAATTCAGAGAGAAA 

AGATCAACCACTCTGATGCTAAGAACAACAGATATTCTGGCTGGCCTGCAGAGATCCAGATAG 

AAGGCTGCATACCCAAAGAACGAAGCTGACACTGCAGGGTCCTGAGTAAATGTGTTCTGTATA 

AACAAATGCAGCTGGAATCGCTCAAGAATCTTATTTTTCTAAATCCAACAGCCCATATTTGAT 

GAGTATTTTGGGTTTGTTGTAAACCAATGAACATTTGCTAGTTGTATCAAATCTTGGTACGCA 

GTATTTTTATACCAGTATTTTATGTAGTGAA(3ATGTCAATTAGCAGGAAACTAAAATGAATGG 
AAATTCTTAAAAAAAAAA 
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FIGURE 390 



MRPLAGGLLKVVFVVFASLCAWYSGYLLAELIPDAPLSSAAYSIRSIGERPVLKAPVPKRQKC
DHWTPCPSDTYAYRLLSGGGRSKYAKICFEDNLLMGEQLGNVARGINIAIVNYVTGNVTATRC 
FDMYEGDNSGPMTKFIQSAAPKSLLFMVTYDDGSTRLNNDAKNAIEALGSKEIRNMKFRSSWV 
FIAAKGLELPSEIQREKINHSDAKNNRYSGWPAEIQIEGCIPKERS 

Important features: 
Signal sequence. 

amino acids 1-20 



N-glycosylation sites.
amino acids 120-124, 208-212 



Glycosaminoglycan attachment site. 

amino acids 80-84 



N-myristoylation sites.



amino acids 81-87, 108-114, 119-125 



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FIGURE 391 

GGGGGCTTTCTTGGGCTTGGCTGCTTGGAACACCTGCCTCCAAGGACCGGCCTCGGAGGGGTCGCCGGGAAaGGG 
AGGGAAGAAGGAAGGGCGGGGCCGGCCCCCCTGCGCCCGCCCCGCGCCTCTGCGCGCCCCTGTCCGCCCCGGCCC 
AGCCCAGCCCAGCCCCGCGGGCCGGTCACACGCGCAGCCAGCCGGCCGCCTCCCGCGCCCAAGCGCGCCGCTCTG 
CTGTGCCCTGCGCCCTTGCCCCGCGCCAGCTTCTGCGCCCGCAGCCCGCCCGGCGCCCCCGGTGACCGTGACCCT 
GCCCTGGGCGCGGGGCGGAGCAGGC^TGrCCCGCCaSGGGACCGCTACCCaVGCGCTGGCCCTGGTGCTCCTGGC 
AGTGACCCTGGCCGGGGTCGGAGCCCAGGGCGCAGCCCTCGAGGACCCTGATTATTACGGGCAGGAGATCTGGAG
CCGGGAGCCCTACTACGCGCGCCCGGAGCCCGAGCTCGAGACCTTCTCTCCGCCGCTGCCTGCGGGGCCCGGGGA 
^^^^I™»^°°^*^^^^*^^<^*°'=^^^CC^CCAAGAGGGCCACCAAGCCCAAGAAAGCTCCCAAGAGGGA 
GAAGTCGGCTCCGGAGCCGCCTCCACCAGGTAAACACAGCAACAAAAAAGTTATGAGAACCAAGAGCTCTGAGAA 
GGCTGCCAACGATGATCACAGTGTCCGTGTGGCCCGTGAAGATGTCAGAGAGAGTTGCCCACCTCTTGGTCTGGA 
^^^'^'^^^^'^^^''^^^^'^'''^C^'^GCCTCCACGGTGAAGCGCTATGGCCTGGGGGCACATCGAGGGAG 
ACTCAACATCCAGGCGGGCATTAATGAAAATGATTTTTATGACGGAGCGTGGTGCGCGGGAAGAAATGACCTCCA 
GCAGTGGATTGAAGTGGATGCTCGGCGCCTGACCAGATTCACTGGTGTCATCACTCAAGGGAGGAACTCCCTCTG 
^S^^*^'^°^^^°°°^^^^^'^'^*='^^'^^^^''CATGGTGAGCAATGACAGCCACACGTGGGTCACTGTTAAGAATGG 
ATCTGGAGACATGATATTTGAGGGAAACAGTGAGAAGGAGATCCCTGTTCTCAATGAGCTACCCGTCCCCATGGT 
GGCCCGCTACATCCGCATAAACCCTCAGTCCTGGTTTGATAATGGGAGCATCTGCATGAGAATGGAGATCCTGGG 
CTGCCCACTGCCAGATCCTAATAATTATTATCACCGCCGGAACGAGATGACCACCACTGATGACCTGGATTTTAA 
GCACCACAATTATAAGGAAATGCGCCAGTTGATGAAAGTTGTGAATGAAATGTGTCCCAATATCACCAGAATTTA 
CAACATTGGAAAAAGCCACCAGGGCCTGAAGCTGTATGCTGTGGAGATCTCAGATCACCCTGGGGAGCATGAAGT 
CGGTGAGCCCGAGTTCCACTACATCGCGGGGGCCCACGGCAATGAGGTGCTGGGCCGGGAGCTGCTGCTGCTGCT 
GGTGCAGTTCGTGTGTCAGGAGTACTTGGCCCGGAATGCGCGCATCGTCCACCTGGTGGAGGAGACGCGGATTCA 
CGTCCTCCCCTCCCTCAACCCCGATGGCTACGAGAAGGCCTACGAAGGGGGCTCGGAGCTGGGAGGCTGGTCCCT 
GGGACGCTGGACCCACGATGGAATTGACATCAACAACAACTTTCCTGATTTAAACACGCTGCTCTGGGAGGCAGA
GGATCGACAGAATGTCCCCAGGAAAGTTCCCAATCACTATATTGCAATCCCTGAGTGGTTTCTGTCGGAAAATGC 
CACGGTGGCTGCCGAGACCAGAGCAGTCATAGCCTGGATGGAAAAAATCCCTTTTGTGCTGGGCGGCAACCTGCA 
SoSoS^™S'^*^^*^°'^*^'^®^^°^^''^^^^^C^^CCTGGTGCGGTCCCCCTGGAAGACGCAGGAAC^^ 

GAGGGTGTGCCACACGGAGGACTTCCAGAAGGAGGAGGGCACTGTCAATGGGGCCTCCTGGCACACCGTCGCTGG 

AAGTCTGAACGATTTCAGCTACCTTCATACAAACTGCTTCGAACTGTCCATCTACGTGGGCTGTGATAAATACCC 

ACATGAGAGCCAGCTGCCCGAGGAGTGGGAGAATAACCGGGAATCTCTGATCGTGTTCATGGAGCAGGTTCATCG 

TGGCATTAAAGGCTTGGTGAGAGATTCACATGGAAAAGGAATCCCAAACGCCATTATCTCCGTAGAAGGCATTAA 

CCATGACATCCGAACAGCCAACGATGGGGATTACTGGCGCCTCCTGAACCCTGGAGAGTATGTGGTCACAGCAAA 

GGCCGAAGGTTTCACTGCATCCACCAAGAACTGTATGGTTGGCTATGACATGGGGGCCACAAGGTGTGACTTCAC 

ACTTAGCAAAACCAACATGGCCAGGATCCGAGAGATCATGGAGAAGTTTGGGAAGCAGCCCGTCAGCCTGCCAGC 

CAGGCGGCTGAAGCTGCGGGGGCGGAAGAGACGACAGCGTGGGTgACCCTCCTGGGCCCTTGAGACTCGTCTGGG 

ACCCATGCAAATTAAACCAACCTGGTAGTAGCTCCATAGTGGACTCACTCACTGTTGTTTCCTCTGTAATTCAAG 

AAGTGCCTGGAAGAGAGGGTGCATTGTGAGGCAGGTCCCAAAAGGGAAGGCTGGAGGCTGAGGCTGTTTTCTTTT 

CTTTGTTCCCATTTATCCAAATAACTTGGACAGAGCAGCAGAGAAAAGCTGATGGGAGTGAGAGAACTCAGCAAG 

CCAACCTGGGAATCAGAGAGAGAAGGA6AAGGAGGGGAGCCTGTCCGTTCAGAGCCTCTGGCTGCATAGAAAAGG 

ATTCTGGTGCTTCCCCTGTTTGCGTGGCAGCAAGGGTTCCACGTGCATTTGCAATTTGCACAGCTAAAATTGCAG 

CATTTCCCCAGCTGGGCTGTCCCAAATGTTACCATTTGAGATGCTCCCAGGCGTCCTAAGAGAATCCACCCTCTC 

TGGCCCTGGGACATTGCAAGCTGCTACAAATAAATTCTGTGTTCTTTTGACAATAGCGTCATTGCCAAGTGCACA 

TCAGTGAGCCTCTTGAATCTGTTTAGTCTCCTTTTTCAACAAAGGAGTGTGTTCAGAAAAGGAGAGAGAGGCTGA 

GATCATTCAGGAGTTTGTTGGGCAGCAAGCATGGAGCTTCTTGCACAAATTCTGGGTCCATAAACAACCCCCAAA 

GTCCCTGCTGATCCAGTAGCCCTGGAGGTTCCCCAGGTAGGGAGAGCCAGAGGTGCCAGCCTTCCTGAAGGGCCA 

GAAAATTTAGCCTGGATCTCCTCTTTTACCTGCTAGGACTGGAAAGAGCCAGAAGTGGGGTGGCCTGAAGCCCTC 

TCTCTGCTTGAGGTATTGCCCCTGTGTGGAATTGAGTGCTCATGGGTTGGCCTCATATCAGCCTGGGAGTTATTT 

TTGATATGTAGAATGCCAGATCTTCCAGATTAGGCTAAATGTAATGAAAACCTCTTAGGATTATCTGTGGAGCAT 

CAGTTTGGGAAGAATTATTGAATTATCTTGCAAGAAAAAAGTATGTCTCACTTTTTGTTAATGTTGCTGCCTCAT 

TGACCTGGGAAAAATGAAAAAAAAAAATAAAGCAAATGGTAAGACCCTTAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAA
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FIGURE 392 

MSRPGTATPALALVLLAVTLAGVGAQGAALEDPDYYGQEIWSREPYYARPEPELETFSPPLPA 
GPGEEWERRPQEPRPPKRATKPKKAPKREKSAPEPPPPGKHSNKKVMRTKSSEKAANDDHSVR 
VAREDVRESCPPLGLETLKITDFQLHASTVKRYGLGAHRGRLNIQAGINENDFYDGAWCAGRN 
DLQQWIEVDARRLTRFTGVITQGRNSLWLSDWVTSYKVMVSNDSHTWVTVKNGSGDMIFEGNS
EKEIPVLNELPVPMVARYIRINPQSWFDNGSICMRMEILGCPLPDPNNYYHRRNEMTTTDDLD 
FKHHNYKEMRQLMKVVNEMCPNITRIYNIGKSHQGLKLYAVEISDHPGEHEVGEPEFHYIAGA
HGNEVLGRELLLLLVQFVCQEYLARNARIVHLVEETRIHVLPSLNPDGYEKAYEGGSELGGWS 
LGRWTHDGIDINNNFPDLNTLLWEAEDRQNVPRKVPNHYIAIPEWFLSENATVAAETRAVIAW 
MEKIPFVLGGNLQGGELVVAYPYDLVRSPWKTQEHTPTPDDHVFRWLAYSYASTHRLMTDARR 
RVCHTEDFQKEEGTVNGASWHTVAGSLNDFSYLHTNGFELSIYVGCDKYPHESQLPEEWENNR 
ESLIVFMEQVHRGIKGLVRDSHGKGIPNAIISVEGINHDIRTANDGDYWRLLNPGEYVVTAKA
EGFTASTKNCMVGYDMGATRCDFTLSKTNMARIREIMEKFGKQPVSLPARRLKLRGRKRRQRG 




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FIGURE 393 



GTCCCACATCCTGCTCAACTGGGTCAGGTCCCTCTTAGACCAGCTCTTGTCCATCATTTGCTGAAGTGGACCAAC 

TAGTTCCCCAGTAGGGGGTCTCCCCTGGCAATTCTTGATCGGCGTTTGGACATCTCAGATCGCTTCCAATGAAGA 

TGGCCTTGCCTTGGGGTCCTGCTTGTTTCATAATCATCTAACTATGGGACAAGGTTGTGCCGGCAGCTCTGGGGG 

AAGGAGCACGGGGCTGATCAAGCCATCCAGGAAACACTGGAGGACTTGTCCAGCCTTGAAAGAACTCTAGTGGTT 

TCTGAATCTAGCCCACTTGGCGGTAAGCATCATGCAACTTCTGCAACTTCTGCTGGGGCTTTTGGGGCCAGGTGG 

CTACTTATTTCTTTTAGGGGATTGTCAGGAGGTGACCACTCTCACGGTGAAATACCAAGTGTCAGAGGAAGTGCC 

ATCTGGTACAGTGATCGGGAAGCTGTCCCAGGAACTGGGCCGGGAGGAGAGGCGGAGGCAAGCTGGGGCCGCCTT 

CCAGGTGTTGCAGCTGCCTCAGGCGCTCCCCATTCAGGTGGACTCTGAGGAAGGCTTGCTCAGCACAGGCAGGCG 

GCTGGATCGAGAGCAGCTGTGCCGACAGTGGGATCCCTGCCTGGTTTCCTTTGATGTGCTTGCCACAGGGGATTT 

GGCTCTGATCCATGTGGAGATCCAAGTGCTGGACATCAATGACCACCAGCCACGGTTTCCCAAAGGCGAGCAGGA 

GCTGGAAATCTCTGAGAGCGCCTCTCTGCGAACCCGGATCCCCCTGGACAGAGCTCTTGACCCAGAGACAGGCCC 

TAACACCCTGCACACCTACACTCTGTCTCCCAGTGAGCACTTTGCCTTGGATGTCATTGTGGGCCCTGATGAGAC 

CAAACATGCAGAACTCATAGTGGTGAAGGAGCTGGACAGGGAAATCCATTCATTTTTTGATCTGGTGTTAACTGC 

CTATGACAATGGGAACCCCCCCAAGTCAGGTACCAGCTTGGTCAAGGTCAACGTCTTGGACTCCAATGACAATAG 

CCCTGCGTTTGCTGAGAGTTCACTGGCACTGGAAATCCAAGAAGATGCTGCACCTGGTACGCTTCTCATAAAACT 

GACCGCCACAGACCCTGACCAAGGCCCCAATGGGGAGGTGGAGTTCTTCCTCAGTAAGCACATGCCTCCAGAGGT 

GCTGGACACCTTCAGTATTGATGCCAAGACAGGCCAGGTCATTCTGCGTCGACCTCTAGACTATGAAAAGAACCC 

TGCCTACGAGGTGGATGTTCAGGCAAGGGACCTGGGTCCCAATCCTATCCCAGCCCATTGCAAAGTTCTCATCAA 

GGTTCTGGATGTCAATGACAACATCCCAAGCATCCACGTCACATGGGCCTCCCAGCCATCACTGGTGTCAGAAGC 

TCTTCCCAAGGACAGTTTTATTGCTCTTGTCATGGCAGATGACTTGGATTCAGGACACAATGGTTTGGTCCACTG 

CTGGCTGAGCCAAGAGCTGGGCCACTTCAGGCTGAAAAGAACTAATGGCAACACATACATGTTGCTAACCAATGC 

CACACTGGACAGAGAGCAGTGGCCCAAATATACCCTCACTCTGTTAGCCCAAGACCAAGGACTCCAGCCCTTATC 

AGCCAAGAAACAGCTCAGCATTCAGATCAGTGACATCAACGACAATGCACCTGTGTTTGAGAAAAGCAGGTATGA 

AGTCTCCACGCGGGAAAACAACTTACCCTCTCTTCACCTCATTACCATCAAGGCTCATGATGCAGACTTGGGCAT 

TAATGGAAAAGTCTCATACCGCATCCAGGACTCCCCAGTTGCTCACTTAGTAGCTATTGACTCCAACACAGGAGA 

GGTCACTGCTCAGAGGTCACTGAACTATGAAGAGATGGCCGGCTTTGAGTTCCAGGTGATCGCAGAGGACAGCGG 

GCAACCCATGCTTGCATCCAGTGTCTCTGTGTGGGTCAGCCTCTTGGATGCCAATGATAATGCCCCAGAGGTGGT 

CCAGCCTGTGCTCAGCGATGGAAAAGCCAGCCTCTCCGTGCTTGTGAATGCCTCCACAGGCCACCTGCTGGTGCC 

CATCGAGACTCCCAATGGCTTGGGCCCAGCGGGCACTGACACACCTCCACTGGCCACTCACAGCTCCCGGCCATT 

CCTTTTGACAACCATTGTGGCAAGAGATGCAGACTCGGGGGCAAATGGAGAGCCCCTCTACAGCATCCGCAATGG 

AAATGAAGCCCACCTCTTCATCCTCAACCCTCATACGGGGCAGCTGTTCGTCAATGTCACCAATGCCAGCAGCCT 

CATTGGGAGTGAGTGGGAGCTGGAGATAGTAGTAGAGGACCAGGGAAGCCCCCCCTTACAGACCCGAGCCCTGTT 

GAGGGTCATGTTTGTCACCAGTGTGGACCACCTGAGGGACTCAGCCCGCAAGCCTGGGGCCTTGAGCATGTCGAT 

GCTGACGGTGATCTGCCTGGCTGTACTGTTGGGCATCTTCGGGTTGATCCTGGCTTTGTTCATGTCCATCTGCCG 

GACAGAAAAGAAGGACAACAGGGCCTACAACTGTCGGGAGGCCGAGTCCACCTACCGCCAGCAGCCCAAGAGGCC 

CCAGAAACACATTCAGAAGGCAGACATCCACCTCGTGCCTGTGCTCAGGGGTCAGGCAGGTGAGCCTTGTGAAGT 

CGGGCAGTCCCACAAAGATGTGGACAAGGAGGCGATGATGGAAGCAGGCTGGGACCCCTGCCTGCAGGCCCCCTT 

CCACCTCACCCCGACCCTGTACAGGACGCTGCGTAATCAAGGCAACCAGGGAGCACCGGCGGAGAGCCGAGAGGT 

GCTGCAAGACACGGTCAACCTCCTTTTCAACCATCCCAGGCAGAGGAATGCCTCCCGGGAGAACCTGAACCTTCC 

CGAGCCCCAGCCTGCCACAGGCCAGCCACGTTCCAGGCCTCTGAAGGTTGCAGGCAGCCCCACAGGGAGGCTGGC 

TGGAGACCAGGGCAGTGAGGAAGCCCCACAGAGGCCACCAGCCTCCTCTGCAACCCTGAGACGGCAGCGACATCT 

CAATGGCAAAGTGTCCCCTGAGAAAGAATCAGGGCCCCGTCAGATCCTGCGGAGCCTGGTCCGGCTGTCTGTGGC 

TGCCTTCGCCGAGCGGAACCCCGTGGAGGAGCTCACTGTGGATTCTCCTCCTGTTCAGCAAATCTCCCAGCTGCT 

GTCCTTGCTGCATCAGGGCCAATTCCAGCCCAAACCAAACCACCGAGGAAATAAGTACTTGGCCAAGCCAGGAGG 

CAGCAGGAGTGCAATCCCAGACACAGATGGCCCAAGTGCAAGGGCTGGAGGCCAGACAGACCCAGAACAGGAGGA 

AGGGCCTTTGGATCCTGAAGAGGACCTCTCTGTGAAGCAACTGCTAGAAGAAGAGCTGTCAAGTCTGCTGGACCC 

CAGCACAGGTCTGGCCCTGGACCGGCTGAGCGCCCCTGACCCGGCCTGGATGGCGAGACTCTCTTTGCCCCTCAC 

CACCAACTACCGTGACAATGTGATCTCCCCGGATGCTGCAGCCACGGAGGAGCCGAGGACCTTCCAGACGTTCGG 

CAAGGCAGAGGCACCAGAGCTGAGCCCAACAGGCACGAGGCTGGCCAGCACCTTTGTCTCGGAGATGAGCTCACT 

GCTGGAGATGCTGCTGGAACAGCGCTCCAGCATGCCCGTGGAGGCCGCCTCCGAGGCGCTGCGGCGGCTCTCGGT 

CTGCGGGAGGACCCTCAGTTTAGACTTGGCCACCAGTGCAGCCTCAGGCATGAAAGTGCAAGGGGACCCAGGTGG 

AAAGACGGGGACTGAGGGCAAGAGCAGAGGCAGCAGCAGCAGCAGCAGGTGCCTGTGAACATACCTCAGACGCCT 

CTGGATCCAAGAACCAGGGGCCTGAGGATCTGTGGACAAGAGCTGGTTTCTAAAATCTTGTAACTCACTAGCTAG 

CGGCGGCCTGAGAACTTTAGGGTGACTGATGCTACCCCCACAGAGGAGGCAAGAGCCCCAGGACTAACAGCTGAC 

TGACCAAAGCAGCCCCTTGTAAGCAGCTCTGAGTCTTTTGGAGGACAGGGACGGTTTGTGGCTGAGATAAGTGTT 

TCCTGGCAAAACATATGTGGAGCACAAAGGGTCAGTCCTCTGGCAGAACAGATGCCACGGAGTATCACAGGCAGG 

AAAGGGTGGCCTTCTTGGGTAGCAGGAGTCAGGGGGCTGTACCCTGGGGGTGCCAGGAAATGCTCTCTGACCTAT 
CAATTU^GGAAAAGCAGTAAAAAAAAAAAAAAAAAAAA 
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FIGURE 394 

MMQLLQLLLGLLGPGGYLFLLGDCQEVTTLTVKYQVSEEVPSGTVIGKLSQELGREERRRQAG 
AAFQVLQLPQALPIQVDSEEGLLSTGRRLDREQLCRQWDPCLVSFDVLATGDLALIHVEIQVL 
DINDHQPRFPKGEQELEISESASLRTRIPLDRALDPDTGPNTLHTYTLSPSEHFALDVIVGPD 
ETKHAELIVVKELDREIHSFFDLVLTAYDNGNPPKSGTSLVKVNVLDSNDNSPAFAESSLALE 
IQEDAAPGTLLIKLTATDPDQGPNGEVEFFLSKHMPPEVLDTFSIDAKTGQVILRRPLDYEKN 
PAYEVDVQARDLGPNPIPAHCKVLIKVLDVNDNIPSIHVTWASQPSLVSEALPKDSFIALVMA 
DDLDSGHNGLVHCWLSQELGHFRLKRTNGNTYMLLTNATLDREQWPKYTLTLLAQDQGLQPLS 
AKKQLSIQISDINDNAPVFEKSRYEVSTRENNLPSLHLITIKAHDADLGINGKVSYRIQDSPV 
AHLVAIDSNTGEVTAQRSLNYEEMAGFEFQVIAEDSGQPMLASSVSVWVSLLDANDNAPEVVQ
PVLSDGKASLSVLVNASTGHLLVPIETPNGLGPAGTDTPPLATHSSRPFLLTTIVARDADSGA 
NGEPLYSIRNGNEAHLFILNPHTGQLFVNVTNASSLIGSEWELEIVVEDQGSPPLQTRALLRV 
MFVTSVDHLRDSARKPGALSMSMLTVICLAVLLGIFGLILALFMSICRTEKKDNRAYNCREAE
STYRQQPKRPQKHIQKADIHLVPVLRGQAGEPCEVGQSHKDVDKEAMMEAGWDPCLQAPFHLT 
PTLYRTLRNQGNQGAPAESREVLQDTVNLLFNHPRQRNASRENLNLPEPQPATGQPRSRPLKV 
AGSPTGRLAGDQGSEEAPQRPPASSATLRRQRHLNGKVSPEKESGPRQILRSLVRLSVAAFAE 
RNPVEELTVDSPPVQQISQLLSLLHQGQFQPKPNHRGNKYLAKPGGSRSAIPDTDGPSARAGG 
QTDPEQEEGPLDPEEDLSVKQLLEEELSSLLDPSTGLALDRLSAPDPAWMARLSLPLTTNYRD 
NVISPDAAATEEPRTFQTFGKAEAPELSPTGTRLASTFVSEMSSLLEMLLEQRSSMPVEAASE 
ALRRLSVCGRTLSLDLATSAASGMKVQGDPGGKTGTEGKSRGSSSSSRCL 

Important features: 
Signal peptide: 
amino acids 1-13 
Transmembrane domain:
amino acids 719-739 
N-glycosylation site. 

amino acids 415-418, 582-585, 659-662, 662-665 and 857-860
Cadherins extracellular repeated domain signature. 

amino acids 123-133, 232-242, 340-350, 448-458 and 553-563 
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FIGURE 395 

CCCAGGCTCTAGTGCAGGAGGAGAAGGAGGAGGAGCAGGAGGTGGAGATTCCCAGTTAAAAGG 
CTCCAGAATCGTGTACCAGGCAGAGAACTGAAGTACTGGGGCCTCCTCCACTGGGTCCGAATC 
AGTAGGTGACCCCGCCCCTGGATTCTGGAAGACCTCACCATGGGACGCCCCCGACCTCGTGCG 
GCCAAGACGTGGATGTTCCTGCTCTTGCTGGGGGGAGCCTGGGCAGGACACTCCAGGGCACAG 
GAGGACAAGGTGCTGGGGGGTCATGAGTGCCAACCCCATTCGCAGCCTTGGCAGGCGGCCTTG 
TTCCAGGGCCAGCAACTACTCTGTGGCGGTGTCCTTGTAGGTGGCAACTGGGTCCTTACAGCT 
GCCCACTGTAAAAAACCGAAATACACAGTACGCCTGGGAGACCACAGCCTACAGAATAAAGAT 
GGCCCAGAGCAAGAAATACCTGTGGTTCAGTCCATCCCACACCCCTGCTACAACAGCAGCGAT 
GTGGAGGACCACAACCATGATCTGATGCTTCTTCAACTGCGTGACCAGGCATCCCTGGGGTCC 
AAAGTGAAGCCCATCAGCCTGGCAGATCATTGCACCCAGCCTGGCCAGAAGTGCACCGTCTCA 
GGCTGGGGCACTGTCACCAGTCCCCGAGAGAATTTTCCTGACACTCTCAACTGTGCAGAAGTA 
AAAATCTTTCCCCAGAAGAAGTGTGAGGATGCTTACCCGGGGCAGATCACAGATGGCATGGTC
TGTGCAGGCAGCAGCAAAGGGGCTGACACGTGCCAGGGCGATTCTGGAGGCCCCCTGGTGTGT 
GATGGTGCACTCCAGGGCATCACATCCTGGGGCTCAGACCCCTGTGGGAGGTCCGACAAACCT 
GGCGTCTATACCAACATCTGCCGCTACCTGGACTGGATCAAGAAGATCATAGGCAGCAAGGGC 
SSaTTCTAGGATAAGCACTAGATCTCCCTTAATAAACTCACAACTCTCTGGTTC 
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FIGURE 396 

MGRPRPRAAKTWMFLLLLGGAWAGHSRAQEDKVLGGHECQPHSQPWQAALFQGQQLLCGGVLV 
GGNWVLTAAHCKKPKYTVRLGDHSLQNKDGPEQEIPVVQSIPHPCYNSSDVEDHNHDLMLLQL 
RDQASLGSKVKPISLADHCTQPGQKCTVSGWGTVTSPRENFPDTLNCAEVKIFPQKKCEDAYP 
GQITDGMVCAGSSKGADTCQGDSGGPLVCDGALQGITSWGSDPCGRSDKPGVYTNICRYLDWI 
KKIIGSKG 

Important Features: 
Signal peptide: 

amino acids 1-23

Transmembrane domain:

amino acids 51-71 

N-glycosylation site. 

amino acids 110-113 

Serine proteases, trypsin family, histidine active site. 

amino acids 69-74 and 207-217 

Tyrosine kinase phosphorylation site. 

amino acids 182-188 



Kringle domain proteins motif 

amino acids 205-217 
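
The feature ranges listed with each figure appear to be 1-based and inclusive (for example, "amino acids 1-23" for the signal peptide of FIGURE 396). As an illustrative sketch only, and not part of the original disclosure, the following Python snippet shows how such a range maps onto a sequence string. The helper name `feature_slice` and the variable names are hypothetical; the sequence is the first printed line of FIGURE 396.

```python
def feature_slice(sequence: str, start: int, end: int) -> str:
    """Return residues start..end of a sequence, using 1-based inclusive
    coordinates as assumed for the 'amino acids M-N' annotations."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError(f"range {start}-{end} is outside 1-{len(sequence)}")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


# First printed line of the FIGURE 396 amino acid sequence (63 residues).
FIG396_N_TERMINUS = (
    "MGRPRPRAAKTWMFLLLLGGAWAGHSRAQEDKVLGGHECQPHSQPWQAALFQGQQLLCGGVLV"
)

# "Signal peptide: amino acids 1-23"
signal_peptide = feature_slice(FIG396_N_TERMINUS, 1, 23)
print(signal_peptide)
```

The same helper could be applied to any other annotated range, provided the full sequence for that figure is supplied as one string with line breaks removed.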
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FIGURE 397 

GGCGGCTGCTGAGCTGCCTTGAGGTGCAGTGTTGGGGATCCAGAGCCATGTCGGACCTGCTAC 

TACTGGGCCTGATTGGGGGCCTGACTCTCTTACTGCTGCTGACGCTGCTGGCCTTTGCCGGGT 

ACTCAGGGCTACTGGCTGGGGTGGAAGTGAGTGCTGGGTCACCCCCCATCCGCAACGTCACTG 

TGGCCTACAAGTTCCACATGGGGCTCTATGGTGAGACTGGGCGGCTTTTCACTGAGAGCTGCA 

GCATCTCTCCCAAGCTCCGCTCCATCGCTGTCTACTATGACAACCCCCACATGGTGCCCCCTG 

ATAAGTGCCGATGTGCCGTGGGCAGCATCCTGAGTGAAGGTGAGGAATCGCCCTCCCCTGAGC 

TCATCGACCTCTACCAGAAATTTGGCTTCAAGGTGTTCTCCTTCCCGGCACCCAGCCATGTGG 

TGACAGCCACCTTCCCCTACACCACCATTCTGTCCATCTGGCTGGCTACCCGCCGTGTCCATC 

CTGCCTTGGACACCTACATCAAGGAGCGGAAGCTGTGTGCCTATCCTCGGCTGGAGATCTACC 

AGGAAGACCAGATCCATTTCATGTGCCCACTGGCACGGCAGGGAGACTTCTATGTGCCTGAGA 

TGAAGGAGACAGAGTGGAAATGGCGGGGGCTTGTGGAGGCCATTGACACCCAGGTGGATGGCA 

CAGGAGCTGACACAATGAGTGACACGAGTTCTGTAAGCTTGGAAGTGAGCCCTGGCAGCCGGG 

AGACTTCAGCTGCCACACTGTCACCTGGGGCGAGCAGCCGTGGCTGGGATGACGGTGACACCC 

GCAGCGAGCACAGCTACAGCGAGTCAGGTGCCAGCGGCTCCTCTTTTGAGGAGCTGGACTTGG 

AGGGCGAGGGGCCCTTAGGGGAGTCACGGCTGGACCCTGGGACTGAGCCCCTGGGGACTACCA 

AGTGGCTCTGGGAGCCCACTGCCCCTGAGAAGGGCAAGGAGXaACCCATGGCCTGCACCCTCC 

TGCAGTGCAGTTGCTGAGGAACTGAGCAGACTCTCCAGCAGACTCTCCAGCCCTCTTCCTCCT 

TCCTCTGGGGGAGGAGGGGTTCCTGAGGGACCTGACTTCCCCTGCTCCAGGCCTCTTGCTAAG 

CCTTCTCCTCACTGCCCTTTAGGCTCCCAGGGCCAGAGGAGCCAGGGACTATTTTCTGCACCA 

GCCCCCAGGGCTGCCGCCCCTGTTGTGTCTTTTTTTCAGACTCACAGTGGAGCTTCCAGGACC 

CAGAATAAAGCCAATGATTTACTTGTTTCACCTGGAAAAAAAAAAAAAAAAA 
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FIGURE 398 

MSDLLLLGLIGGLTLLLLLTLLAFAGYSGLLAGVEVSAGSPPIRNVTVAYKFHMGLYGETGRL 
FTESCSISPKLRSIAVYYDNPHMVPPDKCRCAVGSILSEGEESPSPELIDLYQKFGFKVFSFP
APSHVVTATFPYTTILSIWLATRRVHPALDTYIKERKLCAYPRLEIYQEDQIHFMCPLARQGD
FYVPEMKETEWKWRGLVEAIDTQVDGTGADTMSDTSSVSLEVSPGSRETSAATLSPGASSRGW
DDGDTRSEHSYSESGASGSSFEELDLEGEGPLGESRLDPGTEPLGTTKWLWEPTAPEKGKE
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FIGURE 399

GGACGAGGGCAGATCTCGTTCTGGGGCAAGCCGTTGACACTCGCTCCCTGCCACCGCCCGGGC 

TCCGTGCCGCCAAGTTTTCATTTTCCACCTTCTCTGCCTCCAGTCCCCCAGCCCCTGGCCGAG 

AGAAGGGTCTTACCGGCCGGGATTGCTGGAAACACCAAGAGGTGGTTTTTGTTTTTTAAAACT 

TCTGTTTCTTGGGAGGGGGTGTGGCGGGGCAGGATGAGCAACTCCGTTCCTCTGCTCTGTTTC 

TGGAGCCTCTGCTATTGCTTTGCTGCGGGGAGCCCCGTACCTTTTGGTCCAGAGGGACGGCTG 

GAAGATAAGCTCCACAAACCCAAAGCTACACAGACTGAGGTCAAACCATCTGTGAGGTTTAAC 

CTCCGCACCTCCAAGGACCCAGAGCATGAAGGATGCTACCTCTCCGTCGGCCACAGCCAGCCC 

TTAGAAGACTGCAGTTTCAACATGACAGCTAAAACCTTTTTCATCATTCACGGATGGACGATG 

AGCGGTATCTTTGAAAACTGGCTGCACAAACTCGTGTCAGCCCTGCACACAAGAGAGAAAGAC 

GCCAATGTAGTTGTGGTTGACTGGCTCCCCCTGGCCCACCAGCTTTACACGGATGCGGTCAAT 

AATACCAGGGTGGTGGGACACAGCATTGCCAGGATGCTCGACTGGCTGCAGGAGAAGGACGAT 

TTTTCTCTCGGGAATGTCCACTTGATCGGCTACAGCCTCGGAGCGCACGTGGCCGGGTATGCA 

GGCAACTTCGTGAAAGGAACGGTGGGCCGAATCACAGGTTTGGATCCTGCCGGGCCCATGTTT 

GAAGGGGCCGACATCCACAAGAGGCTCTCTCCGGACGATGCAGATTTTGTGGATGTCCTCCAC 

ACCTACACGCGTTCCTTCGGCTTGAGCATTGGTATTCAGATGCCTGTGGGCCACATTGACATC 

TACCCCAATGGGGGTGACTTCCAGCCAGGCTGTGGACTCAACGATGTCTTGGGATCAATTGCA 

TATGGAACAATCACAGAGGTGGTAAAATGTGAGCATGAGCGAGCCGTCCACCTCTTTGTTGAC 

TCTCTGGTGAATCAGGACAAGCCGAGTTTTGCCTTCCAGTGCACTGACTCCAATCGCTTCAAA 

AAGGGGATCTGTCTGAGCTGCCGCAAGAACCGTTGTAATAGCATTGGCTACAATGCCAAGAAA 

ATGAGGAACAAGAGGAACAGCAAAATGTACCTAAAAACCCGGGCAGGCATGCCTTTCAGAGGT 

AACCTTCAGTCCCTGGAGTGTCCCTS&GGAAGGCCCTTAATACCTCCTTCTTAATACCATGCT 

GCAGAGCAGGGCACATCCTAGCCCAGGAGAAGTGGCCAGCACAATCCAATCAAATCGTTGCAA 

ATCAGATTACACTGTGCATGTCCTAGGAAAGGGAATCTTTACAAAATAAACAGTGTGGACCCC 

TAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 400 

MSNSVPLLCFWSLCYCFAAGSPVPFGPEGRLEDKLHKPKATQTEVKPSVRFNLRTSKDPEHEG 
CYLSVGHSQPLEDCSFNMTAKTFFIIHGWTMSGIFENWLHKLVSALHTREKDANVVVVDWLPL
AHQLYTDAVNNTRVVGHSIARMLDWLQEKDDFSLGNVHLIGYSLGAHVAGYAGNFVKGTVGRI
TGLDPAGPMFEGADIHKRLSPDDADFVDVLHTYTRSFGLSIGIQMPVGHIDIYPNGGDFQPGC
GLNDVLGSIAYGTITEVVKCEHERAVHLFVDSLVNQDKPSFAFQCTDSNRFKKGICLSCRKNR
CNSIGYNAKKMRNKRNSKMYLKTRAGMPFRGNLQSLECP

Important features:
Signal peptide: 

amino acids 1-16 

Lipases, serine active site. 

amino acids 163-172 

N-glycosylation sites. 

amino acids 80-83 and 136-139 



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FIGURE 401

CTTCCCAGCCCTGTGCCCCAAAGCACCTGGAGCATATAGCCTTGCAGAACTTCTACTTGCCTG 

CCTCCCTGCCTCTGGCCATGGCCTGCCGGTGCCTCAGCTTCCTTCTGATGGGGACCTTCCTGT 

CAGTTTCCCAGACAGTCCTGGCCCAGCTGGATGCACTGCTGGTCTTCCCAGGCCAAGTdGCTC 

AACTCTCCTGCACGCTCAGCCCCCAGCACGTCACCATCAGGGACTACGGTGTGTCCTGGTACC 

AGCAGCGGGCAGGCAGTGCCCCTCGATATCTCCTCTACTACCGCTCGGAGGAGGATCACCACC 

GGCCTGCTGACATCCCCGATCGATTCTCGGCAGCCAAGGATGAGGCCCACAATGCCTGTGTCC 

TCACCATTAGTCCCGTGCAGCCTGAAGACGACGCGGATTACTACTGCTCTGTTGGCTACGGCT 

TTAGTCCCTAGGGGTGGGGTGTGAGATGGGTGCCTCCCCTCTGCCTCCCATTTCTGCCCCTGA 

CCTTGGGTCCCTTTTAAACTTTCTCTGAGCCTTGCTTCCCCTCTGTAAAATGGGTTAATAATA 
TTCAACATGTCAACAAC 
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FIGURE 402 

MACRCLSFLLMGTFLSVSQTVLAQLDALLVFPGQVAQLSCTLSPQHVTIRDYGVSWYQQRAGS 
APRYLLYYRSEEDHHRPADIPDRFSAAKDEAHNACVLTISPVQPEDDADYYCSVGYGFSP 
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FIGURE 403 

CGCGCCGGGCGCAGGGAGCTGAGTGGACGGCTCGAGACGGCGGCGCGTGCAGCAGCTCCAGAAAGCAGCGAGTTG 

GCAGAGCAGGGCTGCATTTCXAGCAGGAGCTGCGAGCACAGTGCTGGCTCACAACAAGATGCTCAAGGTGTCAGC 

CGTACTGTGTGTGTGTGOVGCCGCTTGGTGCAGTCAGTCTCTCGCAGCTGCCGCGGCGGTGGCTGCAGCCGGGGG 

GCGGTCGGACGGCGGTAATTTTCTGGATGATAAACAATGGCTCACCACAATCTCTCAGTATGACAAGGAAGTCGG 

ACAGTGGAACAAATTCCGAGACGAAGTAGAGGATGATTATTTCCGCACTTGGAGTCCAGGAAAACCCTTCGATCA 

GGCTTTAGATCCAGCTAAGGATCCATGCTTAAAGATGAAATGTAGTCGCCATAAAGTATGCATTGCTCAAGATTC

TCAGACTGCAGTCTGCATTAGTCACCGGAGGCTTACACACAGGATGAAAGAAGCAGGAGTAGACCATAGGCAGTG 

GAGGGGTCCCATATTATCCACCTGCAAGCAGTGCCCAGTGGTCTATCCCAGCCCTGTTTGTGGTTCAGATGGTCA 

TACCTACTCTTTTCAGTGCAAACTAGAATATCAGGCATGTGTCTTAGGAAAACAGATCTCAGTCAAATGTGAAGG 

ACATTGCCCATGTCCTTCAGATAAGCCCACCAGTACAAGCAGAAATGTTAAGAGAGCATGCAGTGACCTGGAGTT 

CAGGGAAGTGGCAAACAGATTGCGGGACTGGTTCAAGGCCCTTCATGAAAGTGGAAGTCAAAACAAGAAGACAAA 

AACATTGCTGAGGCCTGAGAGAAGCAGATTCGATACCAGCATCTTGCCAATTTGCAAGGACTCACTTGGCTGGAT 

GTTTAACAGACTTGATACAAACTATGACCTGCTATTGGACCAGTCAGAGCTCAGAAGCATTTACCTTGATAAGAA

TGAACAGTGTACCAAGGCATTCTTCAATTCTTGTGACACATACAAGGACAGTTTAATATCTAATAATGAGTGGTG 

CTACTGCTTCCAGAGACAGCAAGACCCACCTTGCCAGACTGAGCTCAGCAATATTCAGAAGCGGCAAGGGGTAAA 

GAAGCTCCTAGGACAGTATATCCCCCTGTGTGATGAAGATGGTTACTACAAGCCAACACAATGTCATGGCAGTGT 

TGGACAGTGCTGGTGTGTTGACAGATATGGAAATGAAGTCATGGGATCCAGAATAAATGGTGTTGCAGATTGTGC 

TATAGATTTTGAGATCTCCGGAGATTTTGCTAGTGGCGATTTTCATGAATGGACTGATGATGAGGATGATGAAGA 

CGATATTATGAATGATGAAGATGAAATTGAAGATGATGATGAAGATGAAGGGGATGATGATGATGGTGGTGATGA

CCATGATGTATACATTTGATTGATGACAGTTGAAATCAATAAATTCTACATTTCTAATATTTACAAAAATGATAG 

CCTATTTAAAATTATCTTCTTCCCCAATAACAAAATGATTCTAAACCTCACATATATTTTGTATAATTATTTGAA 

AAATTGCAGCTAAAGTTATAGAACTTTATGTTTAAATAAGAATCATTTGCTTTGAGTTTTTATATTCCTTACACA 

AAAAGAAAATACATATGCAGTCTAGTCAGACAAAATAAAGTTTTGAAGTGCTACTATAATAAATTTTTCACGAGA 

ACAAACTTTGTAAATCTTCCATAAGCAAAATGACAGCTAGTGCTTGGGATCGTACATGTTAATTTTTTGAAAGAT 

AATTCTAAGTGAAATTTAAAATAAATAAATTTTTAATGACCTGGGTCTTAAGGATTTAGGAAAAATATGCATGCT

TTAATTGCATTTCCAAAGTAGCATCTTGCTAGACCTAGAtGAGTCAGGATAACAGAGAGATACCACATGACTCCA 
AAAAAAAAAAAAAA 
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FIGURE 404 

MLKVSAVLCVCAAAWCSQSLAAAAAVAAAGGRSDGGNFLDDKQWLTTISQYDKEVGQWNKFRD 
EVEDDYFRTWSPGKPFDQALDPAKDPCLKMKCSRHKVCIAQDSQTAVCISHRRLTHRMKEAGV
DHRQWRGPILSTCKQCPVVYPSPVCGSDGHTYSFQCKLEYQACVLGKQISVKCEGHCPCPSDK
PTSTSRNVKRACSDLEFREVANRLRDWFKALHESGSQNKKTKTLLRPERSRFDTSILPICKDS 
LGWMFNRLDTNYDLLLDQSELRSIYLDKNEQCTKAFFNSCDTYKDSLISNNEWCYCFQRQQDP 
PCQTELSNIQKRQGVKKLLGQYIPLCDEDGYYKPTQCHGSVGQCWCVDRYGNEVMGSRINGVA 
DCAIDFEISGDFASGDFHEWTDDEDDEDDIMNDEDEIEDDDEDEGDDDDGGDDHDVYI 

Important features: 
Signal peptide: 

amino acids 1-16 

Leucine zipper pattern. 

amino acids 246-267 

N-myristoylation sites.

amino acids 357-362, 371-376 and 376-381 



Thyroglobulin type-1 repeat proteins 

amino acids 353-365 and 339-352 
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FIGURE 405

^I'^^'^^^^^^^^CTCCGAAGAGGCCCTCACAGTCCATGCCCCTTTCCCTGCAGCcS^^^ 

3«S^^S*^^^^^^^°^^^"^*ccacttctgcctctactggaaccgacatgSggS^^ 
tatggc»agcgtgacttcttgctgagtgacaaagcctctagcctcctctgcttccagS^ 



n»»^™^^*^'^'^^^^'^^^^^^'^^<^T^CC^<^CGGCCGCTCACAATGCCTCGGTGGACATGTG^^ 

aamgggacctccagctgctcagccagttcctgaagcatccccagaaggcctcaaggLgccct^^^ 

r^rrf^^^^«^f''^'=''^''°^^^°^*^^^<^^CTGACCTCTGTGAGATTCATGGG^ 



ttgagcagcccggggcattggagcagtgctgggtgtgagaccgtcaggagagaaacSSaStcct^^ 

^^^^^I^gacctactttgcagtgctgatggtctcctcggtggaggtSSScgS 

ctcctctcctacgtgggctgtgtcgtctctgccctggcctgccttgtcaccattgccgcc?^^^ 

gtgcccctgccgtgcaggaggaaacctcgggactacaccatcaaggtgcacatgSSS 

ctgctggacacgagcttcctgctcagcgagccggtggccctgacaggctctSgSS 

^I^^^^^^^^^^^*^^^^^ctcacctgcctttcctggatgggcctcgagStSaaScS^^ 

oI^^^^^^^'^^'^^^^^t^'^ccctggctacctactcaagctgagcgccmgggctg^^ 

ctggtgacgctggtggccctggtggatgtggacaactatggccccatcatcttggctgtgcmag^ 

ggcgtcatctacccttccatgtgctggatccgggactccctggtcagctStSccSc^^^ 

"^SI^^r^'^'f'^^^^^TGGCCATGCTAGCCACCATGGTGGTG^^^^ 
S^J°Sr''"''^*^^'^^'^^'^^''^''^^^^'=CCTCAGCCTGGTCCTTGGCCTGCCCTGGGOT^ 

JJT^^ttctggcaccttccagcttgtcgtcctctaccttttcagcatcatcacctStSaag^^^ 
ttcatctggtactggtccatgcggctgcaggcccggggtggcccctcccctctgaaSgcaactc 

T^o™^^^^^*^'^'=^^^<^^^^^^^°^CTGTGGCCCCCGAGcEi^GCCCAGCCCCMScS 
JJr^^^^''^'^'=''''^*=^^^^''^^'^^'^<^°GCCGTTGCCATGGTGGACGGACTcS^^ 

S^^^^^'^<^°actactcggctctcactcagctcccacgggactcagaagtgcgccgccatStg^^ 
gtggtttccaggagctgcctggtgtctgctgtaaatgtttgtctactgcacSSScgS?SS^^ 

^S^r^^^^^^^'^^T^C^TG^^GCTGGGCTAGGTCCCTCTGTCaVTCTGScCT^^^ 



r^^»»™?^'^^^^'^'^^^'^^^^^^^CT°<^^^CTTCCTGCAGGAGTCAGCGTTCAATCTTGACCTTGAAGAT 
^^S^^^^^'^''^^^^^^^<='^T^<=^'"CTTTTGTCTTTTGATATTAAAAAGAAGTA^^ 
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FIGURE 406 

MTPQSLLQTTLFLLSLLFLVC3GAHGRGHREDFRFCSQRNQTHRSSLHYKPTPDLRISIENSEE 
ALTVHAPFPAAHPASRSFPDPRGLYHFCLYWNRHAGRLHLLYGKRDFLLSDKASSLLCFQHQE 
ESLAQGPPLLATSVTSWWSPQNISLPSAASFTFSFHSPPHTAAHNASVDMCELKRDLQLLSQF 
LKHPQECASRRPSAAPASQQLQSLESKLTSVRFMGDMVSFEEDRINATVWKLQPTAGLQDLHIH 
SRQEEEQSEIMEYSVLLPRTLFQRTKGRSGEAEKRLLLVDFSSQALFQDKNSSQVLGEKVLGI 
VVQNTKVANLTEPWLTFQHQLQPKNVTLQCVFWVEDPTLSSPGHWSSAGCETVRRETQTSCF 
CNHLTYFAVLMVSSVEVDAVHKHYLSLLSYVGCWSALACLVTIAAYLCSRVPLPCRRKPRDY 
TIKVHMNLLLAVFLLDTSFLLSEPVALTGSEAGCRASAIFLHFSLLTCLSWMGLEGYNLYRLV 
VEVFGTYVPGYLLKLSAMGWGFPIFLVTLVALVDVDNYGPIILAVHRTPEGVIYPSMCWIRDS 
LVSYITNLGLFSLVFLFNMAMLATMVVQILRLRPHTQKWSHVLTLLGLSLVLGLPWALIFFSF 
ASGTFQLWLYLFSIITSFQGFLIFIWYWSMRLQARGGPSPLKSNSDSARLPISSGSTSSSRI 

Important features: 

Signal peptide: 

amino acids 1-25 

Putative transmembrane domains:

amino acids 382-398, 402-420, 445-468, 473-491, 519-537, 568-590 
and 634-657 

Microbodies C-terminal targeting signal.

amino acids 691-693 

cAMP- and cGMP-dependent protein kinase phosphorylation sites.

amino acids 198-201 and 370-373 
N-glycosylation sites.

amino acids 39-42, 148-151, 171-174, 234-237, 303-306, 324-327
and 341-344

G-protein coupled receptors family 2 proteins

amino acids 475-504 



WO 01/40466 PCT/US00/32678



407/550 



FIGURE 407 

TTGTGACTAAAAGCTGGCCTAGCAGGCCAGGGAGTGCAGCTGCAGGCGTGGGGGTGGCAGGAG 

CCGCAGAGCCAGAGCAGACAGCCGAGAAACAGGTGGACAGTGTGAAAGAACCAGTGGTCTCGC 

TCTGTTGCCCAGGCTAGAGTGTACTGGCGTGATCATAGCTCACTGCAGCCTCAGACTCCTGGA 

CTTGAGAAATCCTCCTGCCTTAGCCTCCTGCATATCTGGGACTCCAGGGGTGCACTCAAGCCC 

TGTTTCTTCTCCTTCTGTGAGTGGACCACGGAGGCTGGTGAGCTGCCTGTCATCCCAAAGCTC 

AGCTCTGAGCCAGAGTGGTGGTGGCTCCACCTCTGCCGCCGGCATAGAAGCCAGGAGCAGGGC 

TCTCAGAAGGCGGTGGTGCCCAGCTGGGATCATGTTGTTGGCCCTGGTCTGTCTGCTCAGCTG

CCTGCTACCCTCCAGTGAGGCCAAGCTCTACGGTCGTTGTGAACTGGCCAGAGTGCTACATGA 

CTTCGGGCTGGACGGATACCGGGGATACAGCCTGGCTGACTGGGTCTGCCTTGCTTATTTCAC 

AAGCGGTTTCAACGCAGCTGCTTTGGACTACGAGGCTGATGGGAGCACCAACAACGGGATCTT 

CCAGATCAACAGCCGGAGGTGGTGCAGCAACCTCACCCCGAACGTCCCCAACGTGTGCCGGAT

GTACTGCTCAGATTTGTTGAATCCTAATCTCAAGGATACCGTTATCTGTGCCATGAAGATAAC 

CCAAGAGCCTCAGGGTCTGGGTTACTGGGAGGCCTGGAGGCATCACTGCCAGGGAAAAGACCT 

CACTGAATGGGTGGATGGCTGTGACTTC3EASGATGGACGGAACCATGCACAGCAGGCTGGGAA 

ATGTGGTTTGGTTCCTGACCTAGGCTTGGGAAGACAAGCCAGCGAATAAAGGATGGTTGAACG 
TGAAA 
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FIGURE 408 



MLLALVCLLSCLLPSSEAKLYGRCELARVLHDFGLDGYRGYSLADWVCLAYFTSGFNAAALDY 
EADGSTNNGIFQINSRRWCSNLTPNVPNVCRMYCSDLLNPNLKDTVICAMKITQEPQGLGYWE 
AWRHHCQGKDLTEWVDGCDF 

Important features:
Signal peptide:

amino acids 1-18

N-myristoylation site. 

amino acids 67-72 

Homologous region to Alpha-lactalbumin / lysozyme C proteins.

amino acids 34-58 (catalytic domain), 111-132 and 66-107



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FIGURE 409

CAGACTCCAGATTTCCCTGTCAACCACGAGGAGTCCAGAGAGGAAACGCGGAGCGGAGACAACAGTACCTGACGC 
CTCTTTCAGCCCGGGATCGCCCCAGCAGGGATGGGCGACAAGATCTGGCTGCCCTTCCCCGTGCTCCTTCTGGCC 
GCTCTGCCTCCGGTGCTGCTGCCTGGGGCGGCCGGCTTCACACCTTCCCTCGATAGCGACTTCACCTTTACCCTT 

cccgccggccagaaggagtgcttctaccagcccatgcccctgaaggcctcgctggaSSaSaS^ 

gatggagcaggattagatattgatttccatcttgcctctccagaaggo^ccttagtotttgS 

tcagatggagttcacactgtagagactgaagttggtgattacatgttctgctttgacaatS^ 

aaatatattactggcacagatatattggatatgaaactggaagacatcctggaatccatcaacagcatc^ 

tf^I^^^^'^^^^^^'CACATACAAATTCTGCTTAGAGCATTTGAAGCTCGTGATCGSACA^ 

^^^^''''''^'''''^'■'''''^^^''^^''^^^^^'^tagtggtcatggt 

»I™^^^^^^'^*^^*^^*^^^^'^^^*^^'™^TAGAACTTRAAACTCCAAACTAG 

^™^^*^^^^^'^*^^^'^^^'^^^^'^^^^tc^<^^ccattaatggtcttctccaaaatatttt 

AAAGTAGGAAACAGGTATAATTTTAATGTGAAAATTAAGTCTTCACTTTCTGTGCAAGTAATCCTGCT^TC^G 

ttgtacttaagtgtgtaacaggaatattttgcagaatataggtttaactgSgSgccSS 

in^^^^^^^^^^^^^^'TG^TGTCTTAGGTGATTTAAATAAATGAGTATTGGGCCT^^ 

agtctgtttttaacaggttctattacccagaacttttttgtaaatgcggcagttac»^SSctgtg^ 
tcagttttaagttataaatcacctgagaattacctaatgatggattgaataaatctttagactSa^^ 
cttttctctatttacatatgcatctctcctataatgtaaatagaataatagctttgaaaSStt^ 
agatttttataaccaaatacatttcagtgtaacatattagcagaaagcattStcSt^^ 

n^^»^«^''''^^''^^*=^^''^^^^^^CAflAGTTACACTTACTAAAATTA^ 

aaatgaagaatatagtttaaaagcttcctcctccatagggacacattttctctaacccttaactaaagtct^^ 

»n»^^''^"''^^^'^^*^T^S'rTTATTTTTAATAGTATCTGTCAAGTTAATASGT^ 

I^^™«?^''*^"^'^^^^^^^^T'^^^°ctgacttggataattcattattaccagcaSS^ 

TTGCTAAAATGATCTGGGCCTACCATAAATAAATATCTCCTTTTCTGAGCTCTAAGAATTATCAGA^ 

agaatttagaaaaacttgagaaaacctaatccaaaataaaattcacttaagtagaactatS^^ 

™JII^^^^®^'^'^°'^'''^''''^'''^^'^^'''^^^'^^'r"^CTGGTTTGAATTATTTTTGTAAGCAGGTAC^^^ 
/^TGTAAGCCCTACTGTAAGGTTTAGCACTGGGTGTACATATTTATTAAAAATTTTTATTATMCAACm 
^^If^''''''''''''^'^''^^''^^''''^™^"^^^^^T^^<^TAAGGATTAGAAACATAG^^^^ 
^^^^^'^^'^'''^'^'''^^^^'''^^'^^^^'^CAAAGTTTCTGCCATCTAGCTTTTTGAAGTCTATG^^^ 

tcaagtactagtaatttaacttcatcatgaatgaactataatttttaagttatgcccatttataaSSct^ 
gactacattgtgagttagaaacaaacttaaaatttggggtatagaacccctcaacaStS^^ 

tcttaggcgcctgggagaagagcagcttagatttccctactggcaaggtttttaaaaatgag^^ 
atgatcaattaccttaattggccaagaaaatgcttcaggtgtctaggggtatcctctc 

AGGTCAATAAGATCCTTGCCTATGAATACCCCTCCCrTTTGCGCTGTTAAATTTGCAATGASAGcS??^^ 

^^^ES^'^^^t'aataaagcagggtacagatataaactactgcatcttttctat 

CCTCTCCTGTATGGCTGTTACTGTACTGTACTCTCTGACTCCTTACCTAACAATGAATT^ 

TATTTGGAAAAGAATTTAGGAATACTAAGGACAATTATTTTTATAGACAAAGTAAAAAGACAGAT^^ 

»^^»J^^^r'"'^^"^^^^''TC^^TTTCTAAATTAAAGTTATGCTAAATTGAGTAAGCTGTT^^^^ 

^$^^SJ;S^^''^^^^'^'^^^'^^^™C^'^'^'''^'^^TACTACAATATTTAACTAAGGCCCA^^ 

CATAATGTAGCAGTTACCGTGTTCACCTCACACTAAGGCCTAGAGTTTGCTCTGATATGCATTTGGATGATTAAT 

gttatgctgttctttcatgtgaatgtcaagacatggagggtgtttgtaattttatggtaaStSc^^^ 

cacataatggtgtcttaaaattgacaaaaaatgagcacttacaattgtatgtctcctcaaatgaagattctSt 

gtgaaattttaaaagacattgattccgcatgtaaggatttttcatctgaagtacaataatgcacSSS^ 
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FIGURE 410 

MGDKIWLPFPVLLLAALPPVLLPGAAGFTPSLDSDFTFTLPAGQKECFYQPMPLKASLEIEYQ
VLDGAGLDIDFHLASPEGKTLVFEQRKSDGVHTVETEVGDYMFCFDNTFSTISEKVIFFELIL 
DNMGEQAQEQEDWKKYITGTDILDMKLEDILESINSIKSRLSKSGHIQILLRAFEARDRNIQE
SNFDRVNFWSMVNLVVMVVVSAIQVYMLKSLFEDKRKSRT 

Important features: 
Signal peptide: 

amino acids 1-23 

Transmembrane domain: 

amino acids 195-217 

N-myristoylation site. 

amino acids 43-48 



Tyrosine kinase phosphorylation site. 

amino acids 55-62 
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FIGURE 411 

CCCAGCTGAGGAGCCCTGCTCAAGACACGGTCACTGGATCTGAGAAACTTCCCAGGGGACCGCATTCCAGAGTCA 
GTGACTCTGTGAAGCACCCACATCTACCTCTTGCCACGTTCCCACGGGCTTGGGGGAAAGATGGTGGGGACCAAG 
GCCTGGGTGTTCTCCTTCCTGGTCCTGGAAGTCACATCTGTGTTGGGGAGACAGACGATGCTCACCCAGTCAGTA 
AGAAGAGTCCAGCCTGGGAAGAAGAACCCCAGCATCTTTGCCAAGCCTGCCGACACCCTGGAGAGCCCTGGTGAG 
TGGACAACATGGTTCAACATCGACTACCCAGGCGGGAAGGGCGACTATGAGCGGCTGGAGGCCATTCGCTTCTAC 
TATGGGGACCGTGTATGTGCCCGTCCCCTGCGGCTAGAGGCTCGGACCACTGACTGGACACCTGCGGGCAGCACT 
GGCCAGGTGGTCCATGGTAGTCCCCGTGAGGGTTTCTGGTGCCTCAACAGGGAGCAGCGGCCTGGCCAGAACTGC 
TCTAATTACACCGTACGCTTCCTCTGCCCACCAGGATCCCTGCGCCGAGACACAGAGCGCATCTGGAGCCCATGG 
TCTCCCTGGAGCAAGTGCTCAGCTGCCTGTGGTCAGACTGGGGTCCAGACTCGCACACGCATTTGCTTGGCAGAG 
ATGGTGTCGCTGTGCAGTGAGGCCAGCGAAGAGGGTCAGCACTGCATGGGCCAGGACTGTACAGCCTGTGACCTG 
ACCTGCCCAATGGGCCAGGTGAATGCTGACTGTGATGCCTGCATGTGCCAGGACTTCATGCTTCATGGGGCTGTC 
TCCCTTCCCGGAGGTGCCCCAGCCTCAGGGGCTGCTATCTACCTCCTGACCAAGACGCCGAAGCTGCTGACCCAG 
ACAGACAGTGATGGGAGATTCCGAATCCCTGGCTTGTGCCCTGATGGCAAAAGCATCCTGAAGATCACAAAGGTC 
AAGTTTGCCCCCATTGTACTCACAATGCCCAAGACTAGCCTGAAGGCAGCCACCATCAAGGCAGAGTTTGTGAGG 
GCAGAGACTCCATACATGGTGATGAACCCTGAGACAAAAGCACGGAGAGCTGGGCAGAGCGTGTCTCTGTGCTGT 
AAGGCCACAGGGAAGCCCAGGCCAGACAAGTATTTTTGGTATCATAATGACACATTGCTGGATCCTTCCCTCTAC 
AAGCATGAGAGCAAGCTGGTGCTGAGGAAACTGCAGCAGCACCAGGCTGGGGAGTACTTTTGCAAGGCCCAGAGT 
GATGCTGGGGCTGTGAAGTCCAAGGTTGCCCAGCTGATTGTCACAGCATCTGATGAGACTCCTTGCAACCCAGTT 
CCTGAGAGCTATCTTATCCGGCTGCCCCATGATTGCTTTCAGAATGCCACCAACTCCTTCTACTATGACGTGGGA 
CGCTGCCCTGTTAAGACTTGTGCAGGGCAGCAGGATAATGGGATCAGGTGCCGTGATGCTGTGCAGAACTGCTGT 
GGCATCTCCAAGACAGAGGAAAGGGAGATCCAGTGCAGTGGCTACACGCTACCCACCAAGGTGGCCAAGGAGTGC 
AGCTGCCAGCGGTGTACGGAAACTCGGAGCATCGTGCGGGGCCGTGTCAGTGCTGCTGACAATGGGGAGCCCATG 
CGCTTTGGCCATGTGTACATGGGGAACAGCCGTGTAAGCATGACTGGCTACAAGGGCACTTTCACCCTCCATGTC 
CCCCAGGACACTGAGAGGCTGGTGCTCACATTTGTGGACAGGCTGCAGAAGTTTGTCAACACCACCAAAGTGCTA 
CCTTTCAACAAGAAGGGGAGTGCCGTGTTCCATGAAATCAAGATGCTTCGTCGGAAAGAGCCCATCACTTTGGAA 
GCCATGGAGACCAACATCATCCCCCTGGGGGAA9TGGTTGGTGAAGACCCCATGGCTGAACTGGAGATTCCATCC 
AGGAGTTTCTACAGGCAGAATGGGGAGCCCTACATAGGAAAAGTGAAGGCCAGTGTGACCTTCCTGGATCCCCGG 
AATATTTCCACAGCCACAGCTGCCCAGACTGACCTGAACTTCATCAATGACGAAGGAGACACTTTCCCCCTTCGG 
ACGTATGGCATGTTCTCTGTGGACTTCAGAGATGAGGTCACCTCAGAGCCACTTAATGCTGGCAAAGTGAAGGTC 
CACCTTGACTCGACCCAGGTCAAGATGCCAGAGCACATATCCACAGTGAAACTCTGGTCACTCAATCCAGACACA 
GGGCTGTGGGAGGAGGAAGGTGATTTCAAATTTGAAAATCAAAGGAGGAACAAAAGAGAAGACAGAACCTTCCTG 
GTGGGCAACCTGGAGATTCGTGAGAGGAGGCTCTTTAACCTGGATGTTCCTGAAAGCAGGCGGTGCTTTGTTAAG 
GTGAGGGCCTACCGGAGTGAGAGGTTCTTGCCTAGTGAGCAGATCCAGGGGGTTGTGATCTCCGTGATTAACCTG 
GAGCCTAGAACTGGCTTCTTGTCCAACCCTAGGGCCTGGGGCCGCTTTGACAGTGTCATCACAGGCCCCAACGGG 
GCCTGTGTGCCTGCCTTCTGTGATGACCAGTCCCCTGATGCCTACTCTGCCTATGTCTTGGCAAGCCTGGCTGGG 
GAGGAACTGCAAGCAGTGGAGTCTTCTCCTAAATTCAACCCAAATGCAATTGGCGTCCCTCAGCCCTATCTCAAC 
AAGCTCAACTACCGTCGGACGGACCATGAGGATCCACGGGTTAAAAAGACAGCTTTCCAGATTAGCATGGCCAAG 
CCAAGGCCCAACTCAGCTGAGGAGAGCAATGGGCCCATCTATGCCTTTGAGAACCTCCGGGCATGTGAAGAGGCA 
CCACCCAGTGCAGCCCACTTCCGGTTCTACCAGATTGAGGGGGATCGATATGACTACAACACAGTCCCCTTCAAC 
GAAGATGACCCTATGAGCTGGACTGT^GACTATCTGGCATGGTGGCCAAAGCCGATGGAATTCAGGGCCTGCTAT 
ATCAAGGTGAAGATTGTGGGGCCACTGGAAGTGAATGTGCGATCCCGCAACATGGGGGGCACTCATCGGCGGACA 
GTGGGGAAGCTGTATGGAATCCGAGATGTGAGGAGCACTCGGGACAGGGACCAGCCCAATGTCTCAGCTGCCTGT 
CTGGAGTTCAAGTGCAGTGGGATGCTCTATGATCAGGACCGTGTGGACCGCACCCTGGTGAAGGTCATCCCCCAG 
GGCAGCTGCCGTCGAGCCAGTGTGAACCCCATGCTGCATGAGTACCTGGTCAACCACTTGCCACTTGCAGTCAAC 
AACGACACCAGTGAGTACACCATGCTGGCACCCTTGGACCCACTGGGCCACAACTATGGCATCTACACTGTCACT 
GACCAGGACCCTCGCACGGCCAAGGAGATCGCGCTCGGCCGGTGCTTTGATGGCACATCCGATGGCTCCTCCAGA 
ATCATGAAGAGCAATGTGGGAGTAGCCCTCACCTTCAACTGTGTAGAGAGGCAAGTAGGCCGCCAGAGTGCCTTC 
CAGTACCTCCAAAGCACCCCAGCCCAGTCCCCTGCTGCAGGCACTGTCCAAGGAAGAGTGCCCTCGAGGAGGCAG 
CAGCGAGCGAGCAGGGGTGGCCAGCGCCAGGGTGGAGTGGTGGCCTCTCTGAGATTTCCTAGAGTTGCTCAACAG 
CCCCTGATCAACTAAGTTTTGTGGTACTTCACCCTCTTCTGCCCTCATTTCATGTGACAGCCATTGTGAGACTGA 
TGCACAAACTGTCACTTGGTTAATTTAAGCACTTCTGTTTTCGTGAATTTGCTTGTTTGTTTCTTCATGCCTTTA 
CTTACTTTGTCCCATGCTACTGATTGGCACGTGGCCCCCACAATGGCACAATAAAGCCCCTTTGTGAAACTGTTC 
TTTAAATGAAACACAAGAAATTGGCCACTGGTAAAACTCTGCAGCTTCAACTGTACTTCATTTAATGCCATTAAT 
GCAAATATACTTCCTCTTCTTTTTGCATGGTTTTGCCCACCTCTGCAATAGTGATAATCTGATGCTGAAGATCAA 
ATAACCAATATAAAGCATATTTCTTGGCCTTGCTCCACAGGACATAGGCAAGCCTTGATCATAGTTCATACATAT 
AAATGGTGGTGAAATAAAGAAATAAAACACAATACTTTTACTTGAAATGTAAATAACTTATTTATTTCTTTGCTA 
AATTTGGAATTCTAGTGCACATTCTVAAGTTAAGCTATTAAATATAGGGTGATCATAGTTCCTCTACCAAGTCTGG 
AAAGAACATCTCCTGGTATCCACAATTACACCAGGTTGCTAACTGTATTTGTACATTTCCCTTTGCATTCGCTTT 
TGTTCTTGCTAGAAACCCAGTGTAGCCCAGGGCAGATGTCAATAAATGCATACTCTGTATTTCGAAAAAA 
FIGURE 412

MVGTKAWVFSFLVLEVTSVLGRQTMLTQSVRRVQPGKKNPSIFAKPADTLESPGEWTTWFNID 
YPGGKGDYERLDAIRFYYGDRVCARPLRLEARTTDWTPAGSTGQVVHGSPREGFWCLNREQRP
GQNCSNYTVRFLCPPGSLRRDTERIWSPWSPWSKCSAACGQTGVQTRTRICLAEMVSLCSEAS
EEGQHCMGQDCTACDLTCPMGQVNADCDACMCQDFMLHGAVSLPGGAPASGAAIYLLTKTPKL
LTQTDSDGRFRIPGLCPDGKSILKITKVKFAPIVLTMPKTSLKAATIKAEFVRAETPYMVMNP 
ETKARRAGQSVSLCCKATGKPRPDKYFWYHNDTLLDPSLYKHESKLVLRKLQQHQAGEYFCKA 
QSDAGAVKSKVAQLIVTASDETPCNPVPESYLIRLPHDCFQNATNSFYYDVGRCPVKTCAGQQ 
DNGIRCRDAVQNCCGISKTEEREIQCSGYTLPTKVAKECSCQRCTETRSIVRGRVSAADNGEP 
MRFGHVYMGNSRVSMTGYKGTFTLHVPQDTERLVLTFVDRLQKFVNTTKVLPFNKKGSAVFHE 
IKMLRRKEPITLEAMETNIIPLGEVVGEDPMAELEIPSRSFYRQNGEPYIGKVKASVTFLDPR
NISTATAAQTDLNFINDEGDTFPLRTYGMFSVDFRDEVTSEPLNAGKVKVHLDSTQVKMPEHI 
STVKLWSLNPDTGLWEEEGDFKFENQRRNKREDRTFLVGNLEIRERRLFNLDVPESRRCFVKV 
RAYRSERFLPSEQIQGVVISVINLEPRTGFLSNPRAWGRFDSVITGPNGACVPAFCDDQSPDA 
YSAYVLASLAGEELQAVESSPKFNPNAIGVPQPYLNKLNYRRTDHEDPRVKKTAFQISMAKPR 
PNSAEESNGPIYAFENLRACEEAPPSAAHFRFYQIEGDRYDYNTVPFNEDDPMSWTEDYLAWW 
PKPMEFRACYIKVKIVGPLEVNVRSRNMGGTHRRTVGKLYGIRDVRSTRDRDQPNVSAACLEF
KCSGMLYDQDRVDRTLVKVIPQGSCRRASVNPMLHEYLVNHLPLAVNNDTSEYTMLAPLDPLG 
HNYGIYTVTDQDPRTAKEIALGRCFDGTSDGSSRIMKSNVGVALTFNCVERQVGRQSAFQYLQ
STPAQSPAAGTVQGRVPSRRQQRASRGGQRQGGVVASLRFPRVAQQPLIN



FIGURE 413



gccacgttgtcttctttccttcaccaccacccaggagctcagagatctaagct^ 

ttttctcccagccccaggacactgactctgta(::ag<^tggggccgtcctcttc 

atcctaatcccccttctccagctgatcaacccggggagtactct^gtgttccttagactccgtt 

atggataagaagatcaaggatgttctcaacagtctagagtacagtccctctcctataagcaag 

aagctctcgtgtgctagtgtcaaaagccaaggcagaccgtcctcctgccctgctgggatgg^ 

gtcactggctgtgcttgtggctatggctgtggttcgtgggatgttcagctggaaacctvcct^ 

cactgccagtgcagtgtggtggactggaccactgcccgctgctgccac^ 

ggaggctgagaactcagttttgtgacct^tgacagtaatgaaacca^gggtccc^ 

CTAACTCAAACGTCCCACTTCATTTGTTCCATTCCTGATTCTTGGGTAA^ 
GTACCTCAAAAAAAAAAAAAAAAAAAAA 
FIGURE 414

MGPSSCLLLILIPLLQLINPGSTQCSLDSVMDKKIKDVLNSLEYSPSPISKKLSCASVKS
QGRPSSCPAGMAVTGCACGYGCGSWDVQLETTCHCQCSVVDWTTARCCHLT
FIGURE 415

CAGAAGAGGGGGCTAGCTAGCTGTCTCTGCGGACCAGGGAGACCCCCGCGCCCCCCCGGTGTG 

AGGCGGCCTCACAGGGCCGGGTGGGCTGGCGAGCCGACGCGGCGGCGGAGGAGGCTGTGAGGA 

GTGTGTGGAACAGGACCCGGGACAGAGGAACCATGGCTCCGCAGAACCTGAGCACCTTTTGCC 

TGTTGCTGCTATACCTCATCGGGGCGGTGATTGCCGGACGAGATTTCTATAAGATCTTGGGGG 

TGCCTCGAAGTGCCTCTATAAAGGATATTAAAAAGGCCTATAGGAAACTAGCCCTGCAGCTTC 

ATCCCGACCGGAACCCTGATGATCCACAAGCCCAGGAGAAATTCCAGGATCTGGGTGCTGCTT 

ATGAGGTTCTGTCAGATAGTGAGAAACGGAAACAGTACGATACTTATGGTGAAGAAGGATTAA 

AAGATGGTCATCAGAGCTCCCATGGAGACATTTTTTCACACTTCTTTGGGGATTTTGGTTTCA 

TGTTTGGAGGAACCCCTCGTCAGCAAGACAGAAATATTCCAAGAGGAAGTGATATTATTGTAG 

ATCTAGAAGTCACTTTGGAAGAAGTATATGCAGGAAATTTTGTGGAAGTAGTTAGAAACAAAC

CTGTGGCAAGGCAGGCTCCTGGCAAACGGAAGTGCAATTGTCGGCAAGAGATGCGGACCACCC 

AGCTGGGCCCTGGGCGCTTCCAAATGACCCAGGAGGTGGTCTGCGACGAATGCCCTAATGTCA 

AACTAGTGAATGAAGAACGAACGCTGGAAGTAGAAATAGAGCCTGGGGTGAGAGACGGCATGG 

AGTACCCCTTTATTGGAGAAGGTGAGCCTCACGTGGATGGGGAGCCTGGAGATTTACGGTTCC 

GAATCAAAGTTGTCAAGCACCCAATATTTGAAAGGAGAGGAGATGATTTGTACACAAATGTGA 

CAATCTCATTAGTTGAGTCACTGGTTGGCTTTGAGATGGATATTACTCACTTGGATGGTCACA 

AGGTACATATTTCCCGGGATAAGATCACCAGGCCAGGAGCGAAGCTATGGAAGAAAGGGGAAG 

GGCTCCCCAACTTTGACAACAACAATATCAAGGGCTCTTTGATAATCACTTTTGATGTGGATT 

TTCCAAAAGAACAGTTAACAGAGGAAGCGAGAGAAGGTATCAAACAGCTACTGAAACAAGGGT 

CAGTGCAGAAGGTATACAATGGACTGCAAGGATATJSaGAGTGAATAAAATTGGACTTTGTTT 

AAAATAAGTGAATAAGCGATATTTATTATCTGCAAGGTTTTTTTGTGTGTGTTTTTGTTTTTA 

TTTTCAATATGCAAGTTAGGCTTAATTTTTTTATCTAATGATCATCATGAAATGAATAAGAGG 

GCTTAAGAATTTGTCCATTTGCATTCGGAAAAGAATGACCAGCAAAAGGTTTACTAATACCTC 

TCCCTTTGGGGATTTAATGTCTGGTGCTGCCGCCTGAGTTTCAAGAATTAAAGCTGCAAGAGG 

ACTCCAGGAGCAAAAGAAACACAATATAGAGGGTTGGAGTTGTTAGCAATTTCATTCAAAATG 

CCAACTGGAGAAGTCTGTTTTTAAATACATTTTGTTGTTATTTTTA 
FIGURE 416

MAPQNLSTFCLLLLYLIGAVIAGRDFYKILGVPRSASIKDIKKAYRKLALQLHPDRNPDDPQAQEKFQDLGAAYE 
VLSDSEKRKQYDTYGEEGLKDGHQSSHGDIFSHFFGDFGFMFGGTPRQQDRNIPRGSDIIVDLEVTLEEVYAGNF 
VEVVRNKPVARQAPGKRKCNCRQEMRTTQLGPGRFQMTQEVVCDECPNVKLVNEERTLEVEIEPGVRDGMEYPFI
GEGEPHVDGEPGDLRFRIKVVKHPIFERRGDDLYTNVTISLVESLVGFEMDITHLDGHKVHISRDKITRPGAKLW
KKGEGLPNFDNNNIKGSLIITFDVDFPKEQLTEEAREGIKQLLKQGSVQKVYNGLQGY 

Important features:
Signal peptide: 
amino acids 1-22 

Cell attachment sequence. 

amino acids 254-257 

Nt-dnaJ domain signature.

amino acids 67-87 

Homologous region to Nt-dnaJ domain proteins. 

amino acids 26-58 

N-glycosylation site.

amino acids 5-9, 261-265 

Tyrosine kinase phosphorylation site.
amino acids 253-260 

N-myristoylation site. 

amino acids 18-24, 31-37, 93-99, 215-221 



Amidation site. 

amino acids 164-168 
FIGURE 417

CGGCGGCGGCTGCGGGCGCGAGGTGAGGGGCGCGAGGTGAGGGGCGCGAGGTTCCCAGCAGGA 

TGCCCCGGCTCTGCAGGAAGCTGAAGTGAGAGGCCCGGAGAGGGCCCAGCCCGCCCGGGGCAG 

GATGACCAAGGCCCGGCTGTTCCGGCTGTGGCTGGTGCTGGGGTCGGTGTTCATGATCCTGCT

GATCATCGTGTACTGGGACAGCGCAGGCGCCGCGCACTTCTACTTGCACACGTCCTTCTCTAG 

GCCGCACACGGGGCCGCCGCTGCCCACGCCCGGGCCGGACAGGGACAGGGAGCTCACGGCCGA 

CTCCGATGTCGACGAGTTTCTGGACAAGTTTCTCAGTGCTGGCGTGAAGCAGAGCGACCTTCC 

CAGAAAGGAGACGGAGCAGCCGCCTGCGCCGGGGAGCATGGAGGAGAGCGTGAGAGGCTACGA 

CTGGTCCCCGCGCGACGCCCGGCGCAGCCCAGACCAGGGCCGGCAGCAGGCGGAGCGGAGGAG 

CGTGCTGCGGGGCTTCTGCGCCAACTCCAGCCTGGCCTTCCCCACCAAGGAGCGCGCATTCGA 

CGACATCCCCAACTCGGAGCTGAGCCACCTGATCGTGGACGACCGGCACGGGGCCATCTACTG 

CTACGTGCCCAAGGTGGCCTGCACCAACTGGAAGCGCGTGATGATCGTGCTGAGCGGAAGCCT 

GCTGCACCGCGGTGCGCCCTACCGCGACCCGCTGCGCATCCCGCGCGAGCACGTGCACAACGC 

CAGCGCGCACCTGACCTTCAACAAGTTCTGGCGCCGCTACGGGAAGCTCTCCCGCCACCTCAT 

GAAGGTCAAGCTCAAGAAGTACACCAAGTTCCTCTTCGTGCGCGACCCCTTCGTGCGCCTGAT 

CTCCGCCTTCCGCAGCAAGTTCGAGCTGGAGAACGAGGAGTTCTACCGCAAGTTCGCCGTGCC 

CATGCTGCGGCTGTACGCCAACCACACCAGCCTGCCCGCCTCGGCGCGCGAGGCCTTCCGCGC 

TGGCCTCAAGGTGTCCTTCGCCAACTTCATCCAGTACCTGCTGGACCCGCACACGGAGAAGCT 

GGCGCCCTTCAACGAGCACTGGCGGCAGGTGTACCGCCTCTGCCACCCGTGCCAGATCGACTA 

CGACTTCGTGGGGAAGCTGGAGACTCTGGACGAGGACGCCGCGCAGCTGCTGCAGCTACTCCA 

GGTGGACCGGCAGCTCCGCTTCCCCCCGAGCTACCGGAACAGGACCGCCAGCAGCTGGGAGGA 

GGACTGGTTCGCCAAGATCCCCCTGGCCTGGAGGCAGCAGCTGTATAAACTCTACGAGGCCGA 

CTTTGTTCTCTTCGGCTACCCCAAGCCCGAAAACCTCCTCCGAGACTSaAAGCTTTCGCGTTG 

CTTTTTCTCGCGTGCCTGGAACCTGACGCACGCGCACTCCAGTTTTTTTATGACCTACGATTT 

TGCAATCTGGGCTTCTTGTTCACTCCACTGCCTCTATCCATTGAGTACTGTATCGATATTGTT 
TTTTAAGATTAATATATTTCAGGTATTTAATACGA 
FIGURE 418

MTKARLFRLWLVLGSVFMILLIIVYWDSAGAAHFYLHTSFSRPHTGPPLPTPGPDRDRELTAD 
SDVDEFLDKFLSAGVKQSDLPRKETEQPPAPGSMEESVRGYDWSPRDARRSPDQGRQQAERRS 
VLRGFCANSSLAFPTKERAFDDIPNSELSHLIVDDRHGAIYCYVPKVACTNWKRVMIVLSGSL 
LHRGAPYRDPLRIPREHVHNASAHLTFNKFWRRYGKLSRHLMKVKLKKYTKFLFVRDPFVRLI
SAFRSKFELENEEFYRKFAVPMLRLYANHTSLPASAREAFRAGLKVSFANFIQYLLDPHTEKL
APFNEHWRQVYRLCHPCQIDYDFVGKLETLDEDAAQLLQLLQVDRQLRFPPSYRNRTASSWEE 
DWFAKIPLAWRQQLYKLYEADFVLFGYPKPENLLRD 

Important features:
Signal peptide: 

amino acids 1-31 

N-glycosylation sites. 

amino acids 134-137, 209-212, 280-283 and 370-373 

TNFR/NGFR family cysteine-rich region protein 

amino acids 329-332 
FIGURE 419

GGCACGAGGCTGAACCCAGCCGGCTCCATCTCAGCTTCTGGTTTCTAAGTCCATGTGCCAAAG 

GCTGCCAGGAAGGAGACGCCTTCCTGAGTCCTGGATCTTTCTTCCTTCTGGAAATCTTTGACT 

GTGGGTAGTTATTTATTTCTGAATAAGAGCGTCCACGCATCATCGACCTCGCGGGACTGCTGA 

AGTCTCAGTTCCTGTGCCACCTGGTCTTCTGCTACGTCTTTATTGCCTCAGGGCTAATCATCA 

ACACCATTCAGCTCTTCACTCTCCTCCTCTGGCCCATTAACAAGCAGCTCTTCCGGAAGATCA 

ACTGCAGACTGTCCTATTGCATCTCAAGCCAGCTGGTGATGCTGCTGGAGTGGTGGTCGGGCA 

CGGAATGCACCATCTTCACGGACCCGCGCGCCTACCTCAAGTATGGGAAGGAAAATGCCATCG 

TGGTTCTCAACCACAAGTTTGAAATTGACTTTCTGTGTGGCTGGAGCCTGTCCGAACGCTTTG 

GGCTGTTAGGGGGCTCCAAGGTCCTGGCCAAGAAAGAGCTGGCCTATGTCCCAATTATCGGCT 

GGATGTGGTACTTCACCGAGATGGTCTTCTGTTCGCGCAAGTGGGAGCAGGATCGCAAGACGG 

TTGCCACCAGTTTGCAGCACCTCCGGGACTACCCCGAGAAGTATTTTTTCCTGATTCACTGTG 

AGGGCACACGGTTCACGGAGAAGAAGCATGAGATCAGCATGCAGGTGGCCCGGGCCAAGGGGC 

TGCCTCGCCTCAAGCATCACCTGTTGCCACGAACCAAGGGCTTCGCCATCACCGTGAGGAGCT 

TGAGAAATGTAGTTTCAGCTGTATATGACTGTACACTCAATTTCAGAAATAATGAAAATCCAA 

CACTGCTGGGAGTCCTAAACGGAAAGAAATACCATGCAGATTTGTATGTTAGGAGGATCCCAC 

TGGAAGACATCCCTGAAGACGATGACGAGTGCTCGGCCTGGCTGCACAAGCTCTACCAGGAGA 

AGGATGCCTTTCAGGAGGAGTACTACAGGACGGGCACCTTCCCAGAGACGCCCATGGTGCCCC 

CCCGGCGGCCCTGGACCCTCGTGAACTGGCTGTTTTGGGCCTCGCTGGTGCTCTACCCTTTCT 

TCCAGTTCCTGGTCAGCATGATCAGGAGCGGGTCTTCCCTGACGCTGGCCAGCTTCATCCTCG 

TCTTCTTTGTGGCCTCCGTGGGAGTTCGATGGATGATTGGTGTGACGGAAATTGACAAGGGCT 

CTGCCTACGGCAACTCTGACAGCAAGCAGAAACTGAATGACTGACTCAGGGAGGTGTCACCAT 

CCGAAGGGAACCTTGGGGAACTGGTGGCCTCTGCATATCCTCCTTAGTGGGACACGGTGACAA 

AGGCTGGGTGAGCCCCTGCTGGGCACGGCGGAAGTCACGACCTCTCCAGCCAGGGAGTCTGGT 

CTCAAGGCCGGATGGGGAGGAAGATGTTTTGTAATCTTTTTTTCCCCATGTGCTTTAGTGGGC 

TTTGGTTTTCTTTTTGTGCGAGTGTGTGTGAGAATGGCTGTGTGGTGAGTGTGAACTTTGTTC 

TGTGATCATAGAAAGGGTATTTTAGGCTGCAGGGGAGGGCAGGGCTGGGGACCGAAGGGGACA 

AGTTCCCCTTTCATCCTTTGGTGCTGAGTTTTCTGTAACCCTTGGTTGCCAGAGATAAAGTGA 

AAAGTGCTTTAGGTGAGATGACTAAATTATGCCTCCAAGAAAAAAAAATTAAAGTGCTTTTCT 
GGGTCAAAAAAAAAAAA 
FIGURE 420

MDLAGLLKSQFLCHLVFCYVFIASGLIINTIQLFTLLLWPINKQLFRKINCRLSYCISSQLVM 
LLEWWSGTECTIFTDPRAYLKYGKENAIVVLNHKFEIDFLCGWSLSERFGLLGGSKVLAKKEL 
AYVPIIGWMWYFTEMVFCSRKWEQDRKTVATSLQHLRDYPEKYFFLIHCEGTRFTEKKHEISM 
QVARAKGLPRLKHHLLPRTKGFAITVRSLRNVVSAVYDCTLNFRNNENPTLLGVLNGKKYHAD
LYVRRIPLEDIPEDDDECSAWLHKLYQEKDAFQEEYYRTGTFPETPMVPPRRPWTLVNWLFWA 
SLVLYPFFQFLVSMIRSGSSLTLASFILVFFVASVGVRWMIGVTEIDKGSAYGNSDSKQKLND
FIGURE 421



CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTGGGTGCCTGCATC 
GCCATGGACACCACCAGGTACAGCAAGTGGGGCGGCAGCTCCGAGGAGGTCCCCGGAGGGCCC
TGGGGACGCTGGGTGCACTGGAGCAGGAGACCCCTCTTCTTGGCCCTGGCTGTCCTGGTCACC 
ACAGTCCTTTGGGCTGTGATTCTGAGTATCCTATTGTCCAAGGCCTCCACGGAGCGCGCGGCG 
CTGCTTGACGGCCACGACCTGCTGAGGACAAACGCCTCGAAGCAGACGGCGGCGCTGGGTGCC 
CTGAAGGAGGAGGTCGGAGACTGCCACAGCTGCTGCTCGGGGACGCAGGCGCAGCTGCAGACC 
ACGCGCGCGGAGCTTGGGGAGGCGCAGGCGAAGCTGATGGAGCAGGAGAGCGCCCTGCGGGAA 
CTGCGTGAGCGCGTGACCCAGGGCTTGGCTGAAGCCGGCAGGGGCCGTGAGGACGTCCGCACT 
GAGCTGTTCCGGGCGCTGGAGGCCGTGAGGCTCCAGAACAACTCCTGCGAGCCGTGCCCCACG 
TCGTGGCTGTCCTTCGAGGGCTCCTGCTACTTTTTCTCTGTGCCAAAGACGACGTGGGCGGCG 
GCGCAGGATCACTGCGCAGATGCCAGCGCGCACCTGGTGATCGTTGGGGGCCTGGATGAGCAG 
GGCTTCCTCACTCGGAACACGCGTGGCCGTGGTTACTGGCTGGGCCTGAGGGCTGTGCGCCAT 
CTGGGCAAGGTTCAGGGCTACCAGTGGGTGGACGGAGTCTCTCTCAGCTTCAGCCACTGGAAC 
CAGGGAGAGCCCAATGACGCTTGGGGGCGCGAGAACTGTGTCATGATGCTGCACACGGGGCTG 
TGGAACGACGCACCGTGTGACAGCGAGAAGGACGGCTGGATCTGTGAGAAAAGGCACAACTGC 
JES&CCCCGCCCAGTGCCCTGGAGCCGCGCCCATTGCAGCATGTCGTATCCTGGGGGCTGCTCA 
CCTCCCTGGCTCCTGGAGCTGATTGCCAAAGAGTTTTTTTCTTCCTCATCCACCGCTGCTGAG 
TCTCAGAAACACTTGGCCCAACATAGCCCTGTCCAGCCCAGTGCCTGGGCTCTGGGACCTCCA 
TGCCGACCTCATCCTAACTCCACTCACGCAGACCCAACCTAACCTCCACTAGCTCCAAAATCC 
CTGCTCCTGCGTCCCCGTGATATGCCTCCACTTCTCTCCCTAACCAAGGTTAGGTGACTGAGG 
ACTGGAGCTGTTTGGTTTTCTCGCATTTTCCACCAAACTGGAAGCTGTTTTTGCAGCCTGAGG 
AAGCATCAATAAATATTTGAGAAATGAAAAAA 
FIGURE 422

MDTTRYSKWGGSSEEVPGGPWGRWVHWSRRPLFLALAVLVTTVLWAVILSILLSKASTERAAL
LDGHDLLRTNASKQTAALGALKEEVGDCHSCCSGTQAQLQTTRAELGEAQAKLMEQESALREL 
RERVTQGLAEAGRGREDVRTELFRALEAVRLQNNSCEPCPTSWLSFEGSCYFFSVPKTTWAAA 
QDHCADASAHLVIVGGLDEQGFLTRNTRGRGYWLGLRAVRHLGKVQGYQWVDGVSLSFSHWNQ 
GEPNDAWGRENCVMMLHTGLWNDAPCDSEKDGWICEKRHNC 

Important features:

Type II transmembrane domain:

amino acids 31-54 

N-glycosylation sites.

amino acids 73-76 and 159-162 

Leucine zipper pattern. 

amino acids 102-123 

N-myristoylation sites. 

amino acids 18-23, 133-138 and 242-247



C-type lectin domain signature. 

amino acids 264-287 
FIGURE 423

GCCCAACATCGCGGGTGGGCGCTGCGGCCCGCAGCTAACGGCGCTCCTGGCCGCCTGGATCGC 

CTTTGTCACCACTCTCCCAGCATTTTTTCATGCAAAGGATGGGATATTCCGCC^^^ 

SSJ?S'^'''^^''''''^^^^^'^^^^^^'r'rATATCTTAGAGAAGAAAT^^^ 

JS??JJ?^^'"^'^'^''^^^^*^^'^'^^'^^T^CGATGTCTGGAATGGCTGGTcS^^ 

ctctggcaagatatggcatcttcacaactatttcacagtgactcttggaattcctgStggtg 

ttcttatgtgtttttcgtcatagccaccttggtttttggcctttttatggSSgg5ot^ 

ggtaatatcagaatgtttctatgtgccacttccaaggcatttatS^^^ 

JSS^IJJ^^^'^^^^^^^^'^'^^^^a^'^atgctgacaagggactgtagatttaatgatg^^^ 

J^^J?S'^'^''''^''^^^^^^^TATGTCAGCTTCCCTTTGGCCT^GTTTGTA^^ 

cttaatttttcctgaatgagcaagcttctcttaaaagatgctctctagtca™^^ 

GCAGTAAGCCTCATGTATACTAAGGAGAGTCTTCCAGGTGTGACAATCAGGATATAGAA^ 

tttccagcatttccaccatgggcattcaggctctccacactcttcactattSctSJg^^^ 

gaggactccaataacagccaggtttacatgaactgtgtttgttcmtctS^ 

agataatcagtaaccataacccctgaagctgtgactgccaaacatctcSatgaa^^ 

SSSSS^^^^^^^^^^^^'^^^^'^^tt^ttttaagttttctaagcaatatttttcaagccagaag 

ISS^Sr^^''''^^^''^^^^^^^°^'^^^'rC'rTGTGAAGAAAAGTTGAATACTGTO 
JSJJSJJIS™^''^''''"^^'^^™^^™TG^CTCCAGTACCTGAAAGTGAAAGATT 



FIGURE 424



MAGGRCGPQLTALLAAWIAAVAATAGPEEAALPPEQSRVQPMTASNWTLVMEGEWMLKFYAPW 
CPSCQQTDSEWEAFAKNGEILQISVGKVDVIQEPGLSGRFFVTTLPAFFHAKDGIFRRYRGPG 
IFEDLQNYILEKKWQSVEPLTGWKSPASLTMSGMAGLFSISGKIWHLHNYFTVTLGIPAWCSY 
VFFVIATLVFGLFMGLVLVVISECFYVPLPRHLSERSEQNRRSEEAHRAEQLQDAEEEKDDSN 
EEENKDSLVDDEEEKEDLGDEDEAEEEEEEDNLAAGVDEERSEANDQGPPGEDGVTREEVEPE 
EAEEGISEQPCPADTEWEDSLRQRKSQHADKGL 

Important features:
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 191-211 

N-glycosylation site. 

amino acids 46-49 

Thioredoxin family proteins (homologous region to disulfide isomerase).

amino acids 56-72

Flavodoxin proteins 

amino acids 173-187 
FIGURE 425



GAGGAACCTACCGGTACCGGCCGCGCGCTGGTAGTCGCCGGTGTGGCTGCACCTCACCAATCCCGTGCGCCGCGG 

CTGGGCCGTCGGAGAGTGCGTGTGCTTCTCTCCTGCACGCGGTGCTTGGGCTCGGCCAGGCGGGGTCCGCCGCCA 

GGGTTTGAGGATGGGGGAGTAGCTACAGGAAGCGACCCCGCGATGGCAAGGTATATTTTTGTGGAATGAAAAGGA 

AGTATTAGAAATGAGCTGAAGACCATTCACAGATTAATATTTTTGGGGACAGATTTGTGATGCTTGATTCACCCT 

TGAAGTAATGTAGACAGAAGTTCTCAAATTTGCATATTACATCAACTGGAACCAGCAGTGAATCTTAATGTTCAC 

TTAAATCAGAACTTGCATAAGAAAGAGAATCGGAGTCTGGTTAAATAAAGATGACTATATCAGAGACTTGAAM 

GATCATTCTCTGTTTTCTGATAGTGTATATGGCCATTTTAGTGGGCACAGATCAGGATTTTTACAGTTTACTTGG 

AGTGTCCAAAACTGCAAGCAGTAGAGAAATAAGACAAGCTTTCAAGAAATTGGCATTGAAGTTACATCCTGATAA 

AAACCCGAATAACCCAAATGCACATGGCGATTTTTTAAAAATAAATAGAGCATATGAAGTACTCAAAGATGAAGA

TCTACGGAAAAAGTATGACAAATATGGAGAAAAGGGACTTGAGGATAATCAAGGTGGCCAGTATGAAAGCTGGAA 

CTATTATCGTTATGATTTTGGTATTTATGATGATGATCCTGAAATCATAACATTGGAAAGAAGAGAATTTGATGC 

TGCTGTTAATTCTGGAGAACTGTGGTTTGTAAATTTTTACTCCCCAGGCTGTTCACACTGCCATGATTTAGCTCC

CACATGGAGAGACTTTGCTAAAGAAGTGGATGGGTTACTTCGAATTGGAGCTGTTAACTGTGGTGATGATAGAAT

GCTTTGCCGAATGAAAGGAGTCAACAGCTATCCCAGTCTCTTCATTTTTCGGTCTGGAATGGCCCCAGTGAAATA 

TCATGGAGACAGATCAAAGGAGAGTTTAGTGAGTTTTGCAATGCAGCATGTTAGAAGTACAGTGACAGAACTTTG 

GACAGGAAATTTTGTCAACTCCATACAAACTGCTTTTGCTGCTGGTATTGGCTGGCTGATCACTTTTTGTTCAAA 

AGGAGGAGATTGTTTGACTTCACAGACACGACTCAGGCTTAGTGGCATGTTGTTTCTCAACTCATTGGATGCTAA 

AGAAATATATTTGGAAGTAATACATAATCTTCCAGATTTTGAACTACTTTCGGCAAACACACTAGAGGATCGTTT

GGCTCATCATCGGTGGCTGTTATTTTTTCATTTTGGAAAAAATGAAAATTCAAATGATCCTGAGCTGAAAAAACT 

AAAAACTCTACTTAAAAATGATCATATTCAAGTTGGCAGGTTTGACTGTTCCTCTGCACCAGACATCTGTAGTAA 

TCTGTATGTTTTTCAGCCGTCTCTAGCAGTATTTAAAGGACAAGGAACCAAAGAATATGAAATTCATCATGGAAA 

GAAGATTCTATATGATATACTTGCCTTTGCCAAAGAAAGTGTGAATTCTCATGTTACCACGCTTGGACCTCAAAA 

TTTTCCTGCCAATGACAAAGAACCATGGCTTGTTGATTTCTTTGCCCCCTGGTGTCCACCATGTCGAGCTTTACT 

ACCAGAGTTACGAAGAGCATCAAATCTTCTTTATGGTCAGCTTAAGTTTGGTACACTAGATTGTACAGTTCATGA 

GGGACTCTGTAACATGTATAACATTCAGGCTTATCCAACAACAGTGGTATTCAACCAGTCCAACATTCATGAGTA 

TGAAGGACATCACTCTGCTGAACAAATCTTGGAGTTCATAGAGGATCTTATGAATCCTTCAGTGGTCTCCCTTAC 

ACCCACCACCTTCAACGAACTAGTTACACAAAGAAAACACAACGAAGTCTGGATGGTTGATTTCTATTCTCCGTG 

GTGTCATCCTTGCCAAGTCTTAATGCCAGAATGGAAAAGAATGGCCCGGACATTAACTGGACTGATCAACGTGGG 

CAGTATAGATTGCCAACAGTATCATTCTTTTTGTGCCCAGGAAAACGTTCAAAGATACCCTGAGATAAGATTTTT 

TCCCCCAAAATCAAATAAAGCTTATCAGTATCACAGTTACAATGGTTGGAATAGGGATGCTTATTCCCTGAGAAT 

CTGGGGTCTAGGATTTTTACCTCAAGTATCCACAGATCTAACACCTCAGACTTTCAGTGAAAAAGTTCTACAAGG 

GAAAAATCATTGGGTGATTGATTTCTATGCTCCTTGGTGTGGACCTTGCCAGAATTTTGCTCCAGAATTTGAGCT 

CTTGGCTAGGATGATTAAAGGAAAAGTGAAAGCTGGAAAAGTAGACTGTCAGGCTTATGCTCAGACATGCCAGAA 

AGCTGGGATCAGGGCCTATCCAACTGTTAAGTTTTATTTCTACGAAAGAGCAAAGAGAAATTTTCAAGAAGAGCA 

GATAAATACCAGAGATGCAAAAGCAATCGCTGCCTTAATAAGTGAAAAATTGGAAACTCTCCGAAATCAAGGCAA

GAGGAATAAGGATGAACTTTCATAATGTTGAAGATGAAGAAAAAGTTTAAAAGAAATTCTGACAGATGACATCAG

AAGACACCTATTTAGAATGTTACATTTATGATGGGAATGAATGAACATTATCTTAGACTTGCAGTTGTACTGCCA 

GAATTATCTACAGCACTGGTGTAAAAGAAGGGTCTGCAAACTTTTTCTGTAAAGGGCCGGTTTATAAATATTTTA 

GACTTTGCAGGCTATAATATATGGTTCACACATGAGAACAAGAATAGAGTCATCATGTATTCTTTGTTATTTGCT 

TTTAACAACCTTTAAAAAATATTAAAACGATTCTTAGCTCAGAGCCATACAAAAGTAGGCTGGATTCAGTCCATG 

GACCATAGATTGCTGTCCCCCTCGACGGACTTATAATGTTTCAGGTGGCTGGCTTGAACATGAGTCTGCTGTGCT 

ATCTACATAAATGTCTAAGTTGTATAAAGTCCACTTTCCCTTCACGTTTTTTGGCTGACCTGAAAAGAGGTAACT 

TAGTTTTTGGTCACTTGTTCTCCTAAAAATGCTATCCCTAACCATATATTTATATTTCGTTTTAAAAACACCCAT 

GATGTGGCACAGTAAACAAACCCTGTTATGCTGTATTATTATGAGGAGATTCTTCATTGTTTTCTTTCCTTCTCA 

AAGGTTGAAAAAATGCTTTTAATTTTTCACAGCCGAGAAACAGTGCAGCAGTATATGTGCACACAGTAAGTACAC 

AAATTTGAGCAACAGTAAGTGCACAAATTCTGTAGTTTGCTGTATCATCCAGGAAAACCTGAGGGAAAAAAATTA 

TAGCAATTAACTGGGCATTGTAGAGTATCCTAAATATGTTATCAAGTATTTAGAGTTCTATATTTTAAAGATATA 

TGTGTTCATGTATTTTCTGAAATTGCTTTCATAGAAATTTTCCCACTGATAGTTGATTTTTGAGGCATCTAATAT 

TTACATATTTGCCTTCTGAACTTTGTTTTGACCTGTATCCTTTATTTACATTGGGTTTTTCTTTCATAGTTTTGG 

TTTTTCACTCCTGTCCAGTCTATTTATTATTCAAATAGGAAAAATTACTTTACAGGTTGTTTTACTGTAGCTTAT 

AATGATACTGTAGTTATTCCAGTTACTAGTTTACTGTCAGAGGGCTGCCTTTTTCAGATAAATATTGACATAATA 

ACTGAAGTTATTTTTATAAGAAAATCAAGTATATAAATCTAGGAAAGGGATCTTCTAGTTTCTGTGTTGTTTAGA 

CTCAAAGAATCACAAATTTGTCAGTAACATGTAGTTGTTTAGTTATAATTCAGAGTGTACAGAATGGTAAAAATT 

CCAATCAGTCAAAAGAGGTCAATGAATTAAAAGGCTTGCAACTTTTTCAAAAAAAAAAAAAAAAA 
FIGURE 426

MGVWLNKDDYIRDLKRIILCFLIVYMAILVGTDQDFYSLLGVSKTASSREIRQAFKKLALKLH 
PDKNPNNPNAHGDFLKINRAYEVLKDEDLRKKYDKYGEKGLEDNQGGQYESWNYYRYDFGIYD 
DDPEIITLERREFDAAVNSGELWFVNFYSPGCSHCHDLAPTWRDFAKEVDGLLRIGAVNCGDD 
RMLCRMKGVNSYPSLFIFRSGMAPVKYHGDRSKESLVSFAMQHVRSTVTELWTGNFVNSIQTA 
FAAGIGWLITFCSKGGDCLTSQTRLRLSGMLFLNSLDAKEIYLEVIHNLPDFELLSANTLEDR 
LAHHRWLLFFHFGKNENSNDPELKKLKTLLKNDHIQVGRFDCSSAPDICSNLYVFQPSLAVFK 
GQGTKEYEIHHGKKILYDILAFAKESVNSHVTTLGPQNFPANDKEPWLVDFFAPWCPPCRALL 
PELRRASNLLYGQLKFGTLDCTVHEGLCNMYNIQAYPTTVVFNQSNIHEYEGHHSAEQILEFI
EDLMNPSVVSLTPTTFNELVTQRKHNEVWMVDFYSPWCHPCQVLMPEWKRMARTLTGLINVGS
IDCQQYHSFCAQENVQRYPEIRFFPPKSNKAYQYHSYNGWNRDAYSLRIWGLGFLPQVSTDLT 
PQTFSEKVLQGKNHWVIDFYAPWCGPCQNFAPEFELLARMIKGKVKAGKVDCQAYAQTCQKAG 
IRAYPTVKFYFYERAKRNFQEEQINTRDAKAIAALISEKLETLRNQGKRNKDEL 

Important features:

Endoplasmic reticulum targeting sequence.
amino acids 744-747 

Cytochrome c family heme-binding site signature. 

amino acids 158-163 

Nt-dnaJ domain signature. 

amino acids 77-96 



N-glycosylation site. 

amino acids 484-487 
FIGURE 427

CTGCAGTCAGGACTCTGGGACCGCAGGGGGCTCCCGGACCCTGACTCTGCAGCCGAACCGGCA 

CGGTTTCGTGGGGACCCAGGCTTGCAAAGTGACGGTCATTTTCTCTTTCTTTCTCCCTCTTGA 

GTCCTTCTGAGATGATGGCTCTGGGCGCAGCGGGAGCTACCCGGGTCTTTGTCGCGATGGTAG

CGGCGGCTCTCGGCGGCCACCCTCTGCTGGGAGTGAGCGCCACCTTGAACTCGGTTCTCAATT 

CCAACGCTATCAAGAACCTGCCCCCACCGCTGGGCGGCGCTGCGGGGCACCCAGGCTCTGCAG 

TCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACAACTACC 

AGCCGTACCCGTGCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGCGCTAGTCCCACCC 

GCGGAGGGGACGCAGGCGTGCAAATCTGTCTCGCCTGCAGGAAGCGCCGAAAACGCTGCATGC 

GTCACGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTCTTCTGATCAAA 

ATCATTTCCGAGGAGAAATTGAGGAAACCATCACTGAAAGCTTTGGTAATGATCATAGCACCT 

TGGATGGGTATTCCAGAAGAACCACCTTGTCTTCAAAAATGTATCACACCAAAGGACAAGAAG 

GTTCTGTTTGTCTCCGGTCATCAGACTGTGCCTCAGGATTGTGTTGTGCTAGACACTTCTGGT 

CCAAGATCTGTAAACCTGTCCTGAAAGAAGGTCAAGTGTGTACCAAGCATAGGAGAAAAGGCT 

CTCATGGACTAGAAATATTCCAGCGTTGTTACTGTGGAGAAGGTCTGTCTTGCCGGATACAGA 

AAGATCACCATCAAGCCAGTAATTCTTCTAGGCTTCACACTTGTCAGAGACACTAAACCAGCT

ATCCAAATGCAGTGAACTCCTTTTATATAATAGATGCTATGAAAACCTTTTATGACCTTCATC 

AACTCAATCCTAAGGATATACAAGTTCTGTGGTTTCAGTTAAGCATTCCAATAACACCTTCCA 

AAAACCTGGAGTGTAAGAGCTTTGTTTCTTTATGGAACTCCCCTGTGATTGCAGTAAATTACT 

GTATTGTAAATTCTCAGTGTGGCACTTACCTGTAAATGCAATGAAACTTTTAATTATTTTTCT 

AAAGGTGCTGCACTGCCTATTTTTCCTCTTGTTATGTAAATTTTTGTACACATTGATTGTTAT 

CTTGACTGACAAATATTCTATATTGAACTGAAGTAAATCATTTCAGCTTATAGTTCTTAAAAG 

CATAACCCTTTACCCCATTTAATTCTAGAGTCTAGAACGCAAGGATCTCTTGGAATGACAAAT 

GATAGGTACCTAAAATGTAACATGAAAATACTAGCTTATTTTCTGAAATGTACTATCTTAATG 

CTTAAATTATATTTCCCTTTAGGCTGTGATAGTTTTTGAAftTAAAATTTAACATTTAAAAAAA 
AAAAAA 
FIGURE 428



MMALGAAGATRVFVAMVAAALGGHPLLGVSATLNSVLNSNAIKNLPPPLGGAAGHPGSAVSAA 

PGILYPGGNKYQTIDNYQPYPCAEDEECGTDEYCASPTRGGDAGVQICLACRKRRKRCMRHAM 

CCPGNYCKNGICVSSDQNHFRGEIEETITESFGNDHSTLDGYSRRTTLSSKMYHTKGQEGSVC 

LRSSDCASGLCCARHFWSKICKPVLKEGQVCTKHRRKGSHGLEIFQRCYCGEGLSCRIQKDHH 
QASNSSRLHTCQRH 

Important features:
Signal peptide: 

amino acids 1-23 

N-glycosylation site. 

amino acids 256-259 

Fungal Zn(2)-Cys(6) binuclear cluster domain

amino acids 110-126 
FIGURE 429

GAGAGGACGAGGTGCCGCTGCCTGGAGAATCCTCCGCTGCCGTCGGCTCCCGGAGCCCAGCCC 

TTTCCTAACCCAACCCAACCTAGCCCAGTCCCAGCCGCCAGCGCCTGTCCCTGTCACGGACCC 

CAGCGTTACCATGCATCCTGCCGTCTTCCTATCCTTACCCGACCTCAGATGCTCCCTTCTGCT

CCTGGTAACTTGGGTTTTTACTCCTGTAACAACTGAAATAACAAGTCTTGCTACAGAGAATAT 

AGATGAAATTTTAAACAATGCTGATGTTGCTTTAGTAAATTTTTATGCTGACTGGTGTCGTTT 

CAGTCAGATGTTGCATCCAATTTTTGAGGAAGCTTCCGATGTCATTAAGGAAGAATTTCCAAA 

TGAAAATCAAGTAGTGTTTGCCAGAGTTGATTGTGATCAGCACTCTGACATAGCCCAGAGATA 

CAGGATAAGCAAATACCCAACCCTCAAATTGTTTCGTAATGGGATGATGATGAAGAGAGAATA 

CAGGGGTCAGCGATCAGTGAAAGCATTGGCAGATTACATCAGGCAACAAAAAAGTGACCCCAT 

TCAAGAAATTCGGGACTTAGCAGAAATCACCACTCTTGATCGCAGCAAAAGAAATATCATTGG 

ATATTTTGAGCAAAAGGACTCGGACAACTATAGAGTTTTTGAACGAGTAGCGAATATTTTGCA 

TGATGACTGTGCCTTTCTTTCTGCATTTGGGGATGTTTCAAAACCGGAAAGATATAGTGGCGA 

CAACATAATCTACAAACCACCAGGGCATTCTGCTCCGGATATGGTGTACTTGGGAGCTATGAC 

AAATTTTGATGTGACTTACAATTGGATTCAAGATAAATGTGTTCCTCTTGTCCGAGAAATAAC 

ATTTGAAAATGGAGAGGAATTGACAGAAGAAGGACTGCCTTTTCTCATACTCTTTCACATGAA 

AGAAGATACAGAAAGTTTAGAAATATTCCAGAATGAAGTAGCTCGGCAATTAATAAGTGAAAA 

AGGTACAATAAACTTTTTACATGCCGATTGTGACAAATTTAGACATCCTCTTCTGCACATACA 

GAAAACTCCAGCAGATTGTCCTGTAATCGCTATTGACAGCTTTAGGCATATGTATGTGTTTGG 

AGACTTCAAAGATGTATTAATTCCTGGAAAACTCAAGCAATTCGTATTTGACTTACATTCTGG 

AAAACTGCACAGAGAATTCCATCATGGACCTGACCCAACTGATACAGCCCCAGGAGAGCAAGC 

CCAAGATGTAGCAAGCAGTCCACCTGAGAGCTCCTTCCAGAAACTAGCACCCAGTGAATATAG 

GTATACTCTATTGAGGGATCGAGATGAGCTTTAAAAACTTGAAAAACAGTTTGTAAGCCTTTC 

AACAGCAGCATCAACCTACGTGGTGGAAATAGTAAACCTATATTTTCATAATTCTATGTGTAT 

TTTTATTTTGAATAAACAGAAAGAAATTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAA 
FIGURE 430

MHPAVFLSLPDLRCSLLLLVTWVFTPVTTEITSLATENIDEILNNADVALVNFYADWCRFSQM 
LHPIFEEASDVIKEEFPNENQVVFARVDCDQHSDIAQRYRISKYPTLKLFRNGMMMKREYRGQ
RSVKALADYIRQQKSDPIQEIRDLAEITTLDRSKRNIIGYFEQKDSDNYRVFERVANILHDDC 
AFLSAFGDVSKPERYSGDNIIYKPPGHSAPDMVYLGAMTNFDVTYNWIQDKCVPLVREITFEN 
GEELTEEGLPFLILFHMKEDTESLEIFQNEVARQLISEKGTINFLHADCDKFRHPLLHIQKTP 
ADCPVIAIDSFRHMYVFGDFKDVLIPGKLKQFVFDLHSGKLHREFHHGPDPTDTAPGEQAQDV 
ASSPPESSFQKLAPSEYRYTLLRDRDEL 

Important features: 
Signal peptide: 

amino acids 1-29 

Endoplasmic reticulum targeting sequence.
amino acids 403-406 

Tyrosine kinase phosphorylation site. 

amino acids 203-211 



Thioredoxin family proteins 

amino acids 50-66 
FIGURE 431

GAGCAGGACGGAGCCATGGACCCCGCCAGGAAAGCAGGTGCCCAGGCCATGATCTGGACTGCA

GGCTGGCTGCTGCTGCTGCTGCTTCGCGGAGGAGCGCAGGCCCTGGAGTGCTACAGCTGCGTG 

CAGAAAGCAGATGACGGATGCTCCCCGAACAAGATGAAGACAGTGAAGTGCGCGCCGGGCGTG 

GACGTCTGCACCGAGGCCGTGGGGGCGGTGGAGACCATCCACGGACAATTCTCGCTGGCAGTG 

CGGGGTTGCGGTTCGGGACTCCCCGGGAAGAATGACCGCGGCCTGGATCTTCACGGGCTTCTG 

GCGTTCATCCAGCTGCAGCAATGCGCTCAGGATCGCTGCAACGCCAAGCTCAACCTCACCTCG 

CGGGCGCTCGACCCGGCAGGTAATGAGAGTGCATACCCGCCCAACGGCGTGGAGTGCTACAGC 

TGTGTGGGCCTGAGCCGGGAGGCGTGCCAGGGTACATCGCCGCCGGTCGTGAGCTGCTACAAC 

GCCAGCGATCATGTCTACAAGGGCTGCTTCGACGGCAACGTCACCTTGACGGCAGCTAATGTG 

ACTGTGTCCTTGCCTGTCCGGGGCTGTGTCCAGGATGAATTCTGCACTCGGGATGGAGTAACA 

GGCCCAGGGTTCACGCTCAGTGGCTCCTGTTGCCAGGGGTCCCGCTGTAACTCTGACCTCCGC 

AACAAGACCTACTTCTCCCCTCGAATCCCACCCCTTGTCCGGCTGCCCCCTCCAGAGCCCACG 

ACTGTGGCCTCAACCACATCTGTCACCACTTCTACCTCGGCCCCAGTGAGACCCACATCCACC 

ACCAAACCCATGCCAGCGCCAACCAGTCAGACTCCGAGACAGGGAGTAGAACACGAGGCCTCC 

CGGGATGAGGAGCCCAGGTTGACTGGAGGCGCCGCTGGCCACCAGGACCGCAGCAATTCAGGG 

CAGTATCCTGCAAAAGGGGGGCCCCAGCAGCCCCATAATAAAGGCTGTGTGGCTCCCACAGCT 

GGATTGGCAGCCCTTCTGTTGGCCGTGGCTGCTGGTGTCCTACTGTS&GCTTCTCCACCTGGA 

AATTTCCCTCTCACCTACTTCTCTGGCCCTGGGTACCCCTCTTCTCATCACTTCCTGTTCCCA 

CCACTGGACTGGGCTGGCCCAGCCCCTGTTTTTCCAACATTCCCCAGTATCCCCAGCTTCTGC 

TGCGCTGGTTTGCGGCTTTGGGAAATAAAATACCGTTGTATATATTCTGCCAGGGGTGTTCTA 

GCTTTTTGAGGACAGCTCCTGTATCCTTCTCATCCTTGTCTCTCCGCTTGTCCTCTTGTGATG 

TTAGGACAGAGTGAGAGAAGTCAGCTGTCACGGGGAAGGTGAGAGAGAGGATGCTAAGCTTCC 

TACTCACTTTCTCCTAGCCAGCCTGGACTTTGGAGCGTGGGGTGGGTGGGACAATGGCTCCCC 

ACTCTAAGCACTGCCTCCCCTACTCCCCGCATCTTTGGGGAATCGGTTCCCCATATGTCTTCC 

TTACTAGACTGTGAGCTCCTCGAGGGGGGGCCCGGTACCCAATTCGCCCTATAGTGAGTCGTA 
FIGURE 432

MDPARKAGAQAMIWTAGWLLLLLLRGGAQALECYSCVQKADDGCSPNKMKTVKCAPGVDVCTE 
AVGAVETIHGQFSLAVRGCGSGLPGKNDRGLDLHGLLAFIQLQQCAQDRCNAKLNLTSRALDP
AGNESAYPPNGVECYSCVGLSREACQGTSPPVVSCYNASDHVYKGCFDGNVTLTAANVTVSLP 
VRGCVQDEFCTRDGVTGPGFTLSGSCCQGSRCNSDLRNKTYFSPRIPPLVRLPPPEPTTVAST 
TSVTTSTSAPVRPTSTTKPMPAPTSQTPRQGVEHEASRDEEPRLTGGAAGHQDRSNSGQYPAK 
GGPQQPHNKGCVAPTAGLAALLLAVAAGVLL 
FIGURE 433

CGGGACTCGGCGGGTCCTCCTGGGAGTCTCGGAGGGGACCGGCTGTGCAGACGCCATGGAGTT 

GGTGCTGGTCTTCCTCTGCAGCCTGCTGGCCCCCATGGTCCTGGCCAGTGCAGCTGAAAAGGA 

GAAGGAAATGGACCCTTTTCATTATGATTACCAGACCCTGAGGATTGGGGGACTGGTGTTCGC 

TGTGGTCCTCTTCTCGGTTGGGATCCTCCTTATCCTAAGTCGCAGGTGCAAGTGCAGTTTCAA 

TCAGAAGCCCCGGGCCCCAGGAGATGAGGAAGCCCAGGTGGAGAACCTCATCACCGCCAATGC 

AACAGAGCCCCAGAAGCAGAGAACTGAAGTGCAGCCATCAGGTGGAAGCCTCTGGAACCTGAG 

GCGGCTGCTTGAACCTTTGGATGCAAATGTCGATGCT2MGAAAACCGGCCACTTCAGCAACA 

GCCCTTTCCCCAGGAGAAGCCAAGAACTTGTGTGTCCCCCACCCTATCCCCTCTAACACCATT 

CCTCCACCTGATGATGCAACTAACACTTGCCTCCCCACTGCAGCCTGCGGTCCTGCCCACCTC 

CCGTGATGTGTGTGTGTGTGTGTGTGTGTGACTGTGTGTGTTTGCTAACTGTGGTCTTTGTGG 

CTACTTGTTTGTGGATGGTATTGTGTTTGTTAGTGAACTGTGGACTCGCTTTCCCAGGCAGGG 

GCTGAGCCACATGGCCATCTGCTCCTCCCTGCCCCCGTGGCCCTCCATCACCTTCTGCTCCTA 

GGAGGCTGCTTGTTGCCCGAGACCAGCCCCCTCCCCTGATTTAGGGATGCGTAGGGTAAGAGC 

ACGGGCAGTGGTCTTCAGTCGTCTTGGGACCTGGGAAGGTTTGCAGCACTTTGTCATCATTCT

TCATGGACTCCTTTCACTCCTTTAACAAAAACCTTGCTTCCTTATCCCACCTGATCCCAGTCT 

GAAGGTCTCTTAGCAACTGGAGATACAAAGCAAGGAGCTGGTGAGCCCAGCGTTGACGTCAGG 

CAGGCTATGCCCTTCCGTGGTTAATTTCTTCCCAGGGGCTTCCACGAGGAGTCCCCATCTGCC 

CCGCCCCTTCACAGAGCGCCCGGGGATTCCAGGCCCAGGGCTTCTACTCTGCCCCTGGGGAAT 

GTGTCCCCTGCATATCTTCTCAGCAATAACTCCATGGGCTCTGGGACCCTACCCCTTCCAACC 

TTCCCTGCTTCTGAGACTTCAATCTACAGCCCAGCTCATCCAGATGCAGACTACAGTCCCTGC 

AATTGGGTCTCTGGCAGGCAATAGTTGAAGGACTCCTGTTCCGTTGGGGCCAGCACACCGGGA 

TGGATGGAGGGAGAGCAGAGGCCTTTGCTTCTCTGCCTACGTCCCCTTAGATGGGCAGCAGAG 

GCAACTCCCGCATCCTTTGCTCTGCCTGTCGGTGGTCAGAGCGGTGAGCGAGGTGGGTTGGAG 

ACTCAGCAGGCTCCGTGCAGCCCTTGGGAACAGTGAGAGGTTGAAGGTCATAACGAGAGTGGG 

AACTCAACCCAGATCCCGCCCCTCCTGTCCTCTGTGTTCCCGCGGAAACCAACCAAACCGTGC 

GCTGTGACCCATTGCTGTTCTCTGTATCGTGATCTATCCTCAACAACAACAGAAAAAAGGAAT 
AAAATATCCTTTGTTTCCT 
FIGURE 434



MELVLVFLCSLLAPMVLASAAEKEKEMDPFHYDYQTLRIGGLVFAVVLFSVGILLILSRRCKC
SFNQKPRAPGDEEAQVENLITANATEPQKQRTEVQPSGGSLWNLRRLLEPLDANVDA 
FIGURE 435

GGTCCTTAATGGCAGCAGCCGCCGCTACCAAGATCCTTCTGTGCCTCCCGCTTCTGCTCCTGC 

TGTCCGGCTGGTCCCGGGCTGGGCGAGCCGACCCTCACTCTCTTTGCTATGACATCACCGTCA 

TCCCTAAGTTCAGACCTGGACCACGGTGGTGTGCGGTTCAAGGCCAGGTGGATGAAAAGACTT 

TTCTTCACTATGACTGTGGCAACAAGACAGTCACACCTGTCAGTCCCCTGGGGAAGAAACTAA 

ATGTCACAACGGCCTGGAAAGCACAGAACCCAGTACTGAGAGAGGTGGTGGACATACTTACAG 

AGCAACTGCGTGACATTCAGCTGGAGAATTACACACCCAAGGAACCCCTCACCCTGCAGGCAA 

GGATGTCTTGTGAGCAGAAAGCTGAAGGACACAGCAGTGGATCTTGGCAGTTCAGTTTCGATG 

GGCAGATCTTCCTCCTCTTTGACTCAGAGAAGAGAATGTGGACAACGGTTCATCCTGGAGCCA 

GAAAGATGAAAGAAAAGTGGGAGAATGACAAGGTTGTGGCCATGTCCTTCCATTACTTCTC7y\ 

TGGGAGACTGTATAGGATGGCTTGAGGACTTCTTGATGGGCATGGACAGCACCCTGGAGCCAA 

GTGCAGGAGCACCACTCGCCATGTCCTCAGGCACAACCCAACTCAGGGCCACAGCCACCACCC 

TCATCCTTTGCTGCCTCCTCATCATCCTCCCCTGCTTCATCCTCCCTGGCATCTGAGGAGAGT 

CCTTTAGAGTGACAGGTTAAAGCTGATACCAAAAGGCTCCTGTGAGCACGGTCTTGATCAAAC 

TCGCCCTTCTGTCTGGCCAGCTGCCCACGACCTACGGTGTATGTCCAGTGGCCTCCAGCAGAT 

CATGATGACATCATGGACCCAATAGCTCATTCACTGCCTTGATTCCTTTTGCCAACAATTTTA 

CCAGCAGTTATACCTAACATATTATGCAATTTTCTCTTGGTGCTACCTGATGGAATTCCTGCA 

CTTAAAGTTCTGGCTGACTAAACAAGATATATCATTTTCTTTCTTCTCTTTTTGTTTGGAAAA 

TCAAGTACTTCTTTGAATGATGATCTCTTTCTTGCAAATGATATTGTCAGTAAAATAATCACG 

TTAGACTTCAGACCTCTGGGGATTCTTTCCGTGTCCTGAAAGAGAATTTTTAAATTATTTAAT 

AAGAAAAAATTTATATTAATGATTGTTTCCTTTAGTAATTTATTGTTCTGTACTGATATTTAA 
ATAAAGAGTTCTATTTCCCAAAAAAAAAAAAAAAAAA 
FIGURE 436

MAAAAATKILLCLPLLLLLSGWSRAGRADPHSLCYDITVIPKFRPGPRWCAVQGQVDEKTFLH 
YDCGNKTVTPVSPLGKKLNVTTAWKAQNPVLREVVDILTEQLRDIQLENYTPKEPLTLQARMS
CEQKAEGHSSGSWQFSFDGQIFLLFDSEKRMWTTVHPGARKMKEKWENDKVVAMSFHYFSMGD
CIGWLEDFLMGMDSTLEPSAGAPLAMSSGTTQLRATATTLILCCLLIILPCFILPGI 
FIGURE 437

GTTCTCCTTTCCGAGCCAAAATCCCAGGCGATGGTGAATTATGAACGTGCCACACC^AAGCTCTTGTGGCAGG 

TAACTGTGCACCACCACACCTGGAATGCCATCCTGCTCCCGTTCGTCTACCTCACGGCGCAAGTGTGGATTCTGT 

GTGCAGCCATCGCTGCTGCCGCCTCAGCCGGGCCCCAGAACTGCCCCTCCGTTTGCTCGTGCAGTAACCAGTTCA 

GCAAGGTGGTGTGCACGCGCCGGGGCCTCTCCGAGGTCCCGCAGGGTATTCCCTCGAACACCCGGTACCTCAACC 

TCATGGAGAACAACATCCAGATGATCCAGGCCGACACCTTCCGCCACCTCCACCACCTGGAGGTCCTGCAGTTGG 

GCAGGAACTCCATCCGGCAGATTGAGGTGGGGGCCTTCAACGGCCTGGCCAGCCTCAACACCCTGGAGCTGTTCG 

ACAACTGGCTGACAGTCATCCCTAGCGGGGCCTTTGAATACCTGTCCAAGCTGCGGGAGCTCTGGCTTCGCAACA 

ACCCCATCGAAAGCATCCCCTCTTACGCCTTCAACCGGGTGCCCTCCCTCATGCGCCTGGACTTGGGGGAGCTCA 

AGAAGCTGGAGTATATCTCTGAGGGAGCTTTTGAGGGGCTGTTCAACCTCAAGTATCTGAACTTGGGCATGTGCA 

ACATTAAAGACATGCCCAATCTCACCCCCCTGGTGGGGCTGGAGGAGCTGGAGATGTCAGGGAACCACTTCCCTG 

AGATCAGGCCTGGCTCCTTCCATGGCCTGAGCTCCCTCAAGAAGCTCTGGGTCATGAACTCACAGGTCAGCCTGA 

TTGAGCGGAATGCTTTTGACGGGCTGGCTTCACTTGTGGAACTCAACTTGGCCCACAATAACCTCTCTTCTTTGC 

CCCATGACCTCTTTACCCCGCTGAGGTACCTGGTGGAGTTGCATCTACACCACAACCCTTGGAACTGTGATTGTG 

ACATTCTGTGGCTAGCCTGGTGGCTTCGAGAGTATATACCCACCAATTCCACCTGCTGTGGCCGCTGTCATGCTC 

CCATGCACATGCGAGGCCGCTACCTCGTGGAGGTGGACCAGGCCTCCTTCCAGTGCTCTGCCCCCTTCATCATGG 

ACGCACCTCGAGACCTCAACATTTCTGAGGGTCGGATGGCAGAACTTAAGTGTCGGACTCCCCCTATGTCCTCCG 

TGAAGTGGTTGCTGCCCAATGGGACAGTGCTCAGCCACGCCTCCCGCCACCCAAGGATCTCTGTCCTCAACGACG 

GCACCTTGAACTTTTCCCACGTGCTGCTTTCAGACACTGGGGTGTACACATGCATGGTGACCAATGTTGCAGGCA 

ACTCCAACGCCTCGGCCTACCTCAATGTGAGCACGGCTGAGCTTAACACCTCCAACTACAGCTTCTTCACCACAG 

TAACAGTGGAGACCACGGAGATCTCGCCTGAGGACACAACGCGAAAGTACAAGCCTGTTCCTACCACGTCCACTG 

GTTACCAGCCGGCATATACCACCTCTACCACGGTGCTCATTCAGACTACCCGTGTGCCCAAGCAGGTGGCAGTAC 

CCGCGACAGACACCACTGACAAGATGCAGACCAGCCTGGATGAAGTCATGAAGACCACCAAGATCATCATTGGCT 

GCTTTGTGGCAGTGACTCTGCTAGCTGCCGCCATGTTGATTGTCTTCTATAAACTTCGTAAGCGGCACCAGCAGC 

GGAGTACAGTCACAGCCGCCCGGACTGTTGAGATAATCCAGGTGGACGAAGACATCCCAGCAGCAACATCCGCAG 

CAGCAACAGCAGCTCCGTCCGGTGTATCAGGTGAGGGGGCAGTAGTGCTGCCCACAATTCATGACCATATTAACT 

ACAACACCTACAAACCAGCACATGGGGCCCACTGGACAGAAAACAGCCTGGGGAACTCTCTGCACCCCACAGTCA 

CCACTATCTCTGAACCTTATATAATTCAGACCCATACCAAGGACAAGGTACAGGAAACTCAAATATGACTCCC 

CCCCCAAAAAACTTATAAAATGCAATAGAATGCACACAAAGACAGCAACTTTTGTACAGAGTGGGGAGAGACTTT 

TTCTTGTATATGCTTATATATTAAGTCTATGGGCTGGTTAAAAAAAACAGATTATATTAAAATTTAAAGACAAAA 
AGTCAAAACA 
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FIGURE 438 

MKLLWQVTVHHHTWNAILLPFVYLTAQVWILCAAIAAAASAGPQNCPSVCSCSNQFSKVVCTR 
RGLSEVPQGIPSNTRYLNLMENNIQMIQADTFRHLHHLEVLQLGRNSIRQIEVGAFNGLASLN 
TLELFDNWLTVIPSGAFEYLSKLRELWLRNNPIESIPSYAFNRVPSLMRLDLGELKKLEYISE 
GAFEGLFNLKYLNLGMCNIKDMPNLTPLVGLEELEMSGNHFPEIRPGSFHGLSSLKKLWVMNS 
QVSLIERNAFDGLASLVELNLAHNNLSSLPHDLFTPLRYLVELHLHHNPWNCDCDILWLAWWL 
REYIPTNSTCCGRCHAPMHMRGRYLVEVDQASFQCSAPFIMDAPRDLNISEGRMAELKCRTPP 
MSSVKWLLPNGTVLSHASRHPRISVLNDGTLNFSHVLLSDTGVYTCMVTNVAGNSNASAYLNV
STAELNTSNYSFFTTVTVETTEISPEDTTRKYKPVPTTSTGYQPAYTTSTTVLIQTTRVPKQV 
AVPATDTTDKMQTSLDEVMKTTKIIIGCFVAVTLLAAAMLIVFYKLRKRHQQRSTVTAARTVE
IIQVDEDIPAATSAAATAAPSGVSGEGAVVLPTIHDHINYNTYKPAHGAHWTENSLGNSLHPT
VTTISEPYIIQTHTKDKVQETQI 
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FIGURE 439 



GTCGAATCCAAATCACTCATTGTGAAAGCTGAGCTCACAGCCGAATAAGCCACCATGAGGCTG 
TCAGTGTGTCTCCTGATGGTCTCGCTGGCCCTTTGCTGCTACCAGGCCCATGCTCTTGTCTGC 
CCAGCTGTTGCTTCTGAGATCACAGTCTTCTTATTCTTAAGTGACGCTGCGGTAAACCTCCAA 
GTTGCCAAACTTAATCCACCTCCAGAAGCTCTTGCAGCCAAGTTGGAAGTGAAGCACTGCACC 
GATCAGATATCTTTTAAGAAACGACTCTCATTGAAAAAGTCCTGGTGGAAATAGTGAAAAAAT
GTGGTGTGTGACATGTATU^TGCTCAACCTGGTTTCCAAAGTCTTTCAACGACACCCTGATC 
TTCACTAAAAATTGTAAAGGTTTCAACACGTTGCTTTAAT/U^TCACTTGCCCTGC 
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FIGURE 440 



MRLSVCLLMVSLALCCYQAHALVCPAVASEITVFLFLSDAAVNLQVAKLNPPPEALAAKLEVK 
HCTDQISFKKRLSLKKSWWK
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FIGURE 441 

GAACATTTTTAGTTCCCAAGGAATGTACATCAGCCCCACGGAAGCTAGGCCACCTCTGGGATG 

GGGTTGCTGGTTTAAAACAAACGCCAGTCATCCTATATAAGGACCTGACAGCCACCAGGCACC 

ACCTCCGCCAGGAACTGCAGGCCCACCTGTCTGCAACCCAGCTGAGGCCATGCCCTCCCCAGG

GACCGTCTGCAGCCTCCTGCTCCTCGGCATGCTCTGGCTGGACTTGGCCATGGCAGGCTCCAG 

CTTCCTGAGCCCTGAACACCAGAGAGTCCAGCAGAGAAAGGAGTCGAAGAAGCCACCAGCCAA 

GCTGCAGCCCCGAGCTCTAGCAGGCTGGCTCCGCCCGGAAGATGGAGGTCAAGCAGAAGGGGC 

AGAGGATGAACTGGAAGTCCGGTTCAACGCCCCCTTTGATGTTGGAATCAAGCTGTCAGGGGT 

TCAGTACCAGCAGCACAGCCAGGCCCTGGGGAAGTTTCTTCAGGACATCCTCTGGGAAGAGGC 

CAAAGAGGCCCCAGCCGACAAGTGATCGCCCACAAGCCTTACTCACCTCTCTCTAAGTTTAGA

AGCGCTCATCTGGCTTTTCGCTTGCTTCTGCAGCAACTCCCACGACTGTTGTACAAGCTCAGG 
AGGCGAATAAATGTTCAAACTGTA 
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FIGURE 442 

MPSPGTVCSLLLLGMLWLDLAMAGSSFLSPEHQRVQQRKESKKPPAKLQPRALAGWLRPEDGG 
QAEGAEDELEVRFNAPFDVGIKLSGVQYQQHSQALGKFLQDILWEEAKEAPADK
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FIGURE 443 

CGGCCACAGCTGGCATGCTCTGCCTGATCGCCATCCTGCTGTATGTCCTCGTCCAGTACCTCG 

TGAACCCCGGGGTGCTCCGCACGGACCCCAGATGTCAAGAATATGAACACGTGGCTGCTGTTC

CTCCCCCTGTTCCCGGTGCAGGTGCAGACCCTGATAGTCGTGATCATCGGGATGCTCGTGCTC 

CTGCTGGACTTTCTTGGCTTGGTGCACCTGGGCCAGCTGCTCATCTTCCACATCTACCTGAGT 

ATGTCCCCCACCCTAAGCCCCCGATCCCCCCAAGGCTGGGTGGTCAGAGCTGCTCATCTTACA 

CCTCTACTTGAGTATGTCCCTAACCCTGAGCCCCCCACGCCTGGGGCCAGAGTCTTTGTCCCC 

CGTGTGCGCATGTGTTCAGGGTCAGCCTCTCCCAGAAGTGAGATCATGGACAAAAAGGGCAAA 

TCACAGGAAGAAATTAAATCCATGAGGACCCAGCAGGCCCAGCAAGAAGCTGAACTCACGCCG 

AGACCTGCAGGAGTGGTGCCAGGTGCTT^GTAACAAGTTTAAAATGTTCAGAGACAATGGA 

ATGGAATCTATTAGGCAAGAACAGGACATTATGAAATAAGGACAGGTGGACTTCCAAAAACAC 

AAGTAGAAATTCTAACAATGAAATATATTACAGGCAGGTCACCCACTAACCAAACAACTGAAG 

CGAGAGCTGTGGTCTTGCTTGGTCTCACAGTGGGCACAGCGGTAGGCGGTCAGTCATGTTGCT 

GAACGACGGAGGGTAAACTCCCCAGCCCCAAGAAAACCTGTGTTGGAAGTAACAACAACCTCC 

CTGCTCCTGGCACCAGCCGTTTTGGTCATGGTGGGCCAGCTGCAAAGCGTCTTCCATTCTCTG 

GGCAGTGGTGGCCCCGAGGCTGTGGCCTCTCAGGGGGTTTCTGTGGACACGGGCAGCAGAGTG 

TGTCCAGGCCAGCCCCCAAGAATGCCCTGCTCCTGACAGCTTGGCCAACCCCTGGTCAGGGCA 

GAGGGAGTTGGGTGGGTCAGGCTCTGGGCTCACCTCCATCTCCAGAGCATCCCCTGCCTGCAG 

TTGTGGCAAGAACGCCCAGCTCAGAATGAACACACCCCACCAAGAGCCTCCTTGTTCATAACC 

ACAGGTTACCCTACAAACCACTGTCCCCACACAACCCTGGGGATGTTTTAAAACACACACCTC 

TAACGCATATCTTACAGTCACTGTTGTCTTGCCTGAGGGTTGAATTTTTTTTAATGAAAGTGC 

AATGAAAATCACTGGATTAAATCCTACGGACACAGAGCTGAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAA 
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FIGURE 444 

MNTWLLFLPLFPVQVQTLIVVIIGMLVLLLDFLGLVHLGQLLIFHIYLSMSPTLSPRSPQGWV 
VRAAHLTPLLEYVPNPEPPTPGARVFVPRVRMCSGSASPRSEIMDKKGKSQEEIKSMRTQQAQ 
QEAELTPRPAGVVPGA
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FIGURE 445 



AGGCGGGCAGCAGCTGCAGGCTGACCTTGCAGCTTGGCGGAATGGACTGGCCTCACAACCTGC
TGTTTCTTCTTACCATTTCCATCTTCCTGGGGCTGGGCCAGCCCAGGAGCCCCAAAAGCAAGA 
GGAAGGGGCAAGGGCGGCCTGGGCCCCTGGCCCCTGGCCCTCACCAGGTGCCACTGGACCTGG 
TGTCACGGATGAAACCGTATGCCCGCATGGAGGAGTATGAGAGGAACATCGAGGAGATGGTGG 
CCCAGCTGAGGAACAGCTCAGAGCTGGCCCAGAGAAAGTGTGAGGTCAACTTGCAGCTGTGGA 
TGTCCAACAAGAGGAGCCTGTCTCCCTGGGGCTACAGCATCAACCACGACCCCAGCCGTATCC 
CCGTGGACCTGCCGGAGGCACGGTGCCTGTGTCTGGGCTGTGTGAACCCCTTCACCATGCAGG 
AGGACCGCAGCATGGTGAGCGTGCCGGTGTTCAGCCAGGTTCCTGTGCGCCGCCGCCTCTGCC 
CGCCACCGCCCCGCACAGGGCCTTGCCGCCAGCGCGCAGTCATGGAGACCATCGCTGTGGGCT 
GCACCTGCATCTTCTGAATCACCTGGCCCAGAAGCCAGGCCAGCAGCCCGAGACCATCCTCCT 
TGCACCTTTGTGCCAAGAAAGGCCTATGAAAAGTAAACACTGACTTTTGAAAGCAAG 




FIGURE 446 

MDWPHNLLFLLTISIFLGLGQPRSPKSKRKGQGRPGPLAPGPHQVPLDLVSRMKPYARMEEYE 
RNIEEMVAQLRNSSELAQRKCEVNLQLWMSNKRSLSPWGYSINHDPSRIPVDLPEARCLCLGC 
VNPFTMQEDRSMVSVPVFSQVPVRRRLCPPPPRTGPCRQRAVMETIAVGCTCIF 

Important features:
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 75-78 



Homologous region to IL-17 

amino acids 96-180. 
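
The N-glycosylation position listed above can be cross-checked against the Figure 446 sequence. The following Python sketch is illustrative only and is not part of the original disclosure. It assumes the commonly used N-{P}-[ST]-{P} consensus pattern for N-glycosylation sites, and the names `FIG446` and `sequon_spans` are hypothetical.

```python
import re

# Figure 446 polypeptide, joined from the three lines printed above.
FIG446 = (
    "MDWPHNLLFLLTISIFLGLGQPRSPKSKRKGQGRPGPLAPGPHQVPLDLVSRMKPYARMEEYE"
    "RNIEEMVAQLRNSSELAQRKCEVNLQLWMSNKRSLSPWGYSINHDPSRIPVDLPEARCLCLGC"
    "VNPFTMQEDRSMVSVPVFSQVPVRRRLCPPPPRTGPCRQRAVMETIAVGCTCIF"
)

# Assumed consensus: Asn, any residue except Pro, Ser or Thr, any residue
# except Pro. The lookahead lets overlapping matches be reported as well.
SEQUON = re.compile(r"(?=N[^P][ST][^P])")


def sequon_spans(protein):
    """Return 1-based (start, end) spans of every consensus match."""
    return [(m.start() + 1, m.start() + 4) for m in SEQUON.finditer(protein)]


if __name__ == "__main__":
    print(len(FIG446))            # total residues in the joined sequence
    print(sequon_spans(FIG446))   # spans to compare with the listed site
```

Run against the sequence as printed, the scan reports a single span, residues 75-78 (NSSE), which agrees with the listed N-glycosylation site, and the joined length of 180 residues agrees with the upper bound of the listed IL-17 homologous region.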
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FIGURE 447 



GGAGTGCAGATGGCATCCTTCGGTTCTTCCAGACAAGCTGCAAGACGCTGACCATGGCCAAGA
TGGAGCTCTCGAAGGCCTTCTCTGGCCAGCGGACACTCCTATCTGCCATCCTCAGCATGCTAT 
CACTCAGCTTCTCCACAACATCCCTGCTCAGCAACTACTGGTTTGTGGGCACACAGAAGGTGC 
CCAAGCCCCTGTGCGAGAAAGGTCTGGCAGCCAAGTGCTTTGACATGCCAGTGTCCCTGGATG 
GAGATACCAACACATCCACCCAGGAGGTGGTACAATACAACTGGGAGACTGGGGATGACCGGT 
TCTCCTTCCGGAGCTTCCGGAGTGGCATGTGGCTATCCTGTGAGGAAACTGTGGAAGAACCAG 
GGGAGAGGTGCCGAAGTTTCATTGAACTTACACCACCAGCCAAGAGAGGTGAGAAAGGACTAC 
TGGAATTTGCCACGTTGCAAGGCCCATGTCACCCCACTCTCCGATTTGGAGGGAAGCGGTTGA 
TGGAGAAGGCTTCCCTCCCCTCCCCTCCCTTGGGGCTTTGTGGCAAAAATCCTATGGTTATCC 
CTGGGAACGCAGATCACCTACATCGGACTTCAATTCATCAGCTTCCTCCTGCTACTAACAGAC 
TTGCTACTCACTGGGAACCCTGCCTGTGGGCTCAAACTGAGCGCCTTTGCTGCTGTTTCCTCT 
GTCCTGTCAGGTCTCCTGGGGATGGTGGCCCACATGATGTATTCACAAGTCTTCCAAGCGACT 
GTCAACTTGGGTCCAGAAGACTGGAGACCACATGTTTGGAATTATGGCTGGGCCTTCTACATG 
GCCTGGCTCTCCTTCACCTGCTGCATGGCGTCGGCTGTCACCACCTTCAACACGTACACCAGG 
ATGGTGCTGGAGTTCAAGTGCAAGCATaSTAAGAGCTTCAAGGAAAACCCGAACTGCCTACCA 
CATCACCATCAGTGTTTCCCTCGGCGGCTGTCAAGTGCAGCCCCCACCGTGGGTCCTTTGACC 
AGCTACCACCAGTATCATAATCAGCCCATCCACTCTGTCTCTGAGGGAGTCGACTTCTACTCC 
GAGCTGCGGTACAAGGGATTTCAAAGAGGGGCCAGCCAGGAGCTGAAAGAAGCAGTTAGGTCA
TCTGTAGAGGAAGAGCAGTGTTAGGAGTTAAGCGGGTTTGGGGAGTAGGCTTGAGCCCTACCT 
TACACGTCTGCTGATTATCAACATGTGCTTAAGCCAACATCCGTCTCTTGAGCATGGTTTTTA 
GAGGCTACGAATAAGGCTATGAATAAGGGTTATCTTTAAGTCCTAAGGGATTCCTGGGTGCCA 
CTGCTCTCTTTTCCTCTACAGCTCCATCTTGTTTCACCCACCCCACATCTCACACATCCAG7UV 
TTCCCTTCTTTACTGATAGTTTCTGTGCCAGGTTCTGGGCTAAACCATGGAGATAAAAAGAAG 
AGTAAAATACACTTCCCGACCTTAAGGATCTGAAA 
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FIGURE 448 

MAKMELSKAFSGQRTLLSAILSMLSLSFSTTSLLSNYWFVGTQKVPKPLCEKGLAAKCFDMPV 
SLDGDTNTSTQEVVQYNWETGDDRFSFRSFRSGMWLSCEETVEEPGERCRSFIELTPPAKRGE
KGLLEFATLQGPCHPTLRFGGKRLMEKASLPSPPLGLCGKNPMVIPGNADHLHRTSIHQLPPA 
TNRLATHWEPCLWAQTERLCCCFLCPVRSPGDGGPHDVFTSLPSDCQLGSRRLETTCLELWLG 
LLHGLALLHLLHGVGCHHLQHVHQDGAGVQVQA 
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FIGURE 449 

CCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCC 

ACGCGTCCGGTGCAAGCTCGCGCCGCACACTGCCTGGTGGAGGGAAGGAGCCCGGGCGCCTCTCGCCGCTCCCCG 

CGCCGCCGTCCGCACCTCCCCACCGCCCGCCGCCCGCCGCCCGCCGCCCGCAAAGCATGAGTGAGCCCGCTCTCT 

GCAGCTGCCCGGGGCGCGAATGGCAGGCTGTTTCCGCGGAGTAAAAGGTGGCGCCGGTCAGTGGTCGTTTCCAAT 

GACGGACATTAACCAGACTGTCAGATCCTGGGGAGTCGCGAGCCCCGAGTTTGGAGTTTTTTCCCCCCACAACGT 

CACAGTCCGAACTGCAGAGGGAAAGGAAGGCGGCAGGAAGGCGAAGCTCGGGCTCCGGCACGTAGTTGGGAAACT 

TGCGGGTCCTAGAAGTCGCCTCCCCGCCTTGCCGGCCGCCCTTGCAGCCCCGAGCCGAGCAGCAAAGTGAGACAT 

TGTGCGCCTGCCAGATCCGCCGGCCGCGGACCGGGGCTGCCTCGGAAACACAGAGGGGTCTTCTCTCGCCCTGCA 

TATAATTAGCCTGCACACAAAGGGAGCAGCTGAATGGAGGTTGTCACTCTCTGGAAAAGGATTTCTGACCGAGCG 

CTTCCAATGGACATTCTCCAGTCTCTCTGGAAAGATTCTCGCTAATGGATTTCCTGCTGCTCGGTCTCTGTCTAT 

ACTGGCTGCTGAGGAGGCCCTCGGGGGTGGTCTTGTGTCTGCTGGGGGCCTGCTTTCAGATGCTGCCCGCCGCCC 

CCAGCGGGTGCCCGCAGCTGTGCCGGTGCGAGGGGCGGCTGCTGTACTGCGAGGCGCTCAACCTCACCGAGGCGC 

CCCACAACCTGTCCGGCCTGCTGGGCTTGTCCCTGCGCTACAACAGCCTCTCGGAGCTGCGCGCCGGCCAGTTCA 

CGGGGTTAATGCAGCTCACGTGGCTCTATCTGGATCACAATCACATCTGCTCCGTGCAGGGGGACGCCTTTCAGA 

AACTGCGCCGAGTTAAGGAACTCACGCTGAGTTCCAACCAGATCACCCAACTGCCCAACACCACCTTCCGGCCCA 

TGCCCAACCTGCGCAGCGTGGACCTCTCGTACAACAAGCTGCAGGCGCTCGCGCCCGACCTCTTCCACGGGCTGC 

GGAAGCTCACCACGCTGCATATGCGGGCCAACGCCATCCAGTTTGTGCCCGTGCGCATCTTCCAGGACTGCCGCA 

GCCTCAAGTTTCTCGACATCGGATACAATCAGCTCAAGAGTCTGGCGCGCAACTCTTTCGCCGGCTTGTTTAAGC 

TCACCGAGCTGCACCTCGAGCACAACGACTTGGTCAAGGTGAACTTCGCCCACTTCCCGCGCCTCATCTCCCTGC 

ACTCGCTCTGCCTGCGGAGGAACAAGGTGGCCATTGTGGTCAGCTCGCTGGACTGGGTTTGGAACCTGGAGAAAA 

TGGACTTGTCGGGCAACGAGATCGAGTACATGGAGCCCCATGTGTTCGAGACCGTGCCGCACCTGCAGTCCCTGC 

AGCTGGACTCCAACCGCCTCACCTACATCGAGCCCCGGATCCTCAACTCTTGGAAGTCCCTGACAAGCATCACCC 

TGGCCGGGAACCTGTGGGATTGCGGGCGCAACGTGTGTGCCCTAGCCTCGTGGCTCAGCAACTTCCAGGGGCGCT 

ACGATGGCAACTTGCAGTGCGCCAGCCCGGAGTACGCACAGGGCGAGGACGTCCTGGACGCCGTGTACGCCTTCC 

ACCTGTGCGAGGATGGGGCCGAGCCCACCAGCGGCCACCTGCTCTCGGCCGTCACCAACCGCAGTGATCTGGGGC 

CCCCTGCCAGCTCGGCCACCACGCTCGCGGACGGCGGGGAGGGGCAGCACGACGGCACATTCGAGCCTGCCACCG 

TGGCTCTTCCAGGCGGCGAGCACGCCGAGAACGCCGTGCAGATCCACAAGGTGGTCACGGGCACCATGGCCCTCA

TCTTCTCCTTCCTCATCGTGGTCCTGGTGCTCTACGTGTCCTGGAAGTGTTTCCCAGCCAGCCTCAGGCAGCTCA 

GACAGTGCTTTGTCACGCAGCGCAGGAAGCAAAAGCAGAAACAGACCATGCATCAGATGGCTGCCATGTCTGCCC 

AGGAATACTACGTTGATTACAAACCGAACCACATTGAGGGAGCCCTGGTGATCATCAACGAGTATGGCTCGTGTA 

CCTGCCACCAGCAGCCCGCGAGGGAATGCGAGGTGTGATTGTCCCAGTGGCTCTCAACCCATGCGCTACCAAATA 

CGCCTGGGCAGCCGGGACGGGCCGGCGGGCACCAGGCTGGGGTCTCCTTGTCTGTGCTCTGATATGCTCCTTGAC 

TGAAACTTTAAGGGGATCTCTCCCAGAGACTTGACATTTTAGCTTTATTGTGTCTTAAAAACAAAAGCGAATTAA 

AACACAACAAAAAACCCCACCCCACAACCTTCAGGACAGTCTATCTTAAATTTCATATGAGAACTCCTTCCTCCC 

TTTGAAGATCTGTCCATATTCAGGAATCTGAGAGTGTAAAAAAGGTGGCCATAAGACAGAGAGAGAATAATCGTG 

CTTTGTTTTATGCTACTCCTCCCACCCTGCCCATGATTAAACATCATGTATGTAGAAGATCTTAAGTCCATACGC 

ATTTCATGAAGAACCATTGGAAAGAGGAATCTGCAATCTGGGAGCTTAAGAGCAAATGATGACCATAGAAAGCTA 

TGTTCTTACTTTGTGTGTGTGTCTGTATGTTTCTGCGTTGTGTGTCTTTGTAGGCAAGCAAACGTTGTCTACACA 

AACGGGAATTTAGCTCACATCATTTCATGCCCCTGTGCCTCTAGCTCTGGAGATTGGTGGGGGGAGGTGGGGGGA 

AACGGCAGGAATAAGGGAAAGTGGTAGTTTTAACTAAGGTTTTGTAACACTTGAAATCTTTTCTTTCTCAAATTA 

ATTATCTTTAAGCTTCAAGAAACTTGCTCTGACCCCTCTAAGCAAACTACTAAGCATTTAAAAGAGAATCTAATT 

TTTAAAGGTGTAGCACCTTTTTTTTTATTCTTCCCACAGAGGGTGCTAATCTCATTATGCTGTGCTATCTGAAAA 

GAACTTAAGGCCACAATTCAGGTCTCGTCCTGGGCATTGTGATGGATTGACCCTCCATTTGCAGTACCTTCCCAG 

CTGATTAAAGTTCAGCAGTGGTATTGAGGTTTTTCGAATATTTATATAGAAAAAAAGTCTTTTCACATGACAAAT 

GACACTCTCACACCAGTCTTAGCCCTAGTAGTTTTTTAGGTTGGACCAGAGGAAGCAGGTTAAATGAGACCTGTC 

CTCTGCTGCACTCAGAAAAAATAGGCAGTCCCTGATGCTCAGATCTTAGCCTTGATATTAATAGTTGAGACCACC 

TACCCACAATGCAGCCTATACTCCCAAGACTACAAAGTTACCATCGCAAAGGAAAGGTTATTCCAGTAAAAGGAA 

ATAGTTTTCTCAACCATTTAAAAATATTCTTCTGAACTCATCAAAGTAGAAGAGCCCCCAACCTTTTCTCTCTGC 

CTTCAAGAAGGCAGACATTTGGTATGATTTAGCATCAACAACACATTTATGAGTATATGTAAGTAATCAGAGGGG 

CAZ^TGCCACTTGTTATTCCTCCCAAGTTTTCCAAGCAAGTACACACAGATCTCTGGTAGGATTAGGGGCCACTT 

GTGTTTCCGGCTTATTTTAGTCGACTTGTCAGCAAGTTTGATGCCTAGTCTATCTGACATGGCCCAGTAGAACAG 

GGCATTGATGGATCACATGAGATGGTAGAAGGAACATCATCACATACCCCTCTCACAGAGAAAATTATCAAAGAA 

CCAGAAATTATATCTGTTTTGGAGCAAGAGTGTCATAATGTTTCAGGGTAGTCAAAATAAACATAAATTATCTCC 

TCTAGATGAGTGGCGATGTTGGCTGATTTGGGTCTGCCATTGACAGAATGTCAAATAAAAAGGAATTAGCTAGAA 

TATGACCATTAAATGTGCTTCTGAAATATATTTTGAGATAGGTTTAGAATGTCA 
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FIGURE 450 

MDFLLLGLCLYWLLRRPSGVVLCLLGACFQMLPAAPSGCPQLCRCEGRLLYCEALNLTEAPHN 
LSGLLGLSLRYNSLSELRAGQFTGLMQLTWLYLDHNHICSVQGDAFQKLRRVKELTLSSNQIT 
QLPNTTFRPMPNLRSVDLSYNKLQALAPDLFHGLRKLTTLHMRANAIQFVPVRIFQDCRSLKF 
LDIGYNQLKSLARNSFAGLFKLTELHLEHNDLVKVNFAHFPRLISLHSLCLRRNKVAIVVSSL 
DWVWNLEKMDLSGNEIEYMEPHVFETVPHLQSLQLDSNRLTYIEPRILNSWKSLTSITLAGNL 
WDCGRNVCALASWLSNFQGRYDGNLQCASPEYAQGEDVLDAVYAFHLCEDGAEPTSGHLLSAV 
TNRSDLGPPASSATTLADGGEGQHDGTFEPATVALPGGEHAENAVQIHKVVTGTMALIFSFLI 
VVLVLYVSWKCFPASLRQLRQCFVTQRRKQKQKQTMHQMAAMSAQEYYVDYKPNHIEGALVII
NEYGSCTCHQQPARECEV 



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FIGURE 451 



TTGAGCGCAGGTGAGCTCCTGCGCGTTCCGGGGGCGTTCCTCCAGTCACCCTCCCGCCGTTAC 

CCGCGGCGCGCCCGAGGGAGTCTCCTCCAGACCCTCCCTCCCGTTGCTCCAAACTAATACGGA 

CTGAACGGATCGCTGCGAGGGTGGGAGAGAAAATTAGGGGGAGAAAGGACAGAGAGAGCAACT 

ACCATCCATAGCCAGATAGATTATCTTACACTGAACTGATCAAGTACTTTGAAAATGACTTCG

AAATTTATCTTGGTGTCCTTCATACTTGCTGCACTGAGTCTTTCAACCACCTTTTCTCTCCAA 

CTAGACCAGCAAAAGGTTCTACTAGTTTCTTTTGATGGATTCCGTTGGGATTACTTATATAAA 

GTTCCAACGCCCCATTTTCATTATATTATGAAATATGGTGTTCACGTGAAGCAAGTTACTAAT 

GTTTTTATTACAAAAACCTACCCTAACCATTATACTTTGGTAACTGGCCTCTTTGCAGAGAAT 

CATGGGATTGTTGCAAATGATATGTTTGATCCTATTCGGAACAAATCTTTCTCCTTGGATCAC 

ATGAATATTTATGATTCCAAGTTTTGGGAAGAAGCGACACCAATATGGATCACAAACCAGAGG 

GCAGGACATACTAGTGGTGCAGCCATGTGGCCCGGAACAGATGTAAAAATACATAAGCGCTTT 

CCTACTCATTACATGCCTTACAATGAGTCAGTTTCATTTGAAGATAGAGTTGCCAAAATTGTT 

GAATGGTTTACGTCAAAAGAGCCCATAAATCTTGGTCTTCTCTATTGGGAAGACCCTGATGAC 

ATGGGCCACCATTTGGGACCTGACAGTCCGCTCATGGGGCCTGTCATTTCAGATATTGACAAG 

AAGTTAGGATATCTCATACAAATGCTGAAAAAGGCAAAGTTGTGGAACACTCTGAACCTAATC 

ATCACAAGTGATCATGGAATGACGCAGTGCTCTGAGGAAAGGTTAATAGAACTTGACCAGTAC 

CTGGATAAAGACCACTATACCCTGATTGATCAATCTCCAGTAGCAGCCATCTTGCCAAAAGAA 

GGTAAATTTGATGAAGTCTATGAAGCACTAACTCACGCTCATCCTAATCTTACTGTTTACAAA 

AAAGAAGACGTTCCAGAAAGGTGGCATTACAAATACAACAGTCGAATTCAACCAATCATAGCA 

GTGGCTGATGAAGGGTGGCACATTTTACAGAATAAGTCAGATGACTTTCTGTTAGGCAACCAC 

GGTTACGATAATGCGTTAGCAGATATGCATCCAATATTTTTAGCCCATGGTCCTGCCTTCAGA 

AAGAATTTCTCAAAAGAAGCCATGAACTCCACAGATTTGTACCCACTACTATGCCACCTCCTC 

AATATCACTGCCATGCCACACAATGGATCATTCTGGAATGTCCAGGATCTGCTCAATTCAGCA 

ATGCCAAGGGTGGTCCCTTATACACAGAGTACTATACTCCTCCCTGGTAGTGTTAAACCAGCA 

GAATATGACCAAGAGGGGTCATACGCTTATTTCATAGGGGTCTCTCTTGGCAGCATTATAGTG 

ATTGTATTTTTTGTAATTTTCATTAAGCATTTAATTCACAGTCAAATACCTGCCTTACAAGAT 

ATGCATGCTGAAATAGCTCAACCATTATTACAAGCCTAATGTTACTTTGAAGTGGATTTGCAT 

ATTGAAGTGGAGATTCCATAATTATGTCAGTGTTTAAAGGTTTCAAATTCTGGGAAACCAGTT 

CCAAACATCTGCAGAAACCATTAAGCAGTTACATATTTAGGTATACACACACACACACACACA 

CACATACACACACACGGACCAAAATACTTACACCTGCAAAGGAATAAAGATGTGAGAGTATGT 

CTCCATTGTTCACTGTAGCATAGGGATAGATAAGATCCTGCTTTATTTGGACTTGGCGCAGAT 

AATGTATATATTTAGCAACTTTGCACTATGTAAAGTACCTTATATATTGCACTTTAAATTTCT 

CTCCTGATGGGTACTTTAATTTGAAATGCACTTTATGGACAGTTATGTCTTATAACTTGATTG 

AAAATGACAACTTTTTGCACCCATGTCACAGAATACTTGTTACGCATTGTTCAAACTGAAGGA 

AATTTCTAATAATCCCGAATAATGAACATAGAAATCTATCTCCATAAATTGAGAGAAGAAGAA 

GGTGATAAGTGTTGAAAATTAAATGTGATAACCTTTGAACCTTGAATTTTGGAGATGTATTCC 

CAACAGCAGAATGCAACTGTGGGCATTTCTTGTCTTATTTCTTTCCAGAGAACGTGGTTTTCA 

TTTATTTTTCCCTCAAT^GAGAGTCAAATACTGACAGATTCGTTCTAAATATATTGTTTCTGT 

CATAAAATTATTGTGATTTCCTGATGAGTCATATTACTGTGATTTTCATAATAATGAAGACAC 

CATGAATATACTTTTCTTCTATATAGTTCAGCAATGGCCTGAATAGAAGCAACCAGGCACCAT 

CTCAGCAATGTTTTCTCTTGTTTGTAATTATTTGCTCCTTTGAAAATTAAATCACTATTAATT 

ACATTAAAAATCAAATTGGATAAAAAAAAAAAAAAAAAAA 
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FIGURE 452 



MTSKFILVSFILAALSLSTTFSLQLDQQKVLLVSFDGFRWDYLYKVPTPHFHYIMKYGVHVKQ 
VTNVFITKTYPNHYTLVTGLFAENHGIVANDMFDPIRNKSFSLDHMNIYDSKFWEEATPIWIT 
NQRAGHTSGAAMWPGTDVKIHKRFPTHYMPYNESVSFEDRVAKIVEWFTSKEPINLGLLYWED 
PDDMGHHLGPDSPLMGPVISDIDKKLGYLIQMLKKAKLWNTLNLIITSDHGMTQCSEERLIEL 
DQYLDKDHYTLIDQSPVAAILPKEGKFDEVYEALTHAHPNLTVYKKEDVPERWHYKYNSRIQP 
IIAVADEGWHILQNKSDDFLLGNHGYDNALADMHPIFLAHGPAFRKNFSKEAMNSTDLYPLLC 
HLLNITAMPHNGSFWNVQDLLNSAMPRVVPYTQSTILLPGSVKPAEYDQEGSYPYFIGVSLGS
IIVIVFFVIFIKHLIHSQIPALQDMHAEIAQPLLQA 

Important features:
Signal Peptide: 

amino acids 1-22 

Transmembrane Domain: 

amino acids 429-452 

N-glycosylation sites:

amino acids 101-104, 158-161, 292-295, 329-332, 362-365, 369-372, 
382-385, 389-392 

Somatomedin B Domain: 

amino acids 69-85 



Sulfatase protein Region:

amino acids 212-241 
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FIGURE 453 



GGCCGCCTGGAATTGTGGGAGTTGTGTCTGCCACTCGGCTGGCGGAGGCCGAAGGTCCGTGAC 

TATGGCTCCCCAGAGCCTGCCTTCATCTAGGATGGCTCCTCTGGGCATGCTGCTTGGGCTGCT

GATGGCCGCCTGCTTCACCTTCTGCCTCAGTCATCAGAACCTGAAGGAGTTTGCCCTGACCAA 

CCCAGAGAAGAGCAGCACCAAAGAAACGGAGAGAAAAGAAACCAAAGCCGAGGAGGAGCTGGA

TGCCGAAGTCCTGGAGGTGTTCCACCCGACGCATGAGTGGCAGGCCCTTCAGCCAGGGCAGGC 

TGTCCCTGCAGGATCCCACGTACGGCTGAATCTTCAGACTGGGGAAAGAGAGGCAAAACTCCA 

ATATGAGGACAAGTTCCGAAATAATTTGAAAGGCAAAAGGCTGGATATCAACACCAACACCTA 

CACATCTCAGGATCTCAAGAGTGCACTGGCAAAATTCAAGGAGGGGGCAGAGATGGAGAGTTC 

AAAGGAAGACAAGGCAAGGCAGGCTGAGGTAAAGCGGCTCTTCCGCCCCATTGAGGAACTGAA 

GAAAGACTTTGATGAGCTGAATGTTGTCATTGAGACTGACATGCAGATCATGGTACGGCTGAT 

CAACAAGTTCAATAGTTCCAGCTCCAGTTTGGAAGAGAAGATTGCTGCGCTCTTTGATCTTGA 

ATATTATGTCCATCAGATGGACAATGCGCAGGACCTGCTTTCCTTTGGTGGTCTTCAAGTGGT 

GATCAATGGGCTGAACAGCACAGAGCCCCTCGTGAAGGAGTATGCTGCGTTTGTGCTGGGCGC 

TGCCTTTTCCAGCAACCCCAAGGTCCAGGTGGAGGCCATCGAAGGGGGAGCCCTGCAGAAGCT 

GCTGGTCATCCTGGCCACGGAGCAGCCGCTCACTGCAAAGAAGAAGGTCCTGTTTGCACTGTG

CTCCCTGCTGCGCCACTTCCCCTATGCCCAGCGGCAGTTCCTGAAGCTCGGGGGGCTGCAGGT 

CCTGAGGACCCTGGTGCAGGAGAAGGGCACGGAGGTGCTCGCCGTGCGCGTGGTCACACTGCT 

CTACGACCTGGTCACGGAGAAGATGTTCGCCGAGGAGGAGGCTGAGCTGACCCAGGAGATGTC 

CCCAGAGAAGCTGCAGCAGTATCGCCAGGTACACCTCCTGCCAGGCCTGTGGGAACAGGGCTG 

GTGCGAGATCACGGCCCACCTCCTGGCGCTGCCCGAGCATGATGCCCGTGAGAAGGTGCTGCA 

GACACTGGGCGTCCTCCTGACCACCTGCCGGGACCGCTACCGTCAGGACCCCCAGCTCGGCAG 

GACACTGGCCAGCCTGCAGGCTGAGTACCAGGTGCTGGCCAGCCTGGAGCTGCAGGATGGTGA 

GGACGAGGGCTACTTCCAGGAGCTGCTGGGCTCTGTCAACAGCTTGCTGAAGGAGCTGAGATG 

AGGCCCCACACCAGGACTGGACTGGGATGCCGCTAGTGAGGCTGAGGGGTGCCAGCGTGGGTG 

GGCTTCTCAGGCAGGAGGACATCTTGGCAGTGCTGGCTTGGCCATTAAATGGAAACCTGAAGG 

CCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA/^AAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



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FIGURE 454 



MAPQSLPSSRMAPLGMLLGLLMAACFTFCLSHQNLKEFALTNPEKSSTKETERKETKAEEELD 
AEVLEVFHPTHEWQALQPGQAVPAGSHVRLNLQTGEREAKLQYEDKFRNNLKGKRLDINTNTY 
TSQDLKSALAKFKEGAEMESSKEDKARQAEVKRLFRPIEELKKDFDELNVVIETDMQIMVRLI 
NKFNSSSSSLEEKIAALFDLEYYVHQMDNAQDLLSFGGLQVVINGLNSTEPLVKEYAAFVLGA 
AFSSNPKVQVEAIEGGALQKLLVILATEQPLTAKKKVLFALCSLLRHFPYAQRQFLKLGGLQV 
LRTLVQEKGTEVLAVRVVTLLYDLVTEKMFAEEEAELTQEMSPEKLQQYRQVHLLPGLWEQGW 
CEITAHLLALPEHDAREKVLQTLGVLLTTCRDRYRQDPQLGRTLASLQAEYQVLASLELQDGE 
DEGYFQELLGSVNSLLKELR 

Important features:
Signal peptide: 

amino acids 1-29 

Hypothetical YJL126w/YLR351c/yhcX family protein. 

amino acids 364-373 

N-glycosylation site. 

amino acids 193-197, 236-240 

N-myristoylation site. 

amino acids 15-21, 19-25, 234-240, 251-257, 402-408, 451-457 

Homologous region to SLS1 protein.

amino acids 68-340 
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FIGURE 455 

GCCCCAGGGAGCAGTGGGTGGTTATAACTCAGGCCCGGTGCCCAGAGCCCAGGAGGAGGCAGT 

GGCCAGGAAGGCACAGGCCTGAGAAGTCTGCGGCTGAGCTGGGAGCAAATCCCCCACCCCCTA 

CCTGGGGGACAGGGCAAGTGAGACCTGGTGAGGGTGGCTCAGCAGGCAGGGAAGGAGAGGTGT 

CTGTGCGTCCTGCACCCACATCTTTCTCTGTCCCCTCCTTGCCCTGTCTGGAGGCTGCTAGAC 

TCCTATCTTCTGAATTCTATAGTGCCTGGGTCTCAGCGCAGTGCCGATGGTGGCCCGTCCTTG 

TGGTTCCTCTCTACCTGGGGAAATAAGGTGCAGCGGCCATGGCTACAGCAAGACCCCCCTGGA

TGTGGGTGCTCTGTGCTCTGATCACAGCCTTGCTTCTGGGGGTCACAGAGCATGTTCTCGCCA 

ACAATGATGTTTCCTGTGACCACCCCTCTAACACCGTGCCCTCTGGGAGCAACCAGGACCTGG 

GAGCTGGGGCCGGGGAAGACGCCCGGTCGGATGACAGCAGCAGCCGCATCATCAATGGATCCG 

ACTGCGATATGCACACCCAGCCGTGGCAGGCCGCGCTGTTGCTAAGGCCCAACCAGCTCTACT 

GCGGGGCGGTGTTGGTGCATCCACAGTGGCTGCTCACGGCCGCCCACTGCAGGAAGAAAGTTT 

TCAGAGTCCGTCTCGGCCACTACTCCCTGTCACCAGTTTATGAATCTGGGCAGCAGATGTTCC 

AGGGGGTCAAATCCATCCCCCACCCTGGCTACTCCCACCCTGGCCACTCTAACGACCTCATGC 

TCATCAAACTGAACAGAAGAATTCGTCCCACTAAAGATGTCAGACCCATCAACGTCTCCTCTC

ATTGTCCCTCTGCTGGGACAAAGTGCTTGGTGTCTGGCTGGGGGACAACCAAGAGCCCCCAAG 

TGCACTTCCCTAAGGTCCTCCAGTGCTTGAATATCAGCGTGCTAAGTCAGAAAAGGTGCGAGG 

ATGCTTACCCGAGACAGATAGATGACACCATGTTCTGCGCCGGTGACAAAGCAGGTAGAGACT 

CCTGCCAGGGTGATTCTGGGGGGCCTGTGGTCTGCAATGGCTCCCTGCAGGGACTCGTGTCCT 

GGGGAGATTACCCTTGTGCCCGGCCCAACAGACCGGGTGTCTACACGAACCTCTGCAAGTTCA 

CCAAGTGGATCCAGGAAACCATCCAGGCCAACTCCTGAGTCATCCCAGGACTCAGCACACCGG 

CATCCCCACCTGCTGCAGGGACAGCCCTGACACTCCTTTCAGACCCTCATTCCTTCCCAGAGA 

TGTTGAGAATGTTCATCTCTCCAGCCCCTGACCCCATGTCTCCTGGACTCAGGGTCTGCTTCC 

CCCACATTGGGCTGACCGTGTCTCTCTAGTTGAACCCTGGGAACAATTTCCAAAACTGTCCAG 

GGCGGGGGTTGCGTCTCAATCTCCCTGGGGCACTTTCATCCTCAAGCTCAGGGCCCATCCCTT 

CTCTGCAGCTCTGACCCAAATTTAGTCCCAGAAATAAACTGAGAAGTGGAAAAAAAAA 
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FIGURE 456 

MATARPPWMWVLCALITALLLGVTEHVLANNDVSCDHPSNTVPSGSNQDLGAGAGEDARSDDS
SSRIINGSDCDMHTQPWQAALLLRPNQLYCGAVLVHPQWLLTAAHCRKKVFRVRLGHYSLSPV 
YESGQQMFQGVKSIPHPGYSHPGHSNDLMLIKLNRRIRPTKDVRPINVSSHCPSAGTKCLVSG 
WGTTKSPQVHFPKVLQCLNISVLSQKRCEDAYPRQIDDTMFCAGDKAGRDSCQGDSGGPVVCN 
GSLQGLVSWGDYPCARPNRPGVYTNLCKFTKWIQETIQANS 
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FIGURE 457 



GCAGTCAGAGACTTCCCCTGCCCCTCGCTGGGAAAGAACATTAGGAATGCCTTTTAGTGCCTTGCTTCCTGAACT 
AGCTCACAGTAGCCCGGCGGCCCAGGGCAATCCGACCACATTTCACTCTCACCGCTGTAGGAATCCAGATGCAGG
CCAAGTACAGCAGCACGAGGGACATGCTGGATGATGATGGGGACACCACCATGAGCCTGCATTCTCAAGCCTCTG 
CCACAACTCGGCATCCAGAGCCCCGGCGCACAGAGCACAGGGCTCCCTCTTCAACGTGGCGACCAGTGGCCCTGA 
CCCTGCTGACTTTGTGCTTGGTGCTGCTGATAGGGCTGGCAGCCCTGGGGCTTTTGTTTTTTCAGTACTACCAGC 
TCTCCAATACTGGTCAAGACACCATTTCTCAAATGGAAGAAAGATTAGGAAATACGTCCCAAGAGTTGCAATCTC 
TTCAAGTCCAGAATATAAAGCTTGCAGGAAGTCTGCAGCATGTGGCTGAAAAACTCTGTCGTGAGCTGTATAACA 
AAGCTGGAGCACACAGGTGCAGCCCTTGTACAGAACAATGGAAATGGCATGGAGACAATTGCTACCAGTTCTATA 
AAGACAGCAAAAGTTGGGAGGACTGTAAATATTTCTGCCTTAGTGAAAACTCTACCATGCTGAAGATAAACAAAC 
AAGAAGACCTGGAATTTGCCGCGTCTCAGAGCTACTCTGAGTTTTTCTACTCTTATTGGACAGGGCTTTTGCGCC 
CTGACAGTGGCAAGGCCTGGCTGTGGATGGATGGAACCCCTTTCACTTCTGAACTGTTCCATATTATAATAGATG 
TCACCAGCCCAAGAAGCAGAGACTGTGTGGCCATCCTCAATGGGATGATCTTCTCAAAGGACTGCAAAGAATTGA 
AGCGTTGTGTCTGTGAGAGAAGGGCAGGAATGGTGAAGCCAGAGAGCCTCCATGTCCCCCCTGAAACATTAGGCG 
AAGGTGACTGATTCGCCCTCTGCAACTACAAATAGCAGAGTGAGCCAGGCGGTGCCAAAGCAAGGGCTAGTTGAG 
ACATTGGGAAATGGAACATAATCAGGAAAGACTATCTCTCTGACTAGTACAAAATGGGTTCTCGTGTTTCCTGTT 
CAGGATCACCAGCATTTCTGAGCTTGGGTTTATGCACGTATTTAACAGTCACAAGAAGTCTTATTTACATGCCAC 
CAACCAACCTCAGAAACCCATAATGTCATCTGCCTTCTTGGCTTAGAGATAACTTTTAGCTCTCTTTCTTCTCAA 
TGTCTAATATCACCTCCCTGTTTTCATGTCTTCCTTACACTTGGTGGAATAAGAAACTTTTTGAAGTAGAGGAAA 
TACATTGAGGTAACATCCTTTTCTCTGACAGTCAAGTAGTCCATCAGAAATTGGCAGTCACTTCCCAGATTGTAC
CAGCAAATACACAAGGAATTCTTTTTGTTTGTTTCAGTTCATACTAGTCCCTTCCCAATCCATCAGTAAAGACCC 
CATCTGCCTTGTCCATGCCGTTTCCCAACAGGGATGTCACTTGATATGAGAATCTCAAATCTCAATGCCTTATAA 
GCATTCCTTCCTGTGTCCATTAAGACTCTGATAATTGTCTCCCCTCCATAGGAATTTCTCCCAGGAAAGAAATAT 
ATCCCCATCTCCGTTTCATATCAGAACTACCGTCCCCGATATTCCCTTCAGAGAGATTAAAGACCAGAAAAAAGT 
GAGCCTCTTCATCTGCACCTGTAATAGTTTCAGTTCCTATTTTCTTCCATTGACCCATATTTATACCTTTCAGGT 
ACTGAAGATTTAATAATAATAAATGTAAATACTGTGAAAAA 
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FIGURE 458 

MQAKYSSTRDMLDDDGDTTMSLHSQASATTRHPEPRRTEHRAPSSTWRPVALTLLTLCLVLLI 
GLAALGLLFFQYYQLSNTGQDTISQMEERLGNTSQELQSLQVQNIKLAGSLQHVAEKLCRELY 
NKAGAHRCSPCTEQWKWHGDNCYQFYKDSKSWEDCKYFCLSENSTMLKINKQEDLEFAASQSY 
SEFFYSYWTGLLRPDSGKAWLWMDGTPFTSELFHIIIDVTSPRSRDCVAILNGMIFSKDCKEL 
KRCVCERRAGMVKPESLHVPPETLGEGD 



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FIGURE 459 

GTTGATGGCAAACTTCCTCAAAGGAGGGGCAGAGCCTGCGCAGGGCAGGAGCAGCTGGCCCAC 

TGGCGGCCCGCAACACTCCGTCTCACCCTCTGGGCCCACTGCATCTAGAGGAGGGCCGTCTGT 

GAGGCCACTACCCCTCCAGCAACTGGGAGGTGGGACTGTCAGAAGCTGGCCCAGGGTGGTGGT 

CAGCTGGGTCAGGGACCTACGGCACCTGCTGGACCACCTCGCCTTCTCCATCGAAGCAGGGAA 

GTGGGAGCCTCGAGCCCTCGGGTGGAAGCTGACCCCAAGCCACCCTTCACCTGGACAGGATGA

GAGTGTCAGGTGTGCTTCGCCTCCTGGCCCTCATCTTTGCCATAGTCACGACATGGATGTTTA 

TTCGAAGCTACATGAGCTTCAGCATGAAAACCATCCGTCTGCCACGCTGGCTGGCAGCCTCGC 

CCACCAAGGAGATCCAGGTTAAAAAGTACAAGTGTGGCCTCATCAAGCCCTGCCCAGCCAACT 

ACTTTGCGTTTAAAATCTGCAGTGGGGCCGCCAACGTCGTGGGCCCTACTATGTGCTTTGAAG 

AGCGCATGATCATGAGTCCTGTGAAAAACAATGTGGGCAGAGGCCTAAACATCGCCCTGGTGA 

ATGGAACCACGGGAGCTGTGCTGGGACAGAAGGCATTTGACATGTACTCTGGAGATGTTATGC 

ACCTAGTGAAATTCCTTAAAGAAATTCCGGGGGGTGCACTGGTGCTGGTGGCCTCCTACGACG 

ATCCAGGGACCAAAATGAACGATGAAAGCAGGAAACTCTTCTCTGACTTGGGGAGTTCCTACG 

CAAAACAACTGGGCTTCCGGGACAGCTGGGTCTTCATAGGAGCCAAAGACCTCAGGGGTAAAA 

GCCCCTTTGAGCAGTTCTTAAAGAACAGCCCAGACACAAACAAATACGAGGGATGGCCAGAGC 

TGCTGGAGATGGAGGGCTGCATGCCCCCGAAGCCATTTTAGGGTGGCTGTGGCTCTTCCTCAG

CCAGGGGCCTGAAGAAGCTCCTGCCTGACTTAGGAGTCAGAGCCCGGCAGGGGCTGAGGAGGA 

GGAGCAGGGGGTGCTGCGTGGAAGGTGCTGCAGeTCCTTGCACGCTGTGTCGCGCCTCTCCTC 

CTCGGAAACAGAACCCTCCCACAGCACATCCTACCCGGAAGACCAGCCTCAGAGGGTCCTTCT 

GGAACCAGCTGTCTGTGGAGAGAATGGGGTGCTTTCGTCAGGGACTGCTGACGGCTGGTCCTG 

AGGAAGGACAAACTGCCCAGACTTGAGCCCAATTAAATTTTATTTTTGCTGGTTTTGAAAAAA 
AAAAAAAAAAAAAA 
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FIGURE 460 

MRVSGVLRLLALIFAIVTTWMFIRSYMSFSMKTIRLPRWLAASPTKEIQVKKYKCGLIKPCPA 
NYFAFKICSGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGTTGAVLGQKAFDMYSGDV
MHLVKFLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFRDSWVFIGAKDLRG 
KSPFEQFLKNSPDTNKYEGWPELLEMEGCMPPKPF 

Important features: 
Signal peptide: 

amino acids 1-15 

ATP/GTP-binding site motif A (P-loop):

amino acids 184-191 



N-glycosylation site. 

amino acids 107-110 
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FIGURE 461 

AAACTCAGCACTTGCCGGAGTGGCTCATTGTTAAGACAAAGGGTGTGCACTTCCTGGCCAGGA 

AACCTGAGCGGTGAGACTCCCAGCTGCCTACATCAAGGCCCCAGGACATGCAGAACCTTCCTC 

TAGAACCCGACCCACCACCATGAGGTCCTGCCTGTGGAGATGCAGGCACCTGAGCCAAGGCGT 

CCAGTGGTCCTTGCTTCTGGCTGTCCTGGTCTTCTTTCTCTTCGCCTTGCCCTCTTTTATTAA 

GGAGCCTCAAACAAAGCCTTCCAGGCATCAACGCACAGAGAACATTAAAGAAAGGTCTCTACA 

GTCCCTGGCAAAGCCTAAGTCCCAGGCACCCACAAGGGCGAGGAGGACAACCATCTATGCAGA 

GCCAGCGCCAGAGAACAATGCCCTCAACACACAAACCCAGCCCAAGGCCCACACCACCGGAGA 

CAGAGGAAAGGAGGCCAACCAGGCACCGCCGGAGGAGCAGGACAAGGTGCCCCACACAGCACA 

GAGGGCAGCATGGAAGAGCCCAGAAAAAGAGAAAACCATGGTGAACACACTGTCACCCAGAGG 

GCAAGATGCAGGGATGGCCTCTGGCAGGACAGAGGCACAATCATGGAAGAGCCAGGACACAAA 

GACGACCCAAGGAAATGGGGGCCAGACCAGGAAGCTGACGGCCTCCAGGACGGTGTCAGAGAA 

GCACCAGGGCAAAGCGGCAACCACAGCCAAGACGCTCATTCCCAAAAGTCAGCACAGAATGCT 

GGCTCCCACAGGAGCAGTGTCAACAAGGACGAGACAGAAAGGAGTGACCACAGCAGTCATCCC 

ACCTAAGGAGAAGAAACCTCAGGCCACCCCACCCCCTGCCCCTTTCCAGAGCCCCACGACGCA 

GAGAAACCAAAGACTGAAGGCCGCCAACTTCAAATCTGAGCCTCGGTGGGATTTTGAGGAAAA 

ATACAGCTTCGAAATAGGAGGCCTTCAGACGACTTGCCCTGACTCTGTGAAGATCAAAGCCTC 

CAAGTCGCTGTGGCTCCAGAAACTCTTTCTGCCCAACCTCACTCTCTTCCTGGACTCCAGACA 

CTTCAACCAGAGTGAGTGGGACCGCCTGGAACACTTTGCACCACCCTTTGGCTTCATGGAGCT 

CAACTACTCCTTGGTGCAGAAGGTCGTGACACGCTTCCCTCCAGTGCCCCAGCAGCAGCTGCT

CCTGGCCAGCCTCCCCGCTGGGAGCCTCCGGTGCATCACCTGTGCCGTGGTGGGCAACGGGGG 

CATCCTGAACAACTCCCACATGGGCCAGGAGATAGACAGTCACGACTACGTGTTCCGATTGAG 

CGGAGCTCTCATTAAAGGCTACGAACAGGATGTGGGGACTCGGACATCCTTCTACGGCTTTAC 

CGCCTTCTCCCTGACCCAGTCACTCCTTATATTGGGCAATCGGGGTTTCAAGAACGTGCCTCT 

TGGGAAGGACGTCCGCTACTTGCACTTCCTGGAAGGCACCCGGGACTATGAGTGGCTGGAAGC 

ACTGCTTATGAATCAGACGGTGATGTCAAAAAACCTTTTCTGGTTCAGGCACAGACCCCAGGA 

AGCTTTTCGGGAAGCCCTGCACATGGACAGGTACCTGTTGCTGCACCCAGACTTTCTCCGATA 

CATGAAGAACAGGTTTCTGAGGTCTAAGACCCTGGATGGTGCCCACTGGAGGATATACCGCCC 

CACCACTGGGGCCCTCCTGCTGCTCACTGCCCTTCAGCTCTGTGACCAGGTGAGTGCTTATGG 

CTTCATCACTGAGGGCCATGAGCGCTTTTCTGATCACTACTATGATACATCATGGAAGCGGCT 

GATCTTTTACATAAACCATGACTTCAAGCTGGAGAGAGAAGTCTGGAAGCGGCTACACGATGA 

AGGGATAATCCGGCTGTACCAGCGTCCTGGTCCCGGAACTGCCAAAGCCAAGAACTGACCGGG 

GCCAGGGCTGCCATGGTCTCCTTGCCTGCTCCAAGGCACAGGATACAGTGGGAATCTTGAGAC 

TCTTTGGCCATTTCCCATGGCTCAGACTAAGCTCCAAGCCCTTCAGGAGTTCCAAGGGAACAC 

TTGAACCATGGACAAGACTCTCTCAAGATGGCAAATGGCTAATTGAGGTTCTGAAGTTCTTCA 

GTACATTGCTGTAGGTCCTGAGGCCAGGGATTTTTAATTAAATGGGGTGATGGGTGGCCAATA 

CCACAATTCCTGCTGAAAAACACTCTTCCAGTCCAAAAGCTTCTTGATACAGAAAAAAGAGCC 

TGGATTTACAGAAACATATAGATCTGGTTTGAATTCCAGATCGAGTTTACAGTTGTGAAATCT 

TGAAGGTATTACTTAACTTCACTACAGATTGTCTAGAAGACCTTTCTAGGAGTTATCTGATTC 

TAGAAGGGTCTATACTTGTCCTTGTCTTTAAGCTATTTGACAACTCTACGTGTTGTAGAAAAC 

TGATAATAATACAAATGATTGTTGTCCATGGAAAGGCAAATAAATTTTCTACAGTGAAAAAAA 
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FIGURE 462 

MRSCLWRCRHLSQGVQWSLLLAVLVFFLFALPSFIKEPQTKPSRHQRTENIKERSLQSLAKPK 
SQAPTRARRTTIYAEPAPENNALNTQTQPKAHTTGDRGKEANQAPPEEQDKVPHTAQRAAWKS 
PEKEKTMVNTLSPRGQDAGMASGRTEAQSWKSQDTKTTQGNGGQTRKLTASRTVSEKHQGKAA 
TTAKTLIPKSQHRMLAPTGAVSTRTRQKGVTTAVIPPKEKKPQATPPPAPFQSPTTQRNQRLK 
AANFKSEPRWDFEEKYSFEIGGLQTTCPDSVKIKASKSLWLQKLFLPNLTLFLDSRHFNQSEW 
DRLEHFAPPFGFMELNYSLVQKVVTRFPPVPQQQLLLASLPAGSLRCITCAVVGNGGILNNSH
MGQEIDSHDYVFRLSGALIKGYEQDVGTRTSFYGFTAFSLTQSLLILGNRGFKNVPLGKDVRY 
LHFLEGTRDYEWLEALLMNQTVMSKNLFWFRHRPQEAFREALHMDRYLLLHPDFLRYMKNRFL 
RSKTLDGAHWRIYRPTTGALLLLTALQLCDQVSAYGFITEGHERFSDHYYDTSWKRLIFYINH 
DFKLEREVWKRLHDEGIIRLYQRPGPGTAKAKN 

Important features:
Cytoplasmic Domain:

amino acids 1-10 

Type II Transmembrane Domain:

amino acids 11-35 

Lumenal Catalytic Domain:

amino acids 36-600 

Ribonucleotide Reductase small subunit Signature: 

amino acids 481-496 

N-glycosylation Sites:

amino acids 300-303, 311-314, 331-334, 375-378, 460-463 
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FIGURE 463 



GGGGGAGCTAGGCCGGCGGCAGTGGTGGTGGCGGCGGCGCAAGGGTGAGGGCGGCCCCAGAAC 
CCCAGGTAGGTAGAGCAAGAAGATGGTGTTTCTGCCCCTCAAATGGTCCCTTGCAACCATGTC 
ATTTCTACTTTCCTCACTGTTGGCTCTCTTAACTGTGTCCACTCCTTCATGGTGTCAGAGCAC 
TGAAGCATCTCCAAAACGTAGTGATGGGACACCATTTCCTTGGT^TATVAATACGACTTCCTGA 
GTACGTCATCCCAGTTCATTATGATCTCTTGATCCATGCAAACCTTACCACGCTGACCTTCTG 
GGGAACCACGAAAGTAGAAATCACAGCCAGTCAGCCCACCAGCACCATCATCCTGCATAGTCA 
CCACCTGCAGATATCTAGGGCCACCCTCAGGAAGGGAGCTGGAGAGAGGCTATCGGAAGAACC 
CCTGCAGGTCCTGGAACACCCCCCTCAGGAGCAAATTGCACTGCTGGCTCCCGAGCCCCTCCT 
TGTCGGGCTCCCGTACACAGTTGTCATTCACTATGCTGGCAATCTTTCGGAGACTTTCCACGG 
ATTTTACAAAAGCACCTACAGAACCAAGGAAGGGGAACTGAGGATACTAGCATCAACACAATT 
TGAACCCACTGCAGCTAGAATGGCCTTTCCCTGCTTTGATGAACCTGCCTTCAAAGCAAGTTT 
CTCAATCAAAATTAGAAGAGAGCCAAGGCACCTAGCCATCTCCAATATGCCATTGGTGAAATC 
TGTGACTGTTGCTGAAGGACTCATAGAAGACCATTTTGATGTCAGTGTGAAGATGAGCACCTA 
TCTGGTGGCCTTCATCATTTCAGATTTTGAGTCTGTCAGCAAGATAACCAAGAGTGGAGTCAA 
GGTTTCTGTTTATGCTGTGCCAGACAAGATAAATCAAGCAGATTATGCACTGGATGCTGCGGT 
GACTCTTCTAGAATTTTATGAGGATTATTTCAGCATACCGTATCCCCTACCCAAACAAGATCT 
TGCTGCTATTCCCGACTTTCAGTCTGGTGCTATGGAAAACTGGGGACTGACAACATATAGAGA 
ATCTGCTCTGTTGTTTGATGCAGAAAAGTCTTCTGCATCAAGTAAGCTTGGCATCACAGTGAC 
TGTGGCCCATGAACTGGCCCACCAGTGGTTTGGGAACCTGGTCACTATGGAATGGTGGAATGA 
TCTTTGGCTAAATGAAGGATTTGCCAAATTTATGGAGTTTGTGTCTGTCAGTGTGACCCATCC 
TGAACTGAAAGTTGGAGATTATTTCTTTGGCAAATGTTTTGACGCAATGGAGGTAGATGCTTT 
AAATTCCTCACACCCTGTGTCTACACCTGTGGAAAATCCTGCTCAGATCCGGGAGATGTTTGA 
TGATGTTTCTTATGATAAGGGAGCTTGTATTCTGAATATGCTAAGGGAGTATCTTAGCGCTGA 
CGCATTTAAAAGTGGTATTGTACAGTATCTCCAGAAGCATAGCTATAAAAATACAAAAAACGA 
GGACCTGTGGGATAGTATGGCAAGTATTTGCCCTACAGATGGTGTAAAAGGGATGGATGGCTT 
TTGCTCTAGAAGTCAACATTCATCTTCATCCTCACATTGGCATCAGGAAGGGGTGGATGTGAA 
AACCATGATGAACACTTGGACACTGCAGAGGGGTTTTCCCCTAATAACCATCACAGTGAGGGG

gaggaatgtacacatgaagcaagagcactacatgaagggctctgacggcgccccggacactgg 
gtacctgtggcatgttccattgacattcatcaccagcaaatccaacatggtccatcgattttt 
gctaaaaacaaaaacagatgtgctcatcctcccagaagaggtggaatggatcaaatttaatgt 
gggcatgaatggctattacattgtgcattacgaggatgatggatgggactctttgactggcct 
tttaaaaggaacacacacagcagtcagcagtaatgatcgggcaagtctcattaacaatgcatt 
tcagctcgtcagcattgggaagctgtccattgaaaaggccttggatttatccctgtacttgaa 

ACATGAAACTGAAATTATGCCCGTGTTTCAAGGTTTGAATGAGCTGATTCCTATGTATAAGTT 

aatggagaaaagagatatgaatgaagtggaaactcaattcaaggccttcctcatcaggctgct 
aagggacctcattgataagcagacatggacagacgagggctcagtctcagagcaaatgctgcg 
gagtgaactactactcctcgcctgtgtgcacaactatcagccgtgcgtacagagggcagaagg 
ctatttcagaaagtggaaggaatccaatggaaacttgagcctgcctgtcgacgtgaccttggc 
agtgtttgctgtgggggcccagagcacagaaggctgggattttctttatagtaaatatcagtt 
ttctttgtccagtactgagaaaagccaaattgaatttgccctctgcagaacccaaaataagga 
aaagcttcaatggctactagatgaaagctttaagggagataaaataaaaactcaggagtttcc 
acaaattcttacactcattggcaggaacccagtaggatacccactggcctggcaatttctgag 
gaaaaactggaacaaacttgtacaaaagtttgaacttggctcatcttccatagcccacatggt 

AATGGGTACAACAAATCAATTCTCCACAAGAACACGGCTTGAAGAGGTAAAAGGATTCTTCAG 
CTCTTTGAAAGAAAATGGTTCTCAGCTCCGTTGTGTCCAACAGACAATTGAAACCATTGAAGA

aaacatcggttggatggataagaattttgataaaatcagagtgtggctgcaaagtgaaaagct 
tgaacgtatgtaaaaattcctcccttgcccggttcctgttatctctaatcaccaacattttgt 
tgagtgtattttcaaactagagatggctgttttggctccaactggagatacttttttcccttc 
aactcattttttgactatccctgtgaaaagaatagctgttagtttttcatgaatgggcttttt 
catgaatgggctatcgctaccatgtgttttgttcatcacaggtgttgccctgcaacgtaaacc 
caagtgttgggttccctgccacagaagaataaagtaccttattcttctcaaaaaaaaaaaaaa 
aaaaaaaaaaaaa 
FIGURE 464

MVFLPLKWSLATMSFLLSSLLALLTVSTPSWCQSTEASPKRSDGTPFPWNKIRLPEYVIPVHY 
DLLIHANLTTLTFWGTTKVEITASQPTSTIILHSHHLQISRATLRKGAGERLSEEPLQVLEHP 
PQEQIALLAPEPLLVGLPYTVVIHYAGNLSETFHGFYKSTYRTKEGELRILASTQFEPTAARM 
AFPCFDEPAFKASFSIKIRREPRHLAISNMPLVKSVTVAEGLIEDHFDVTVKMSTYLVAFIIS 
DFESVSKITKSGVKVSVYAVPDKINQADYALDAAVTLLEFYEDYFSIPYPLPKQDLAAIPDFQ 
SGAMENWGLTTYRESALLFDAEKSSASSKLGITVTVAHELAHQWFGNLVTMEWWNDLWLNEGF 
AKFMEFVSVSVTHPELKVGDYFFGKCFDAMEVDALNSSHPVSTPVENPAQIREMFDDVSYDKG 
ACILNMLREYLSADAFKSGIVQYLQKHSYKNTKNEDLWDSMASICPTDGVKGMDGFCSRSQHS 
SSSSHWHQEGVDVKTMMNTWTLQRGFPLITITVRGRNVHMKQEHYMKGSDGAPDTGYLWHVPL 
TFITSKSNMVHRFLLKTKTDVLILPEEVEWIKFNVGMNGYYIVHYEDDGWDSLTGLLKGTHTA 
VSSNDRASLINNAFQLVSIGKLSIEKALDLSLYLKHETEIMPVFQGLNELIPMYKLMEKRDMN 
EVETQFKAFLIRLLRDLIDKQTWTDEGSVSEQMLRSELLLLACVHNYQPCVQRAEGYFRKWKE 
SNGNLSLPVDVTLAVFAVGAQSTEGWDFLYSKYQFSLSSTEKSQIEFALCRTQNKEKLQWLLD 
ESFKGDKIKTQEFPQILTLIGRNPVGYPLAWQFLRKNWNKLVQKFELGSSSIAHMVMGTTNQF 
STRTRLEEVKGFFSSLKENGSQLRCVQQTIETIEENIGWMDKNFDKIRVWLQSEKLERM 

Important features:
Signal peptide:

amino acids 1-34 

N-glycosylation sites:

amino acids 70-74, 154-158, 414-418, 760-764, 901-905 

Neutral zinc metallopeptidases, zinc-binding region signature:

amino acids 350-360 
FIGURE 465



CAGCCACAGACGGGTCATGAGCGCGGTATTACTGCTGGCCCTCCTGGGGTTCATCCTCCCACT 

GCCAGGAGTGCAGGCGCTGCTCTGCCAGTTTGGGACAGTTCAGCATGTGTGGAAGGTGTCCGA 

CCTACCCCGGCAATGGACCCCTAAGAACACCAGCTGCGACAGCGGCTTGGGGTGCCAGGACAC 

GTTGATGCTCATTGAGAGCGGACCCCAAGTGAGCCTGGTGCTCTCCAAGGGCTGCACGGAGGC 

CAAGGACCAGGAGCCCCGCGTCACTGAGCACCGGATGGGCCCCGGCCTCTCCCTGATCTCCTA 

CACCTTCGTGTGCCGCCAGGAGGACTTCTGCAACAACCTCGTTAACTCCCTCCCGCTTTGGGC 

CCCACAGCCCCCAGCAGACCCAGGATCCTTGAGGTGCCCAGTCTGCTTGTCTATGGAAGGCTG 

TCTGGAGGGGACAACAGAAGAGATCTGCCCCAAGGGGACCACACACTGTTATGATGGCCTCCT 

CAGGCTCAGGGGAGGAGGCATCTTCTCCAATCTGAGAGTCCAGGGATGCATGCCCCAGCCAGG 

TTGCAACCTGCTCAATGGGACACAGGAAATTGGGCCCGTGGGTATGACTGAGAACTGCAATAG 

GAAAGATTTTCTGACCTGTCATCGGGGGACCACCATTATGACACACGGAAACTTGGCTCAAGA 

ACCCACTGATTGGACCACATCGAATACCGAGATGTGCGAGGTGGGGCAGGTGTGTCAGGAGAC 

GCTGCTGCTCATAGATGTAGGACTCACATCAACCCTGGTGGGGACAAAAGGCTGCAGCACTGT 

TGGGGCTCAAAATTCCCAGAAGACCACCATCCACTCAGCCCCTCCTGGGGTGCTTGTGGCCTC 

CTATACCCACTTCTGCTCCTCGGACCTGTGCAATAGTGCCAGCAGCAGCAGCGTTCTGCTGAA 

CTCCCTCCCTCCTCAAGCTGCCCCTGTCCCAGGAGACCGGCAGTGTCCTACCTGTGTGCAGCC 

CCTTGGAACCTGTTCAAGTGGCTCCCCCCGAATGACCTGCCCCAGGGGCGCCACTCATTGTTA 

TGATGGGTACATTCATCTCTCAGGAGGTGGGCTGTCCACCAAAATGAGCATTCAGGGCTGCGT 

GGCCCAACCTTCCAGCTTCTTGTTGAACCACACCAGACAAATCGGGATCTTCTCTGCGCGTGA 

GAAGCGTGATGTGCAGCCTCCTGCCTCTCAGCATGAGGGAGGTGGGGCTGAGGGCCTGGAGTC 

TCTCACTTGGGGGGTGGGGCTGGCACTGGCCCCAGCGCTGTGGTGGGGAGTGGTTTGCCCTTC 

CTGCTAACTCTATTACCCCCACGATTCTTCACCGCTGCTGACCACCCACACTCAACCTCCCTC

TGACCTCATAACCTAATGGCCTTGGACACCAGATTCTTTCCCATTCTGTCCATGAATCATCTT 

CCCCACACACAATCATTCATATCTACTCACCTAACAGCAACACTGGGGAGAGCCTGGAGCATC 

CGGACTTGCCCTATGGGAGAGGGGACGCTGGAGGAGTGGCTGCATGTATCTGATAATACAGAC 
CCTGTCCTTTCA 
FIGURE 466

MSAVLLLALLGFILPLPGVQALLCQFGTVQHVWKVSDLPRQWTPKNTSCDSGLGCQDTLMLIE 
SGPQVSLVLSKGCTEAKDQEPRVTEHRMGPGLSLISYTFVCRQEDFCNNLVNSLPLWAPQPPA
DPGSLRCPVCLSMEGCLEGTTEEICPKGTTHCYDGLLRLRGGGIFSNLRVQGCMPQPGCNLLN 
GTQEIGPVGMTENCNRKDFLTCHRGTTIMTHGNLAQEPTDWTTSNTEMCEVGQVCQETLLLID 
VGLTSTLVGTKGCSTVGAQNSQKTTIHSAPPGVLVASYTHFCSSDLCNSASSSSVLLNSLPPQ 
AAPVPGDRQCPTCVQPLGTCSSGSPRMTCPRGATHCYDGYIHLSGGGLSTKMSIQGCVAQPSS 
FLLNHTRQIGIFSAREKRDVQPPASQHEGGGAEGLESLTWGVGLALAPALWWGVVCPSC
FIGURE 467

GAGGATTTGCCACAGCAGCGGATAGAGCAGGAGAGCACCACCGGAGCCCTTGAGACATCCTTG 

AGAAGAGCCACAGCATAAGAGACTGCCCTGCTTGGTGTTTTGCAGGATGATGGTGGCCCTTCG

AGGAGCTTCTGCATTGCTGGTTCTGTTCCTTGCAGCTTTTCTGCCCCCGCCGCAGTGTACCCA 

GGACCCAGCCATGGTGCATTACATCTACCAGCGCTTTCGAGTCTTGGAGCAAGGGCTGGAAAA 

ATGTACCCAAGCAACGAGGGCATACATTCAAGAATTCCAAGAGTTCTCAAAAAATATATCTGT 

CATGCTGGGAAGATGTCAGACCTACACAAGTGAGTACAAGAGTGCAGTGGGTAACTTGGCACT 

GAGAGTTGAACGTGCCCAACGGGAGATTGACTACATACAATACCTTCGAGAGGCTGACGAGTG 

CATCGTATCAGAGGACAAGACACTGGCAGAAATGTTGCTCCAAGAAGCTGAAGAAGAGAAAAA 

GATCCGGACTCTGCTGAATGCAAGCTGTGACAACATGCTGATGGGCATAAAGTCTTTGAAAAT 

AGTGAAGAAGATGATGGACACACATGGCTCTTGGATGAAAGATGCTGTCTATAACTCTCCAAA 

GGTGTACTTATTAATTGGATCCAGAAACAACACTGTTTGGGAATTTGCAAACATACGGGCATT 

CATGGAGGATAACACCAAGCCAGCTCCCCGGAAGCAAATCCTAACACTTTCCTGGCAGGGAAC 

AGGCCAAGTGATCTACAAAGGTTTTCTATTTTTTCATAACCAAGCAACTTCTAATGAGATAAT 

CAAATATAACCTGCAGAAGAGGACTGTGGAAGATCGAATGCTGCTCCCAGGAGGGGTAGGCCG 

AGCATTGGTTTACCAGCACTCCCCCTCAACTTACATTGACCTGGCTGTGGATGAGCATGGGCT 

CTGGGCCATCCACTCTGGGCCAGGCACCCATAGCCATTTGGTTCTCACAAAGATTGAGCCGGG 

CACACTGGGAGTGGAGCATTCATGGGATACCCCATGCAGAAGCCAGGATGCTGAAGCCTCATT 

CCTCTTGTGTGGGGTTCTCTATGTGGTCTACAGTACTGGGGGCCAGGGCCCTCATCGCATCAC 

CTGCATCTATGATCCACTGGGCACTATCAGTGAGGAGGACTTGCCCAACTTGTTCTTCCCCAA 

GAGACCAAGAAGTCACTCCATGATCCATTACAACCCCAGAGATAAGCAGCTCTATGCCTGGAA 

TGAAGGAAACCAGATCATTTACAAACTCCAGACAAAGAGAAAGCTGCCTCTGAAGTAATGCAT 

TACAGCTGTGAGAAAGAGCACTGTGGCTTTGGCAGCTGTTCTACAGGACAGTGAGGCTATAGC 

CCCTTCACAATATAGTATCCCTCTAATCACACACAGGAAGAGTGTGTAGAAGTGGAAATACGT 

ATGCCTCCTTTCCCAAATGTCACTGCCTTAGGTATCTTCCAAGAGCTTAGATGAGAGCATATC 

ATCAGGAAAGTTTCAACAATGTCCATTACTCCCCCAAACCTCCTGGCTCTCAAGGATGACCAC 

ATTCTGATACAGCCTACTTCAAGCCTTTTGTTTTACTGCTCCCCAGCATTTACTGTAACTCTG 

CCATCTTCCCTCCCACAATTAGAGTTGTATGCCAGCCCCTAATATTCACCACTGGCTTTTCTC 

TCCCCTGGCCTTTGCTGAAGCTCTTCCCTCTTTTTCAAATGTCTATTGATATTCTCCCATTTT 

CACTGCCCAACTAAAATACTATTAATATTTCTTTCTTTTCTTTTCTTTTTTTTGAGACAAGGT 

CTCACTATGTTGCCCAGGCTGGTCTCAAACTCCAGAGCTCAAGAGATCCTCCTGCCTCAGCCT 

CCTAAGTACCTGGGATTACAGGCATGTGCCACCACACCTGGCTTAAAATACTATTTCTTATTG 

AGGTTTAACCTCTATTTCCCCTAGCCCTGTCCTTCCACTAAGCTTGGTAGATGTAATAATAAA 

GTGAAAATATTAACATTTGAATATCGCTTTCCAGGTGTGGAGTGTTTGCACATCATTGAATTC 

TCGTTTCACCTTTGTGAAACATGCACAAGTCTTTACAGCTGTCATTCTAGAGTTTAGGTGAGT 

AACACAATTACAAAGTGAAAGATACAGCTAGAAAATACTACAAATCCCATAGTTTTTCCATTG 

CCCAAGGAAGCATCAAATACGTATGTTTGTTCACCTACTCTTATAGTCAATGCGTTCATCGTT 

TCAGCCTAAAAATAATAGTCTGTCCCTTTAGCCAGTTTTCATGTCTGCACAAGACCTTTCAAT 

AGGCCTTTCAAATGATAATTCCTCCAGAAAACCAGTCTAAGGGTGAGGACCCCAACTCTAGCC 

TCCTCTTGTCTTGCTGTCCTCTGTTTCTCTCTTTCTGCTTTAAATTCAATAAAAGTGACACTG 
AGCAAAAAAAAAAAAAAA 
FIGURE 468

MMVALRGASALLVLFLAAFLPPPQCTQDPAMVHYIYQRFRVLEQGLEKCTQATRAYIQEFQEF 
SKNISVMLGRCQTYTSEYKSAVGNLALRVERAQREIDYIQYLREADECIVSEDKTLAEMLLQE 
AEEEKKIRTLLNASCDNMLMGIKSLKIVKKMMDTHGSWMKDAVYNSPKVYLLIGSRNNTVWEF 
ANIRAFMEDNTKPAPRKQILTLSWQGTGQVIYKGFLFFHNQATSNEIIKYNLQKRTVEDRMLL 
PGGVGRALVYQHSPSTYIDLAVDEHGLWAIHSGPGTHSHLVLTKIEPGTLGVEHSWDTPCRSQ 
DAEASFLLCGVLYVVYSTGGQGPHRITCIYDPLGTISEEDLPNLFFPKRPRSHSMIHYNPRDK 
QLYAWNEGNQIIYKLQTKRKLPLK
FIGURE 469



TGGCCTCCCCAGCTTGCCAGGCACAAGGCTGAGCGGGAGGAAGCGAGAGGCATCTAAGCAGGC 
AGTGTTTTGCCTTCACCCCAAGTGACCATGAGAGGTGCCACGCGAGTCTCAATCATGCTCCTC
CTAGTAACTGTGTCTGACTGTGCTGTGATCACAGGGGCCTGTGAGCGGGATGTCCAGTGTGGG 
GCAGGCACCTGCTGTGCCATCAGCCTGTGGCTTCGAGGGCTGCGGATGTGCACCCCGCTGGGG 
CGGGAAGGCGAGGAGTGCCACCCCGGCAGCCACAAGGTCCCCTTCTTCAGGAAACGCAAGCAC 
CACACCTGTCCTTGCTTGCCCAACCTGCTGTGCTCCAGGTTCCCGGACGGCAGGTACCGCTGC 
TCCATGGACTTGAAGAACATCAATTTTTAGGCGCTTGCCTGGTCTCAGGATACCCACCATCCT 
TTTCCTGAGCACAGCCTGGATTTTTATTTCTGCCATGAAACCCAGCTCCCATGACTCTCCCAG 
TCCCTACACTGACTACCCTGATCTCTCTTGTCTAGTACGCACATATGCACACAGGCAGACATA 
CCTCCCATCATGACATGGTCCCCAGGCTGGCCTGAGGATGTCACAGCTTGAGGCTGTGGTGTG 
AAAGGTGGCCAGCCTGGTTCTCTTCCCTGCTCAGGCTGCCAGAGAGGTGGTAAATGGCAGAAA 
GGACATTCCCCCTCCCCTCCCCAGGTGACCTGCTCTCTTTCCTGGGCCCTGCCCCTCTCCCCA 
CATGTATCCCTCGGTCTGAATTAGACATTCCTGGGCACAGGCTCTTGGGTGCATTGCTCAGAG 
TCCCAGGTCCTGGCCTGACCCTCAGGCCCTTCACGTGAGGTCTGTGAGGACCAATTTGTGGGT 
AGTTCATCTTCCCTCGATTGGTTAACTCCTTAGTTTCAGACCACAGACTCAAGATTGGCTCTT 
CCCAGAGGGCAGCAGACAGTCACCCCAAGGCAGGTGTAGGGAGCCCAGGGAGGCCAATCAGCC 
CCCTGAAGACTCTGGTCCCAGTCAGCCTGTGGCTTGTGGCCTGTGACCTGTGACCTTCTGCCA 
GAATTGTCATGCCTCTGAGGCCCCCTCTTACCACACTTTACCAGTTAACCACTGAAGCCCCCA 
ATTCCCACAGCTTTTCCATTAAAATGCAAATGGTGGTGGTTCAATCTAATCTGATATTGACAT 
ATTAGAAGGCAATTAGGGTGTTTCCTTAAACAACTCCTTTCCAAGGATCAGCCCTGAGAGCAG 
GTTGGTGACTTTGAGGAGGGCAGTCCTCTGTCCAGATTGGGGTGGGAGCAAGGGACAGGGAGC 
AGGGCAGGGGCTGAAAGGGGCACTGATTCAGACCAGGGAGGCAACTACACACCAACATGCTGG 
CTTTAGAATAAAAGCACCAACTGAAAAAA 
FIGURE 470



MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEECHPG 
SHKVPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF 

Important features:
Signal peptide:

amino acids 1-19

Tyrosine kinase phosphorylation site: 

amino acids 88-95 

N-myristoylation sites: 

amino acids 33-39, 35-41, 46-52 
FIGURE 471

AGCGCCCGGGCGTCGGGGCGGTAAAAGGCCGGCAGAAGGGAGGCACTTGAGAAATGTCTTTCC 

TCCAGGACCCAAGTTTCTTCACCATGGGGATGTGGTCCATTGGTGCAGGAGCCCTGGGGGCTG 

CTGCCTTGGCATTGCTGCTTGCCAACACAGACGTGTTTCTGTCCAAGCCCCAGAAAGCGGCCC 

TGGAGTACCTGGAGGATATAGACCTGAAAACACTGGAGAAGGAACCAAGGACTTTCAAAGCAA 

AGGAGCTATGGGAAAAAAATGGAGCTGTGATTATGGCCGTGCGGAGGCCAGGCTGTTTCCTCT 

GTCGAGAGGAAGCTGCGGATCTGTCCTCCCTGAAAAGCATGTTGGACCAGCTGGGCGTCCCCC 

TCTATGCAGTGGTAAAGGAGCACATCAGGACTGAAGTGAAGGATTTCCAGCCTTATTTCAAAG 

GAGAAATCTTCCTGGATGAAAAGAAAAAGTTCTATGGTCCACAAAGGCGGAAGATGATGTTTA 

TGGGATTTATCCGTCTGGGAGTGTGGTACAACTTCTTCCGAGCCTGGAACGGAGGCTTCTCTG 

GAAACCTGGAAGGAGAAGGCTTCATCCTTGGGGGAGTTTTCGTGGTGGGATCAGGAAAGCAGG 

GCATTCTTCTTGAGCACCGAGAAAAAGAATTTGGAGACAAAGTAAACCTACTTTCTGTTCTGG 

AAGCTGCTAAGATGATCAAACCACAGACTTTGGCCTCAGAGAAAAAAT^TTGTGTGAAACTG 

CCCAGCTCAGGGATAACCAGGGACATTCACCTGTGTTCATGGGATGTATTGTTTCCACTCGTG 

TCCCTAAGGAGTGAGAAACCCATTTATACTCTACTCTCAGTATGGATTATTAATGTATTTTAA 

TATTCTGTTTAGGCCCACTAAGGCAAAATAGCCCCAAAACAAGACTGACAAAAATCTGAAAAA 

CTAATGAGGATTATTAAGCTAAAACCTGGGAAATAGGAGGCTTAAAATTGACTGCCAGGCTGG 

GTGCAGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCAAGGTGAGCAAGTCACTTGAG 

GTCGGGAGTTCGAGACCAGCCTGAGCAACATGGCGAAACCCCGTCTCTACTAAAAATACAAAA 

ATCACCCGGGTGTGGTGGCAGGCACCTGTAGTCCCAGCTACCCGGGAGGCTGAGGCAGGAGAA 

TCACTTGAACCTGGGAGGTGGAGGTTGCGGTGAGCTGAGATCACACCACTGTATTCCAGCCTG 
GGTGACTGAGACTCTAACTAA 
FIGURE 472

MSFLQDPSFFTMGMWSIGAGALGAAALALLLANTDVFLSKPQKAALEYLEDIDLKTLEKEPRT
FKAKELWEKNGAVIMAVRRPGCFLCREEAADLSSLKSMLDQLGVPLYAVVKEHIRTEVKDFQP
YFKGEIFLDEKKKFYGPQRRKMMFMGFIRLGVWYNFFRAWNGGFSGNLEGEGFILGGVFVVGS 
GKQGILLEHREKEFGDKVNLLSVLEAAKMIKPQTLASEKK 
FIGURE 473

AATATATCATCTATTTATCATTAATCAATAATGTATTCTTTTATTCCAATAACATTTGGGTTT 

TGGGATTTTAATTTTCAAACACAGCAGAATCACATTTTTTCTGTCACTATTATTATTGTTGGT 

ATGTGAAGCTATTTGGAGATCCAATTCAGGAAGCAACACATTGGAGAATGGCTACTTTCTATC 

AAGAAATAAAGAGAACCACAGTCAACCCACACAATCATCTTTAGAAGACAGTGTGACTCCTAC 

CAAAGCTGTCAAAACCACAGGCAAGGGCATAGTTAAAGGACGGAATCTTGACTCAAGAGGGTT 

AATTCTTGGTGCTGAAGCCTGGGGCAGGGGTGTAAAGAAAAACACTT^ATTCAATGATTGTA 

AATTTAAGGCAAATACACATATTAGTATTACCTTAGTGTAATGTATCCCTGTCATATATACAA 

TAAGGTGAAATTATAAGTACCCTATGCAGTTGGCTGGACAGTTCTAAATTGGACTTTATTAAT 

TTTTAAAATCAGTAACTGATTTATCACTGGCTATGTGCTTAGATCTACAGGAGATCATATAAT 

TTGATACAAATAAAAGAAAAGTGTTCTCTCCCCTTACAGAATTGACATTTTAAATGCGATACA 

GTTAGAATAGGAAATATGACATTAGAAAGGAAGAATGACAGGGAGAAAGGAAAGAAGGGAAAA 
TGTTGCCAAGGAAAAAAAAA 
FIGURE 474



MTFFLSLLLLLVCEAIWRSNSGSNTLENGYFLSRNKENHSQPTQSSLEDSVTPTKAVKTTGKG 
IVKGRNLDSRGLILGAEAWGRGVKKNT 
FIGURE 475

GACAGTGGAGGGCAGTGGAGAGGACCGCGCTGTCCTGCTGTCACCAAGAGCTGGAGACACCAT 

CTCCCACCGAGAGTCATGGCCCCATTGGCCCTGCACCTCCTCGTCCTCGTCCCCATCCTCCTC 

AGCCTGGTGGCCTCCCAGGACTGGAAGGCTGAACGCAGCCAAGACCCCTTCGAGAAATGCATG 

CAGGATCCTGACTATGAGCAGCTGCTCAAGGTGGTGACCTGGGGGCTCAATCGGACCCTGAAG 

CCCCAGAGGGTGATTGTGGTTGGCGCTGGTGTGGCCGGGCTGGTGGCCGCCAAGGTGCTCAGC 

GATGCTGGACACAAGGTCACCATCCTGGAGGCAGATAACAGGATCGGGGGCCGCATCTTCACC 

TACCGGGACCAGAACACGGGCTGGATTGGGGAGCTGGGAGCCATGCGCATGCCCAGCTCTCAC 

AGGATCCTCCACAAGCTCTGCCAGGGCCTGGGGCTCAACCTGACCAAGTTCACCCAGTACGAC 

AAGAACACGTGGACGGAGGTGCACGAAGTGAAGCTGCGCAACTATGTGGTGGAGAAGGTGCCC 

GAGAAGCTGGGCTACGCCTTGCGTCCCCAGGAAAAGGGCCACTCGCCCGAAGACATCTACCAG 

ATGGCTCTCAACCAGGCCCTCAAAGACCTCAAGGCACTGGGCTGCAGAAAGGCGATGAAGAAG 

TTTGAAAGGCACACGCTCTTGGAATATCTTCTCGGGGAGGGGAACCTGAGCCGGCCGGCCGTG 

CAGCTTCTGGGAGACGTGATGTCCGAGGATGGCTTCTTCTATCTCAGCTTCGCCGAGGCCCTC 

CGGGCCCACAGCTGCCTCAGCGACAGACTCCAGTACAGCCGCATCGTGGGTGGCTGGGACCTG 

CTGCCGCGCGCGCTGCTGAGCTCGCTGTCCGGGCTTCTGCTGTTGAACGCGCCCGTGGTGGCG

ATGACCCAGGGACCGCACGATGTGCACGTGCAGATCGAGACCTCTCCCCCGGCGCGGAATCTG 

AAGGTGCTGAAGGCCGACGTGGTGCTGCTGACGGCGAGCGGACCGGCGGTGAAGCGCATCACC 

TTCTCGCCGCCGCTGCCCCGCCACATGCAGGAGGCGCTGCGGAGGCTGCACTACGTGCCGGCC 

ACCAAGGTGTTCCTAAGCTTCCGCAGGCCCTTCTGGCGCGAGGAGCACATTGAAGGCGGCCAC 

TCAAACACCGATCGCCCGTCGCGCATGATTTTCTACCCGCCGCCGCGCGAGGGCGCGCTGCTG 

CTGGCCTCGTACACGTGGTCGGACGCGGCGGCAGCGTTCGCCGGCTTGAGCCGGGAAGAGGCG 

TTGCGCTTGGCGCTCGACGACGTGGCGGCATTGCACGGGCCTGTCGTGCGCCAGCTCTGGGAC 

GGCACCGGCGTCGTCAAGCGTTGGGCGGAGGACCAGCACAGCCAGGGTGGCTTTGTGGTACAG 

CCGCCGGCGCTCTGGCAAACCGAAAAGGATGACTGGACGGTCCCTTATGGCCGCATCTACTTT 

GCCGGCGAGCACACCGCCTACCCGCACGGCTGGGTGGAGACGGCGGTCAAGTCGGCGCTGCGC 

GCCGCCATCAAGATCAACAGCCGGAAGGGGCCTGCATCGGACACGGCCAGCCCCGAGGGGCAC 

GCATCTGACATGGAGGGGCAGGGGCATGTGCATGGGGTGGCCAGCAGCCCCTCGCATGACCTG 

GCAAAGGAAGAAGGCAGCCACCCTCCAGTCCAAGGCCAGTTATCTCTCCAAAACACGACCCAC 

ACGAGGACCTCGCATTAAAGTATTTTCGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAA 
FIGURE 476



MAPLALHLLVLVPILLSLVASQDWKAERSQDPFEKCMQDPDYEQLLKVVTWGLNRTLKPQRVI 
VVGAGVAGLVAAKVLSDAGHKVTILEADNRIGGRIFTYRDQNTGWIGELGAMRMPSSHRILHK 
LCQGLGLNLTKFTQYDKNTWTEVHEVKLRNYVVEKVPEKLGYALRPQEKGHSPEDIYQMALNQ 
ALKDLKALGCRKAMKKFERHTLLEYLLGEGNLSRPAVQLLGDVMSEDGFFYLSFAEALRAHSC 
LSDRLQYSRIVGGWDLLPRALLSSLSGLVLLNAPVVAMTQGPHDVHVQIETSPPARNLKVLKA 
DVVLLTASGPAVKRITFSPPLPRHMQEALRRLHYVPATKVFLSFRRPFWREEHIEGGHSNTDR
PSRMIFYPPPREGALLLASYTWSDAAAAFAGLSREEALRLALDDVAALHGPVVRQLWDGTGVV
KRWAEDQHSQGGFVVQPPALWQTEKDDWTVPYGRIYFAGEHTAYPHGWVETAVKSALRAAIKI
NSRKGPASDTASPEGHASDMEGQGHVHGVASSPSHDLAKEEGSHPPVQGQLSLQNTTHTRTSH 



Important features: 
Signal peptide: 
amino acids 1-21 
FIGURE 477



CTGACATGGCCTGACTCGGGACAGCTCAGAGCAGGGCAGAACTGGGGACACTCTGGGCCGGCCTTCTGCCTGCAT 

GGACGCTCTGAAGCCACCCTGTCTCTGGAGGAACCACGAGCGAGGGAAGAAGGACAGGGACTCGTGTGGCAGGAA

GAACTCAGAGCCGGGAAGCCCCCATTCACTAGAAGCACTGAGAGATGCGGCCCCCTCGCAGGGTCTGAATTTCCT

GCTGCTGTTCACAAAGATGCTTTTTATCTTTAACTTTTTGTTTTCCCCACTTCCGACCCCGGCGTTGATCTGCAT 

CCTGACATTTGGAGCTGCCATCTTCTTGTGGCTGATCACCAGACCTCAACCCGTCTTACCTCTTCTTGACCTGAA 

CAATCAGTCTGTGGGAATTGAGGGAGGAGCACGGAAGGGGGTTTCCCAGAAGAACAATGACCTAACAAGTTGCTG 

CTTCTCAGATGCCAAGACTATGTATGAGGTTTTCCAAAGAGGACTCGCTGTGTCTGACAATGGGCCCTGCTTGGG 

ATATAGAAAACCAAACCAGCCCTACAGATGGCTATCTTACAAACAGGTGTCTGATAGAGCAGAGTACCTGGGTTC 

CTGTCTCTTGCATAAAGGTTATAAATCATCACCAGACCAGTTTGTCGGCATCTTTGCTCAGAATAGGCCAGAGTG 

GATCATCTCCGAATTGGCTTGTTACACGTACTCTATGGTAGCTGTACCTCTGTATGACACCTTGGGACCAGAAGC 

CATCGTACATATTGTCAACAAGGCTGATATCGCCATGGTGATCTGTGACACACCCCAAAAGGCATTGGTGCTGAT 

AGGGAATGTAGAGAAAGGCTTCACCCCGAGCCTGAAGGTGATCATCCTTATGGACCCCTTTGATGATGACCTGAA 

GCAAAGAGGGGAGAAGAGTGGAATTGAGATCTTATCCCTATATGATGCTGAGAACCTAGGCAAAGAGCACTTCAG 

AAAACCTGTGCCTCCTAGCCCAGAAGACCTGAGCGTCATCTGCTTCACCAGTGGGACCACAGGTGACCCCAAAGG 

AGCCATGATAACCCATCAAAATATTGTTTCAAATGCTGCTGCCTTTCTCAAATGTGTGGAGCATGCTTATGAGCC 

CACTCCTGATGATGTGGCCATATCCTACCTCCCTCTGGCTCATATGTTTGAGAGGATTGTACAGGCTGTTGTGTA 

CAGCTGTGGAGCCAGAGTTGGATTCTTCCAAGGGGATATTCGGTTGCTGGCTGACGACATGAAGACTTTGAAGCC 

CACATTGTTTCCCGCGGTGCCTCGACTCCTTAACAGGATCTACGATAAGGTACAAAATGAGGCCAAGACACCCTT 

GAAGAAGTTCTTGTTGAAGCTGGCTGTTTCCAGTAAATTCAAAGAGCTTCAAAAGGGTATCATCAGGCATGATAG 

TTTCTGGGACAAGCTCATCTTTGCAAAGATCCAGGACAGCCTGGGCGGAAGGGTTCGTGTAATTGTCACTGGAGC 

TGCCCCCATGTCCACTTCAGTCATGACATTCTTCCGGGCAGCAATGGGATGTCAGGTGTATGAAGCTTATGGTCA 

AACAGAATGCACAGGTGGCTGTACATTTACATTACCTGGGGACTGGACATCAGGTCACGTTGGGGTGCCCCTGGC 

TTGCAATTACGTGAAGCTGGAAGATGTGGCTGACATGAACTACTTTACAGTGAATAATGAAGGAGAGGTCTGCAT 

CAAGGGTACAAACGTGTTCAAAGGATACCTGAAGGACCCTGAGAAGACACAGGAAGCCCTGGACAGTGATGGCTG 

GCTTCACACAGGAGACATTGGTCGCTGGCTCCCGAATGGAACTCTGAAGATCATCGACCGTAAAAAGAACATTTT 

CAAGCTGGCCCAAGGAGAATACATTGCACCAGAGAAGATAGAAAATATCTACAACAGGAGTCAACCAGTGTTACA 

AATTTTTGTACACGGGGAGAGCTTACGGTCATCCTTAGTAGGAGTGGTGGTTCCTGACACAGATGTACTTCCCTC 

ATTTGCAGCCAAGCTTGGGGTGAAGGGCTCCTTTGAGGAACTGTGCCAAAACCAAGTTGTAAGGGAAGCCATTTT 

AGAAGACTTGCAGAAAATTGGGAAAGAAAGTGGCCTTAAAACTTTTGAACAGGTCAAAGCCATTTTTCTTCATCC 

AGAGCCATTTTCCATTGAAAATGGGCTCTTGACACCAACATTGAAAGCAAAGCGAGGAGAGCTTTCCAAATACTT 

TCGGACCCAAATTGACAGCCTGTATGAGCACATCCAGGATTAGGATAAGGTACTTAAGTACCTGCCGGCCCACTG 

TGCACTGCTTGTGAGAAAATGGATTAAAAACTATTCTTACATTTGTTTTGCCTTTCCTCCTATTTTTTTTTAACC 

TGTTAAACTCTAAAGCCATAGCTTTTGTTTTATATTGAGACATATAATGTGTAAACTTAGTTCCCAAATAAATCA 

ATCCTGTCTTTCCCATCTTCGATGTTGCTAATATTAAGGCTTCAGGGCTACTTTTATCAACATGCCTGTCTTCAA 

GATCCCAGTTTATGTTCTGTGTCCTTCCTCATGATTTCCAACCTTAATACTATTAGTAACCACAAGTTCAAGGGT 

CAAAGGGACCCTCTGTGCCTTCTTCTTTGTTTTGTGATAAACATAACTTGCCAACAGTCTCTATGCTTATTTACA 

TCTTCTACTGTTCAAACTAAGAGATTTTTAAATTCTGAAAAACTGCTTACAATTCATGTTTTCTAGCCACTCCAC 

AAACCACTAAAATTTTAGTTTTAGCCTATCACTCATGTCAATCATATCTATGAGACAAATGTCTCCGATGCTCTT 

CTGCGTAAATTAAATTGTGTACTGAAGGGAAAAGTTTGATCATACCAAACATTTCCTAAACTCTCTAGTTAGATA 

TCTGACTTGGGAGTATTAAAAATTGGGTCTATGACATACTGTCCAAAAGGAATGCTGTTCTTAAAGCATTATTTA 

CAGTAGGAACTGGGGAGTAAATCTGTTCCCTACAGTTTGCTGCTGAGCTGGAAGCTGTGGGGGAAGGAGTTGACA 

GGTGGGCCCAGTGAACTTTTCCAGTAAATGAAGCAAGCACTGAATAAAAACCTCCTGAACTGGGAACAAAGATCT 

ACAGGCAAGCAAGATGCCCACACAACAGGCTTATTTTCTGTGAAGGAACCAACTGATCTCCCCCACCCTTGGATT 

AGAGTTCCTGCTCTACCTTACCCACAGATAACACATGTTGTTTCTACTTGTAAATGTAAAGTCTTTAAAATAAAC 
TATTACAGATAAAAAA 
FIGURE 478

MDALKPPCLWRNHERGKKDRDSCGRKNSEPGSPHSLEALRDAAPSQGLNFLLLFTKMLFIFNF 
LFSPLPTPALICILTFGAAIFLWLITRPQPVLPLLDLNNQSVGIEGGARKGVSQKNNDLTSCC 
FSDAKTMYEVFQRGLAVSDNGPCLGYRKPNQPYRWLSYKQVSDRAEYLGSCLLHKGYKSSPDQ 
FVGIFAQNRPEWIISELACYTYSMVAVPLYDTLGPEAIVHIVNKADIAMVICDTPQKALVLIG
NVEKGFTPSLKVIILMDPFDDDLKQRGEKSGIEILSLYDAENLGKEHFRKPVPPSPEDLSVIC 
FTSGTTGDPKGAMITHQNIVSNAAAFLKCVEHAYEPTPDDVAISYLPLAHMFERIVQAVVYSC
GARVGFFQGDIRLLADDMKTLKPTLFPAVPRLLNRIYDKVQNEAKTPLKKFLLKLAVSSKFKE 
LQKGIIRHDSFWDKLIFAKIQDSLGGRVRVIVTGAAPMSTSVMTFFRAAMGCQVYEAYGQTEC
TGGCTFTLPGDWTSGHVGVPLACNYVKLEDVADMNYFTVNNEGEVCIKGTNVFKGYLKDPEKT 
QEALDSDGWLHTGDIGRWLPNGTLKIIDRKKNIFKLAQGEYIAPEKIENIYNRSQPVLQIFVH 
GESLRSSLVGVVVPDTDVLPSFAAKLGVKGSFEELCQNQVVREAILEDLQKIGKESGLKTFEQ
VKAIFLHPEPFSIENGLLTPTLKAKRGELSKYFRTQIDSLYEHIQD 

Important features:

Type II transmembrane domain:

amino acids 61-80

Putative AMP-binding domain signature:

amino acids 314-325

N-glycosylation sites:

amino acids 102-105, 588-591 and 619-622 
FIGURE 479

GGAGGCGGAGGCCGCGGCGAGCCGGGCCGAGCAGTGAGGGCCCTAGCGGGGCCCGAGCGGGGC
CCGGGGCCCCTAAGCCATTCCTGAAGTCATGGGCTGGCCAGGACATTGGTGACCCGCCAATCC 
GGTATGGACGACTGGAAGCCCAGCCCCCTCATCAAGCCCTTTGGGGCTCGGAAGAAGCGGAGC 

tggtaccttacctggaagtataaactgacaaaccagcgggccctgcggagattctgtcagaca

GGGGCCGTGCTTTTCCTGCTGGTGACTGTCATTGTCAATATCAAGTTGATCCTGGACACTCGG 

cgagccatcagtgaagccaatgaagacccagagccagagcaagactatgatgaggccctaggc 

CGCCTGGAGCCCCCACGGCGCAGAGGCAGTGGTCCCCGGCGGGTCCTGGACGTAGAGGTGTAT 

tcaagtcgcagcaaagtatatgtggcagtggatggcaccacggtgctggaggatgaggcccgg 

GAGCAGGGCCGGGGCATCCATGTCATTGTCCTCAACCAGGCCACGGGCCACGTGATGGCAAAA 

cgtgtgtttgacacgtactcacctcatgaggatgaggccatggtgctattcctcaacatggta 

GCGCCCGGCCGAGTGCTCATCTGCACTGTCAAGGATGAGGGCTCCTTCCACCTCAAGGACACA 

gccaaggctctgctgaggagcctgggcagccaggctggccctgccctgggctggagggacaca 
tgggccttcgtgggacgaaaaggaggtcctgtcttcggggagaaacattctaagtcacctgcc 
ctctcttcctggggggacccagtcctgctgaagacagatgtgccattgagctcagcagaagag 

GCAGAGTGCCACTGGGCAGACACAGAGCTGAACCGTCGCCGCCGGCGCTTCTGCAGCAAAGTT 

gagggctatggaagtgtatgcagctgcaaggaccccacacccatcgagttcagccctgaccca 

CTCCCAGACAACAAGGTCCTCAATGTGCCTGTGGCTGTCATTGCAGGGAACCGACCCAATTAC 

ctgtacaggatgctgcgctctctgctttcagcccagggggtgtctcctcagatgataacagtt 

TTCATTGACGGCTACTATGAGGAACCCATGGATGTGGTGGCACTGTTTGGTCTGAGGGGCATC 

cagcatactcccatcagcatcaagaatgcccgcgtgtctcagcactacaaggccagcctcact 

GCCACTTTCAACCTGTTTCCGGAGGCCAAGTTTGCTGTGGTTCTGGAAGAGGACCTGGACATT 

gctgtggattttttcagtttcctgagccaatccatccacctactggaggaggatgacagcctg 

TACTGCATCTCTGCCTGGAATGACCAGGGGTATGAACACACGGCTGAGGACCCAGCACTACTG 

taccgtgtggagaccatgcctgggctgggctgggtgctcaggaggtccttgtacaaggaggag 

CTTGAGCCCAAGTGGCCTACACCGGAAAAGCTCTGGGATTGGGACATGTGGATGCGGATGCCT 

gaacaacgccggggccgagagtgcatcatccctgacgtttcccgatcctaccactttggcatc 

GTCGGCCTCAACATGAATGGCTACTTTCACGAGGCCTACTTCAAGAAGCACAAGTTCAACACG 

gttccaggtgtccagctcaggaatgtggacagtctgaagaaagaagcttatgaagtggaagtt 

CACAGGCTGCTCAGTGAGGCTGAGGTTCTGGACCACAGCAAGAACCCTTGTGAAGACTCTTTC 

ctgccagacacagagggccacacctacgtggcctttattcgaatggagaaagatgatgacttc 

ACCACCTGGACCCAGCTTGCCAAGTGCCTCCATATCTGGGACCTGGATGTGCGTGGCAACCAT 

cggggcctgtggagattgtttcggaagaagaaccacttcctggtggtgggggtcccggcttcc 

CCCTACTCAGTGAAGAAGCCACCCTCAGTCACCCCAATTTTCCTGGAGCCACCCCCAAAGGAG 

gagggagccccaggagccccagaacagacatgagacctcctccaggaccctgcggggctgggt 

ACTGTGTACCCCCAGGCTGGCTAGCCCTTCCCTCCATCCTGTAGGATTTTGTAGATGCTGGTA 

ggggctggggctaccttgtttttaacatgagacttaattactaactccaaggggagggttccc 

CTGCTCCAACACCCCGTTCCTGAGTTAAAAGTCTATTTATTTACTTCCTTGTTGGAGAAGGGC 

aggagagtacctgggaatcattacgatccctagcagctcatcctgccctttgaataccctcac 

TTTCCAGGCCTGGCTCAGAATCTAACCTATTTATTGACTGTCCTGAGGGCCTTGAAAACAGGC 

cgaacctggagggcctggatttctttttgggctggaatgctgccctgagggtggggctggctc 

TTACTCAGGAAACTGCTGTGCCCAACCCATGGACAGGCCCAGCTGGGGCCCACATGCTGACAC 

agactcactcagagacccttagacactggaccaggcctcctctcagccttctctttgtccaga 

TTTCCAAAGCTGGATAAGTTGGTCATTGATTAAAAAAGGAGAAGCCCTCTGGGAAAAAAAAAA 
AAAAAAAAAAAAAAAA 
FIGURE 480



MDDWKPSPLIKPFGARKKRSWYLTWKYKLTNQRALRRFCQTGAVLFLLVTVIVNIKLILDTRR 
AISEANEDPEPEQDYDEALGRLEPPRRRGSGPRRVLDVEVYSSRSKVYVAVDGTTVLEDEARE 
QGRGIHVIVLNQATGHVMAKRVFDTYSPHEDEAMVLFLNMVAPGRVLICTVKDEGSFHLKDTA 
KALLRSLGSQAGPALGWRDTWAFVGRKGGPVFGEKHSKSPALSSWGDPVLLKTDVPLSSAEEA 
ECHWADTELNRRRRRFCSKVEGYGSVCSCKDPTPIEFSPDPLPDNKVLNVPVAVIAGNRPNYL 
YRMLRSLLSAQGVSPQMITVFIDGYYEEPMDVVALFGLRGIQHTPISIKNARVSQHYKASLTA 
TFNLFPEAKFAVVLEEDLDIAVDFFSFLSQSIHLLEEDDSLYCISAWNDQGYEHTAEDPALLY
RVETMPGLGWVLRRSLYKEELEPKWPTPEKLWDWDMWMRMPEQRRGRECIIPDVSRSYHFGIV 
GLNMNGYFHEAYFKKHKFNTVPGVQLRNVDSLKKEAYEVEVHRLLSEAEVLDHSKNPCEDSFL 
PDTEGHTYVAFIRMEKDDDFTTWTQLAKCLHIWDLDVRGNHRGLWRLFRKKNHFLVVGVPASP
YSVKKPPSVTPIFLEPPPKEEGAPGAPEQT 

Important features: 
Transmembrane domain: 

amino acids 38-55 

Homologous region to Mouse GNT1:

amino acids 229-660 
FIGURE 481

GAAAGAATGTTGTGGCTGCTCTTTTTTCTGGTGACTGCCATTCATGCTGAACTCTGTCAACCA 

GGTGCAGAAAATGCTTTTAAAGTGAGACTTAGTATCAGAACAGCTCTGGGAGATAAAGCATAT 

GCCTGGGATACCAATGAAGAATACCTCTTCAAAGCGATGGTAGCTTTCTCCATGAGAAAAGTT 

CCCAACAGAGAAGCAACAGAAATTTCCCATGTCCTACTTTGCAATGTAACCCAGAGGGTATCA 

TTCTGGTTTGTGGTTACAGACCCTTCAAAAAATCACACCCTTCCTGCTGTTGAGGTGCAATCA

GCCATAAGAATGAACAAGAACCGGATCAACAATGCCTTCTTTCTAAATGACCAAACTCTGGAA 

TTTTTAAAAATCCCTTCCACACTTGCACCACCCATGGACCCATCTGTGCCCATCTGGATTATT 

ATATTTGGTGTGATATTTTGCATCATCATAGTTGCAATTGCACTACTGATTTTATCAGGGATC 

TGGCAACGTAGAAGAAAGAACAAAGAACCATCTGAAGTGGATGACGCTGAAGATAAGTGTGAA 

AACATGATCACAATTGAAAATGGCATCCCCTCTGATCCCCTGGACATGAAGGGGGGCATATTA

ATGATGCCTTCATGACAGAGGATGAGAGGCTCACCCCTCTCTGAAGGGCTGTTGTTCTGCTTC

CTCAAGAAATTAAACATTTGTTTCTGTGTGACTGCTGAGCATCCTGAAATACCAAGAGCAGAT 

CATATATTTTGTTTCACCATTCTTCTTTTGTAATAAATTTTGAATGTGCTTGAAAGTGAAAAG 

CAATCAATTATACCCACCAACACCACTGAAATCATAAGCTATTCACGACTCAAAATATTCTAA 

AATATTTTTCTGACAGTATAGTGTATAAATGTGGTCATGTGGTATTTGTAGTTATTGATTTAA

GCATTTTTAGAAATAAGATCAGGCATATGTATATATTTTCACACTTCAAAGACCTAAGGAAAA 

ATAAATTTTCCAGTGGAGAATACATATAATATGGTGTAGAAATCATTGAAAATGGATCCTTTT 

TGACGATCACTTATATCACTCTGTATATGACTAAGTAAACAAAAGTGAGAAGTAATTATTGTA 

AATGGATGGATAAAAATGGAATTACTCATATACAGGGTGGAATTTTATCCTGTTATCACACCA 

ACAGTTGATTATATATTTTCTGAATATCAGCCCCTAATAGGACAATTCTATTTGTTGACCATT 

TCTACAATTTGTAAAAGTCCAATCTGTGCTAACTTAATAAAGTAATAATCATCTCTTTTTAAA 
AAAAAAAAAAAAAAAAAA2\AAAA 
FIGURE 482

MLWLLFFLVTAIHAELCQPGAENAFKVRLSIRTALGDKAYAWDTNEEYLFKAMVAFSMRKVPN
REATEISHVLLCNVTQRVSFWFVVTDPSKNHTLPAVEVQSAIRMNKNRINNAFFLNDQTLEFL
KIPSTLAPPMDPSVPIWIIIFGVIFCIIIVAIALLILSGIWQRRRKNKEPSEVDDAEDKCENM
ITIENGIPSDPLDMKGGILMMPS 
FIGURE 483

CGTCTCTGCGTTCGCCATGCGTCCCGGGGCGCCAGGGCCACTCTGGCCTCTGCCCTGGGGGGC 

CCTGGCTTGGGCCGTGGGCTTCGTGAGCTCCATGGGCTCGGGGAACCCCGCGCCCGGTGGTGT 

TTGCTGGCTCCAGCAGGGCCAGGAGGCCACCTGCAGCCTGGTGCTCCAGACTGATGTCACCCG 

GGCCGAGTGCTGTGCCTCCGGCAACATTGACACCGCCTGGTCCAACCTCACCCACCCGGGGAA 

CAAGATCAACCTCCTCGGCTTCTTGGGCCTTGTCCACTGCCTTCCCTGCAAAGATTCGTGCGA 

CGGCGTGGAGTGCGGCCCGGGCAAGGCGTGCCGCATGCTGGGGGGCCGCCCGCGCTGCGAGTG 

CGCGCCCGACTGCTCGGGGCTCCCGGCGCGGCTGCAGGTCTGCGGCTCAGACGGCGCCACCTA 

CCGCGACGAGTGCGAGCTGCGCGCCGCGCGCTGCCGCGGCCACCCGGACCTGAGCGTCATGTA 

CCGGGGCCGCTGCCGCAAGTCCTGTGAGCACGTGGTGTGCCCGCGGCCACAGTCGTGCGTCGT 

GGACCAGACGGGCAGCGCCCACTGCGTGGTGTGTCGAGCGGCGCCCTGCCCTGTGCCCTCCAG 

CCCCGGCCAGGAGCTTTGCGGCAACAACAACGTCACCTACATCTCCTCGTGCCACATGCGCCA 

GGCCACCTGCTTCCTGGGCCGCTCCATCGGCGTGCGCCACGCGGGCAGCTGCGCAGGCACCCC 

TGAGGAGCCGCCAGGTGGTGAGTCTGCAGAAGAGGAAGAGAACTTCGTGTGAGCCTGCAGGAC 

AGGCCTGGGCCTGGTGCCCGAGGCCCCCCATCATCCCCTGTTATTTATTGCCACAGCAGAGTC 

TAATTTATATGCCACGGACACTCCTTAGAGCCCGGATTCGGACCACTTGGGGATCCCAGAACC 

TCCCTGACGATATCCTGGAAGGACTGAGGAAGGGAGGCCTGGGGGCCGGCTGGTGGGTGGGAT 

AGACCTGCGTTCCGGACACTGAGCGCCTGATTTAGGGCCCTTCTCTAGGATGCCCCAGCCCCT 

ACCCTAAGACCTATTGCCGGGGAGGATTCCACACTTCCGCTCCTTTGGGGATAAACCTATTAA 

TTATTGCTACTATCAAGAGGGCTGGGCATTCTCTGCTGGTAATTCCTGAAGAGGCATGACTGC 

TTTTCTCAGCCCCAAGCCTCTAGTCTGGGTGTGTACGGAGGGTCTAGCCTGGGTGTGTACGGA 

GGGTCTAGCCTGGGTGAGTACGGAGGGTCTAGCCTGGGTGAGTACGGAGGGTCTAGCCTGGGT 

GAGTACGGAGGGTCTAGCCTGGGTGTGTATGGAGGATCTAGCCTGGGTGAGTATGGAGGGTCT 

AGCCTGGGTGAGTATGGAGGGTCTAGCCTGGGTGTGTATGGAGGGTCTAGCCTGGGTGAGTAT 

GGAGGGTCTAGCCTGGGTGTGTATGGAGGGTCTAGCCTGGGTGAGTATGGAGGGTCTAGCCTG 

GGTGTGTACGGAGGGTCTAGTCTGAGTGCGTGTGGGGACCTCAGAACACTGTGACCTTAGCCC 

AGCAAGCCAGGCCCTTCATGAAGGCCAAGAAGGCTGCCACCATTCCCTGCCAGCCCAAGAACT 

CCAGCTTCCCCACTGCCTCTGTGTGCCCCTTTGCGTCCTGTGAAGGCCATTGAGAAATGCCCA 

GTGTGCCCCCTGGGAAAGGGCACGGCCTGTGCTCCTGACACGGGCTGTGCTTGGCCACAGAAC 

CACCCAGCGTCTCCCCTGCTGCTGTCCACGTCAGTTCATGAGGCAACGTCGCGTGGTCTCAGA 

CGTGGAGCAGCCAGCGGCAGCTCAGAGCAGGGCACTGTGTCCGGCGGAGCCAAGTCCACTCTG 

GGGGAGCTCTGGCGGGGACCACGGGCCACTGCTCACCCACTGGCCCCGAGGGGGGTGTAGACG 

CCAAGACTCACGCATGTGTGACATCCGGAGTCCTGGAGCCGGGTGTCCCAGTGGCACCACTAG 

GTGCCTGCTGCCTCCACAGTGGGGTTCACACCCAGGGCTCCTTGGTCCCCCACAACCTGCCCC 

GGCCAGGCCTGCAGACCCAGACTCCAGCCAGACCTGCCTCACCCACCAATGCAGCCGGGGCTG 

GCGACACCAGCCAGGTGCTGGTCTTGGGCCAGTTCTCCCACGACGGCTCACCCTCCCCTCCAT 

CTGCGTTGATGCTCAGAATCGCCTACCTGTGCCTGCGTGTAAACCACAGCCTCAGACCAGCTA 

TGGGGAGAGGACAACACGGAGGATATCCAGCTTCCCCGGTCTGGGGTGAGGAATGTGGGGAGC 

TTGGGCATCCTCCTCCAGCCTCCTCCAGCCCCCAGGCAGTGCCTTACCTGTGGTGCCCAGAAA 

AGTGCCCCTAGGTTGGTGGGTCTACAGGAGCCTCAGCCAGGCAGCCCACCCCACCCTGGGGCC 
CTGCCTCACCAAGGAAATAAAGACTCAAGCCATAAAAAAAA 
FIGURE 484

MRPGAPGPLWPLPWGALAWAVGFVSSMGSGNPAPGGVCWLQQGQEATCSLVLQTDVTRAECCA 

SGNIDTAWSNLTHPGNKINLLGFLGLVHCLPCKDSCDGVECGPGKACRMLGGRPRCECAPDCS 

GLPARLQVCGSDGATYRDECELRAARCRGHPDLSVMYRGRCRKSCEHVVCPRPQSCVVDQTGS

AHCVVCRAAPCPVPSSPGQELCGNNNVTYISSCHMRQATCFLGRSIGVRHAGSCAGTPEEPPG 
GESAEEEENFV

Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation sites:

amino acids 73-77, 215-219 

Osteonectin domain proteins:

amino acids 97-130, 169-202 
FIGURE 485

GCTCGAGGCCGGCGGCGGCGGGAGAGCGACCCGGGCGGCCTCGTAGCGGGGCCCCGGATCCCC 

GAGTGGCGGCCGGAGCCTCGAAAAGAGATTCTCAGCGCTGATTTTGAGATGATGGGCTTGGGA

AACGGGCGTCGCAGCATGAAGTCGCCGCCCCTCGTGCTGGCCGCCCTGGTGGCCTGCATCATC 

GTCTTGGGCTTCAACTACTGGATTGCGAGCTCCCGGAGCGTGGACCTCCAGACACGGATCATG 

GAGCTGGAAGGCAGGGTCCGCAGGGCGGCTGCAGAGAGAGGCGCCGTGGAGCTGAAGAAGAAC 

GAGTTCCAGGGAGAGCTGGAGAAGCAGCGGGAGCAGCTTGACAAAATCCAGTCCAGCCACAAC 

TTCCAGCTGGAGAGCGTCAACAAGCTGTACCAGGACGAAAAGGCGGTTTTGGTGAATAACATC 

ACCACAGGTGAGAGGCTCATCCGAGTGCTGCAAGACCAGTTAAAGACCCTGCAGAGGAATTAC 

GGCAGGCTGCAGCAGGATGTCCTCCAGTTTCAGAAGAACCAGACCAACCTGGAGAGGAAGTTC 

TCCTACGACCTGAGCCAGTGCATCAATCAGATGAAGGAGGTGAAGGAACAGTGTGAGGAGCGA 

ATAGAAGAGGTCACCAAAAAGGGGAATGAAGCTGTAGCTTCCAGAGACCTGAGTGAAAACAAC 

GACCAGAGACAGCAGCTCCAAGCCCTCAGTGAGCCTCAGCCCAGGCTGCAGGCAGCAGGCCTG 

CCACACACAGAGGTGCCACAAGGGAAGGGAAACGTGCTTGGTAACAGCAAGTCCCAGACACCA 

GCCCCCAGTTCCGAAGTGGTTTTGGATTCAAAGAGACAAGTTGAGAAAGAGGAAACCAATGAG 

ATCCAGGTGGTGAATGAGGAGCCTCAGAGGGACAGGCTGCCGCAGGAGCCAGGCCGGGAGCAG 

GTGGTGGAAGACAGACCTGTAGGTGGAAGAGGCTTCGGGGGAGCCGGAGAACTGGGCCAGACC 

CCACAGGTGCAGGCTGCCCTGTCAGTGAGCCAGGAAAATCCAGAGATGGAGGGCCCTGAGCGA 

GACCAGCTTGTCATCCCCGACGGACAGGAGGAGGAGCAGGAAGCTGCCGGGGAAGGGAGAAAC 

CAGCAGAAACTGAGAGGAGAAGATGACTACAACATGGATGAAAATGAAGCAGAATCTGAGACA 

GACAAGCAAGCAGCCCTGGCAGGGAATGACAGAAACATAGATGTTTTTAATGTTGAAGATCAG 

AAAAGAGACACCATAAATTTACTTGATCAGCGTGAj^AAGCGGAATCATACACT CTCaVA TTGAA 

CTGGAATCACATATTTCACAACAGGGCCGAAGAGATGACTATAAAATGTTCATGAGGGACTGA 

ATACTGAAAACTGTGAAATGTACTAAATAAAATGTACATCTGA 




FIGURE 486 

MMGLGNGRRSMKSPPLVLAALVACIIVLGFNYWIASSRSVDLQTRIMELEGRVRRAAAERGAV 
ELKKNEFQGELEKQREQLDKIQSSHNFQLESVNKLYQDEKAVLVNNITTGERLIRVLQDQLKT 
LQRNYGRLQQDVLQFQKNQTKLERKFSYDLSQCINQMKEVKEQCEERIEEVTKKGNEAVASRD 
LSENNDQRQQLQALSEPQPRLQAAGLPHTEVPQGKGNVLGNSKSQTPAPSSEVVLDSKRQVEK
EETNEIQVVNEEPQRDRLPQEPGREQVVEDRPVGGRGFGGAGELGQTPQVQAALSVSQENPEM
EGPERDQLVIPDGQEEEQEAAGEGRNQQKLRGEDDYNMDENEAESETDKQAALAGNDRNIDVF 
NVEDQKRDTINLLDQREKRNHTL 

Important features:
Signal peptide: 

amino acids 1-29 
FIGURE 487

AACTCAAACTCCTCTCTCTGGGAAAACGCGGTGCTTGCTCCTCCCGGAGTGGCCTTGGCAGGG 

TGTTGGAGCCCTCGGTCTGCCCCGTCCGGTCTCTGGGGCCAAGGCTGGGTTTCCCTCATGTAT 

GGCAAGAGCTCTACTCGTGCGGTGCTTCTTCTCCTTGGCATACAGCTCACAGCTCTTTGGCCT 

ATAGCAGCTGTGGAAATTTATACCTCCCGGGTGCTGGAGGCTGTTAATGGGACAGATGCTCGG 

TTAAAATGCACTTTCTCCAGCTTTGCCCCTGTGGGTGATGCTCTAACAGTGACCTGGAATTTT 

CGTCCTCTAGACGGGGGACCTGAGCAGTTTGTATTCTACTACCACATAGATCCCTTCCAACCC 

ATGAGTGGGCGGTTTAAGGACCGGGTGTCTTGGGATGGGAATCCTGAGCGGTACGATGCCTCC 

ATCCTTCTCTGGAAACTGCAGTTCGACGACAATGGGACATACACCTGCCAGGTGAAGAACCCA 

CCTGATGTTGATGGGGTGATAGGGGAGATCCGGCTCAGCGTCGTGCACACTGTACGCTTCTCT 

GAGATCCACTTCCTGGCTCTGGCCATTGGCTCTGCCTGTGCACTGATGATCATAATAGTAATT 

GTAGTGGTCCTCTTCCAGCATTACCGGAAAAAGCGATGGGCCGAAAGAGCTCATAAAGTGGTG 

GAGATAAAATCAAAAGAAGAGGAAAGGCTCAACCAAGAGAAAAAGGTCTCTGTTTATTTAGAA 

GACACAGACTAACAATTTTAGATGGAAGCTGAGATGATTTCCAAGAACAAGAACCCTAGTATT 

TCTTGAAGTTAATGGAAACTTTTCTTTGGCTTTTCCAGTTGTGACCCGTTTTCCAACCAGTTC 

TGCAGCATATTAGATTCTAGACAAGCAACACCCCTCTGGAGCCAGCACAGTGCTCCTCCATAT 

CACCAGTCATACACAGCCTCATTATTAAGGTCTTATTTAATTTCAGAGTGTAAATTTTTTCAA 

GTGCTCATTAGGTTTTATAAACAAGAAGCTACATTTTTGCCCTTAAGACACTACTTACAGTGT 

TATGACTTGTATACACATATATTGGTATCAAAGGGGATAAAAGCCAATTTGTCTGTTACATTT 

CCTTTCACGTATTTCTTTTAGCAGCACTTCTGCTACTAAAGTTAATGTGTTTACTCTCTTTCC 

TTCCCACATTCTCAATTAAAAGGTGAGCTAAGCCTCCTCGGTGTTTCTGATTAACAGTAAATC 

CTAAATTCAAACTGTTAAATGACATTTTTATTTTTATGTCTCTCCTTAACTATGAGACACATC 

TTGTTTTACTGAATTTCTTTCAATATTCCAGGTGATAGATTTTTGTCG 
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FIGURE 488 

MYGKSSTRAVLLLLGIQLTALWPIAAVEIYTSRVLEAVNGTDARLKCTFSSFAPVGDALTVTW 
NFRPLDGGPEQFVFYYHIDPFQPMSGRFKDRVSWDGNPERYDASILLWKLQFDDNGTYTCQVK 
NPPDVDGVIGEIRLSVVHTVRFSEIHFLALAIGSACALMIIIVIVVVLFQHYRKKRWAERAHK
VVEIKSKEEERLNQEKKVSVYLEDTD




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FIGURE 489 



AAGCAACCAAACTGCAAGCTTTGGGAGTTGTTCGCTGTCCCTGCCCTGCTCTGCTAGGGAGAG 

AACGCCAGAGGGAGGCGGCTGGCCCGGCGGCAGGCTCTCAGAACCGCTACCGGCGATGCTACT

GCTGTGGGTGTCGGTGGTCGCAGCCTTGGCGCTGGCGGTACTGGCCCCCGGAGCAGGGGAGCA 

GAGGCGGAGAGCAGCCAAAGCGCCCAATGTGGTGCTGGTCGTGAGCGACTCCTTCGATGGAAG 

GTTAACATTTCATCCAGGAAGTCAGGTAGTGAAACTTCCTTTTATCAACTTTATGAAGACACG 

TGGGACTTCCTTTCTGAATGCCTACACAAACTCTCCAATTTGTTGCCCATCACGCGCAGCAAT 

GTGGAGTGGCCTCTTCACTCACTTAACAGAATCTTGGAATAATTTTAAGGGTCTAGATCCAAA 

TTATACAACATGGATGGATGTCATGGAGAGGCATGGCTACCGAACACAGAAATTTGGGAAACT 

GGACTATACTTCAGGACATCACTCCATTAGTAATCGTGTGGAAGCGTGGACAAGAGATGTTGC 

TTTCTTACTCAGACAAGAAGGCAGGCCCATGGTTAATCTTATCCGTAACAGGACTAAAGTCAG 

AGTGATGGAAAGGGATTGGCAGAATACAGACAAAGCAGTAAACTGGTTAAGAAAGGAAGCAAT 

TAATTACACTGAACCATTTGTTATTTACTTGGGATTAAATTTACCACACCCTTACCCTTCACC 

ATCTTCTGGAGAAAATTTTGGATCTTCAACATTTCACACATCTCTTTATTGGCTTGAAAAAGT 

GTCTCATGATGCCATCAAAATCCCAAAGTGGTCACCTTTGTCAGAAATGCACCCTGTAGATTA 

TTACTCTTCTTATACAAAAAACTGCACTGGAAGATTTACAAAAAAAGAAATTAAGAATATTAG 

AGCATTTTATTATGCTATGTGTGCTGAGACAGATGCCATGCTTGGTGAAATTATTTTGGCCCT 

TCATCAATTAGATCTTCTTCAGAAAACTATTGTCATATACTCCTCAGACCATGGAGAGCTGGC 

CATGGAACATCGACAGTTTTATAAAATGAGCATGTACGAGGCTAGTGCACATGTTCCGCTTTT 

GATGATGGGACCAGGAATTAAAGCCGGCCTACAAGTATCAAATGTGGTTTCTCTTGTGGATAT 

TTACCCTACCATGCTTGATATTGCTGGAATTCCTCTGCCTCAGAACCTGAGTGGATACTCTTT 

GTTGCCGTTATCATCAGAAACATTTAAGAATGAACATAAAGTCAAAAACCTGCATCCACCCTG 

GATTCTGAGTGAATTCCATGGATGTAATGTGAATGCCTCCACCTACATGCTTCGAACTAACCA 

CTGGAAATATATAGCCTATTCGGATGGTGCATCAATATTGCCTCAACTCTTTGATCTTTCCTC 

GGATCCAGATGAATTAACAAATGTTGCTGTAAAATTTCCAGAAATTACTTATTCTTTGGATCA 

GAAGCTTCATTCCATTATAAACTACCCTAAAGTTTCTGCTTCTGTCCACCAGTATAATAAAGA 

GCAGTTTATCAAGTGGAAACAAAGTATAGGACAGAATTATTCAAACGTTATAGCAAATCTTAG 

GTGGCACCAAGACTGGCAGAAGGAACCAAGGAAGTATGAAAATGCAATTGATCAGTGGCTTAA

AACCCATATGAATCCAAGAGCAGTTTGAACAAAAAGTTTAAAAATAGTGTTCTAGAGATACAT

ATAAATATATTACAAGATCATAATTATGTATTTTAAATGAAACAGTTTTAATAATTACCAAGT 

TTTGGCCGGGCACAGTGGCTCACACCTGTAATCCCAGGACTTTGGGAGGCTGAGGAAAGCAGA 

TCACAAGGTCAAGAGATTGAGACCATCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAA 

TACAAAAATTAGCTGGGCGCGGTGGTGCACACCTATAGTCTCAGCTACTCAGAGGCTGAGGCA 

GGAGGATCGCTTGAACCCGGGAGGCAGCAGTTGCAGTGAGCTGAGATTGCGCCACTGTACTCC 

AGCCTGGCAACAGAGTGAGACTGTGTCGCAAAAAAATAAAAATAAAATAATAATAATTACCAA 

TTTTTCATTATTTTGTAAGAATGTAGTGTATTTTAAGATAAAATGCCAATGATTATAAAATCA 

CATATTTTCAAAAATGGTTATTATTTAGGCCTTTGTACAATTTCTAACAATTTAGTGGAAGTA 

TCAAAAGGATTGAAGCAAATACTGTAACAGTTATGTTCCTTTAAATAATAGAGAATATAAAAT 

ATTGTAATAATATGTATCATAAAATAGTTGTATGTGAGCATTTGATGGTGAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAA 



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FIGURE 490 

MLLLWVSVVAALALAVLAPGAGEQRRRAAKAPNVVLVVSDSFDGRLTFHPGSQVVKLPFINFM
KTRGTSFLNAYTNSPICCPSRAAMWSGLFTHLTESWNNFKGLDPNYTTWMDVMERHGYRTQKF 
GKLDYTSGHHSISNRVEAWTRDVAFLLRQEGRPMVNLIRNRTKVRVMERDWQNTDKAVNWLRK 
EAINYTEPFVIYLGLNLPHPYPSPSSGENFGSSTFHTSLYWLEKVSHDAIKIPKWSPLSEMHP 
VDYYSSYTKNCTGRFTKKEIKNIRAFYYAMCAETDAMLGEIILALHQLDLLQKTIVIYSSDHG 
ELAMEHRQFYKMSMYEASAHVPLLMMGPGIKAGLQVSNVVSLVDIYPTMLDIAGIPLPQNLSG 
YSLLPLSSETFKNEHKVKNLHPPWILSEFHGCNVNASTYMLRTNHWKYIAYSDGASILPQLFD 
LSSDPDELTNVAVKFPEITYSLDQKLHSIINYPKVSASVHQYNKEQFIKWKQSIGQNYSNVIA 
NLRWHQDWQKEPRKYENAIDQWLKTHMNPRAV

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites:

amino acids 108-111, 166-169, 193-196, 262-265, 375-378, 413-416, 
498-501 



Sulfatases proteins: 

amino acids 286-315, 359-369, 78-97 
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FIGURE 491 

GAGAGAAGTCAGCCTGGCAGAGAGACTCTGAAATGAGGGATTAGAGGTGTTCAAGGAGCAAGA 

GCTTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGC

CTCTCTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGT 

TGCCATGCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGT 

TGGCTTCTCCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGA 

CATCTATAGCACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGAC 

ATCCAGTGCAATCTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTT 

CTGCCAGGAATCCCGAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGG 

AGGCCTCCTGGGATTCATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTC 

ACCACTGGTGCCTGACAGCATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTC 

TTCCCTGTTCTCCCTGATAGCTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCG 

CTCCAACTACTACGATGCCTACCAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGG 

TCAACCTCCCAAAGTCAAGAGTGAGTTCAATTCCTACAGCCTGACAGGGTATGTGTSAAGAAC 

CAGGGGCCAGAGCTGGGGGGTGGCTGGGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCA 

CAGGTGAGGGACACTACCACTGGATCGTGTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGG 

CCATTGGATTGAGCAAAGGCAGAAATGGGGGCTAGTGTAACAGCATGCAGGTTGAATTGCCAA 

GGATGCTCGCCATGCCAGCCTTTCTGTTTTCCTCACCTTGCTGCTCCCCTGCCCTAAGTCCCC 

AACCCTCAACTTGAAACCCCATTCCCTTAAGCCAGGACTCAGAGGATCCCTTTGCCCTCTGGT 

TTACCTGGGACTCCATCCCCAAACCCACTAATCACATCCCACTGACTGACCCTCTGTGATCAA 

AGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATTGCTGGGGATGGGAAGGAGAAGCAGT

GGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTCCCTCCAAAGAAACTGATTGGCCC 

TGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCAGACTAATTTGTGCATGAACTG 

AAATAAAACCATCCTACGGTATCCAGGGAACAGAAAGCAGGATGCAGGATGGGAGGACAGGAA 
GGCAGCCTGGGACATTTAAAAAAATA 



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FIGURE 492 



MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQ 
CDIYSTLLGLPADIQAAQAMMVTSSAISSLACIISVVGMRCTVFCQESRAKDRVAVAGGVFFI 
LGGLLGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQR 
NRSNYYDAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins.

amino acids 46-59 
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FIGURE 493 

GCACTGCTGCTGTCCCATCAGCTGCTCTGAAGCTCCATGGTGCCCAGAATCTTCGCTCCTGCT 
TATGTGTCAGTCTGTCTCCTCCTCTTGTGTCCAAGGGAAGTCATCGCTCCCGCTGGCTCAGAA 
CCATGGCTGTGCCAGCCGGCACCCAGGTGTGGAGACAAGATCTACAACCCCTTGGAGCAGTGC 
TGTTACAATGACGCCATCGTGTCCCTGAGCGAGACCCGCCAATGTGGTCCCCCCTGCACCTTC 
TGGCCCTGCTTTGAGCTCTGCTGTCTTGATTCCTTTGGCCTCACAAACGATTTTGTTGTGAAG 
CTGAAGGTTCAGGGTGTGAATTCCCAGTGCCACTCATCTCCCATCTCCAGTAAATGTGAAAGC 
AGAAGACGTTTTCCCTGAGAAGACATAGAAAGAAAATCAACTTTCACTAAGGCATCTCAGAAA 
CATAGGCTAAGGTAATATGTGTACCAGTAGAGAAGCCTGAGGAATTTACAAAATGATGCAGCT 
CCAAGCCATTGTATGGCCCATGTGGGAGACTGATGGGACATGGAGAATGACAGTAGATTATCA 
GGAAATAAATAAAGTGGTTTTTCCAATGTACACACCTGTAAAA 
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FIGURE 494 

MVPRIFAPAYVSVCLLLLCPREVIAPAGSEPWLCQPAPRCGDKIYNPLEQCCYNDAIVSLSET 
RQCGPPCTFWPCFELCCLDSFGLTNDFVVKLKVQGVNSQCHSSPISSKCESRRRFP



Important features:
Signal peptide: 

amino acids 1-25 



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FIGURE 495 



CTCCACTGCAACCACCCAGAGCCATGGCTCCCCGAGGCTGCATCGTAGCTGTCTTTGCCATTT 

TCTGCATCTCCAGGCTCCTCTGCTCACACGGAGCCCCAGTGGCCCCCATGACTCCTTACCTGA 

TGCTGTGCCAGCCACACAAGAGATGTGGGGACAAGTTCTACGACCCCCTGCAGCACTGTTGCT 

ATGATGATGCCGTCGTGCCCTTGGCCAGGACCCAGACGTGTGGAAACTGCACCTTCAGAGTCT 

GCTTTGAGCAGTGCTGCCCCTGGACCTTCATGGTGAAGCTGATAAACCAGAACTGCGACTCAG 

CCCGGACCTCGGATGACAGGCTTTGTCGCAGTGTCAGCTAATGGAACATCAGGGGAACGATGA 

CTCCTGGATTCTCCTTCCTGGGTGGGCCTGGAGAAAGAGGCTGGTGTTACCTGAGATCTGGGA 

TGCTGAGTGGCTGTTTGGGGGCCAGAGAAACACACACTCAACTGCCCACTTCATTCTGTGACC 

TGTCTGAGGCCCACCCTGCAGCTGCCCTGAGGAGGCCCACAGGTCCCCTTCTAGAATTCTGGA 

CAGCATGAGATGCGTGTGCTGATGGGGGCCCAGGGACTCTGAACCCTCCTGATGACCCCTATG 

GCCAACATCAACCCGGCACCACCCCAAGGCTGGCTGGGGAACCCTTCACCCTTCTGTGAGATT 

TTCCATCATCTCAAGTTCTCTTCTATCCAGGAGCAAAGCACAGGATCATAATAAATTTATGTA 
CTTTATAAATGAAAA 
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FIGURE 496 

MAPRGCIVAVFAIFCISRLLCSHGAPVAPMTPYLMLCQPHKRCGDKFYDPLQHCCYDDAVVPL
ARTQTCGNCTFRVCFEQCCPWTFMVKLINQNCDSARTSDDRLCRSVS 



Important features:
Signal peptide: 

amino acids 1-24 
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FIGURE 497 



TGAAGGACTTTTCCAGGACCCAAGGCCACACACTGGAAGTCTTGCAGCTGAAGGGAGGCACTC 

CTTGGCCTCCGCAGCCGATCACATGAAGGTGGTGCCAAGTCTCCTGCTCTCCGTCCTCCTGGC 

ACAGGTGTGGCTGGTACCCGGCTTGGCCCCCAGTCCTCAGTCGCCAGAGACCCCAGCCCCTCA 

GAACCAGACCAGCAGGGTAGTGCAGGCTCCCAGGGAGGAAGAGGAAGATGAGCAGGAGGCCAG 

CGAGGAGAAGGCCGGTGAGGAAGAGAAAGCCTGGCTGATGGCCAGCAGGCAGCAGCTTGCCAA 

GGAGACTTCAAACTTCGGATTCAGCCTGCTGCGAAAGATCTCCATGAGGCACGATGGCAACAT 

GGTCTTCTCTCCATTTGGCATGTCCTTGGCCATGACAGGCTTGATGCTGGGGGCCACAGGGCC 

GACTGAAACCCAGATCAAGAGAGGGCTCCACTTGCAGGCCCTGAAGCCCACCAAGCCCGGGCT 

CCTGCCTTCCCTCTTTAAGGGACTCAGAGAGACCCTCTCCCGCAACCTGGAACTGGGCCTCTC 

ACAGGGGAGTTTTGCCTTCATCCACAAGGATTTTGATGTCAAAGAGACTTTCTTCAATTTATC 

CAAGAGGTATTTTGATACAGAGTGCGTGCCTATGAATTTTCGCAATGCCTCACAGGCCAAAAG 

GCTCATGAATCATTACATTAACAAAGAGACTCGGGGGAAAATTCCCAAACTGTTTGATGAGAT 

TAATCCTGAAACCAAATTAATTCTTGTGGATTACATCTTGTTCAAAGGGAAATGGTTGACCCC 

ATTTGACCCTGTCTTCACCGAAGTCGACACTTTCCACCTGGACAAGTACAAGACCATTAAGGT 

GCCCATGATGTACGGTGCAGGCAAGTTTGCCTCCACCTTTGACAAGAATTTTCGTTGTCATGT 

CCTCAAACTGCCCTACCAAGGAAATGCCACCATGCTGGTGGTCCTCATGGAGAAAATGGGTGA 

CCACCTCGCCCTTGAAGACTACCTGACCACAGACTTGGTGGAGACATGGCTCAGAAACATGAA 

AACCAGAAACATGGAAGTTTTCTTTCCGAAGTTCAAGCTAGATCAGAAGTATGAGATGCATGA 

GCTGCTTAGGCAGATGGGAATCAGAAGAATCTTCTCACCCTTTGCTGACCTTAGTGAACTCTC 

AGCTACTGGAAGAAATCTCCAAGTATCCAGGGTTTTACGAAGAACAGTGATTGAAGTTGATGA 

AAGGGGCACTGAGGCAGTGGCAGGAATCTTGTCAGAAATTACTGCTTATTCCATGCCTCCTGT 

CATCAAAGTGGACCGGCCATTTCATTTCATGATCTATGAAGAAACCTCTGGAATGCTTCTGTT 

TCTGGGCAGGGTGGTGAATCCGACTCTCCTATAATTCAGGACATGCATAAGCACTTCGTGCTG

TAGTAGATGCTGAATCTGAGGTATCAAACACACACAGGATACCAGCAATGGATGGCAGGGGAG 

AGTGTTCCTTTTGTTCTTAACTAGTTTAGGGTGTTCTCAAATAAATACAGTAGTCCCCACTTA 

TCTGAGGGGGATACATTCAAAGACCCCCAGCAGATGCCTGAAACGGTGGACAGTGCTGAACCT 

TATATATATTTTTTCCTACACATACATACCTATGATAAAGTTTAATTTATAAATTAGGCACAG 

TAAGAGATTAACAATAATAACAACATTAAGTAAAATGAGTTACTTGAACGCAAGCACTGCAAT 

ACCATAACAGTCAAACTGATTATAGAGAAGGCTACTAAGTGACTCATGGGCGAGGAGCATAGA 

CAGTGTGGAGACATTGGGCAAGGGGAGAATTCACATCCTGGGTGGGACAGAGCAGGACGATGC 

AAGATTCCATCCCACTACTCAGAATGGCATGCTGCTTAAGACTTTTAGATTGTTTATTTCTGG 

AATTTTTCATTTAATGTTTTTGGACCATGGTTGACCATGGTTAACTGAGACTGCAGAAAGCAA 

AACCATGGATAAGGGAGGACTACTACAAAAGCATTAAATTGATACATATTTTTTAAAAAAAAA 
AAAAAAAAAA 



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FIGURE 498 

MKVVPSLLLSVLLAQVWLVPGLAPSPQSPETPAPQNQTSRVVQAPREEEEDEQEASEEKAGEE

EKAWLMASRQQLAKETSNFGFSLLRKISMRHDGNMVFSPFGMSLAMTGLMLGATGPTETQIKR 

GLHLQALKPTKPGLLPSLFKGLRETLSRNLELGLSQGSFAFIHKDFDVKETFFNLSKRYFDTE 

CVPMNFRNASQAKRLMNHYINKETRGKIPKLFDEINPETKLILVDYILFKGKWLTPFDPVFTE 

VDTFHLDKYKTIKVPMMYGAGKFASTFDKNFRCHVLKLPYQGNATMLVVLMEKMGDHLALEDY 

LTTDLVETWLRNMKTRNMEVFFPKFKLDQKYEMHELLRQMGIRRIFSPFADLSELSATGRNLQ 

VSRVLRRTVIEVDERGTEAVAGILSEITAYSMPPVIKVDRPFHFMIYEETSGMLLFLGRVVNP
TLL 
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FIGURE 499 

CTAGCCTGCGCCAAGGGGTAGTGAGACCGCGCGGCAACAGCTTGCGGCTGCGGGGAGCTCCCG 
TGGGCGCTCCGCTGGCTGTGCAGGCGGCCATGGATTCCTTGCGGAAAATGCTGATCTCAGTCG 
CAATGCTGGGCGCAGGGGCTGGCGTGGGCTACGCGCTCCTCGTTATCGTGACCCCGGGAGAGC 
GGCGGAAGCAGGT^TGCTAAAGGAGATGCCACTGCAGGACCCAAGGAGCAGGGAGGAGGCGG 
CCAGGACCCAGCAGCTATTGCTGGCCACTCTGCAGGAGGCAGCGACCACGCAGGAGAACGTGG 
CCTGGAGGAAGAACTGGATGGTTGGCGGCGAAGGCGGCGCCAGCGGGAGGTCACCGTGAGACC 
GGACTTGCCTCCGTGGGCGCCGGACCTTGGCTTGGGCGCAGGAATCCGAGGCAGCCTTTCTCC 
TTCGTGGGCCCAGCGGAGAGTCCGGACCGAGATACCATGCCAGGACTCTCCGGGGTCCTGTGA 
GCTGCCGTCGGGTGAGCACGTTTCCCCCAAACCCTGGACTGACTGCTTTAAGGTCCGCAAGGC 
GGGCCAGGGCCGAGACGCGAGTCGGATGTGGTGAACTGAAAGAACCAATAAAATCATGTTCCT 
CCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 





FIGURE 500 

MDSLRKMLISVAMLGAGAGVGYALLVIVTPGERRKQEMLKEMPLQDPRSREEAARTQQLLLAT 
LQEAATTQENVAWRKNWMVGGEGGASGRSP

Important features: 
Signal peptide: 


amino acids 1-18 
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FIGURE 501 



CAGGAGAGAAGGCACCGCCCCCACCCCGCCTCCAAAGCTAACCCTCGGGCTTGAGGGGAAGAG 
GCTGACTGTACGTTCCTTCTACTCTGGCACCACTCTCCAGGCTGCCATGGGGCCCAGCACCCC
TCTCCTCATCTTGTTCCTTTTGTCATGGTCGGGACCCCTCCAAGGACAGCAGCACCACCTTGT 
GGAGTACATGGAACGCCGACTAGCTGCTTTAGAGGAACGGCTGGCCCAGTGCCAGGACCAGAG 
TAGTCGGCATGCTGCTGAGCTGCGGGACTTCAAGAACAAGATGCTGCCACTGCTGGAGGTGGC 
AGAGAAGGAGCGGGAGGCACTCAGAACTGAGGCCGACACCATCTCCGGGAGAGTGGATCGTCT 
GGAGCGGGAGGTAGACTATCTGGAGACCCAGAACCCAGCTCTGCCCTGTGTAGAGTTTGATGA 
GAAGGTGACTGGAGGCCCTGGGACCAAAGGCAAGGGAAGAAGGAATGAGAAGTACGATATGGT 
GACAGACTGTGGCTACACAATCTCTCAAGTGAGATCAATGAAGATTCTGAAGCGATTTGGTGG 
CCCAGCTGGTCTATGGACCAAGGATCCACTGGGGCAAACAGAGAAGATCTACGTGTTAGATGG 
GACACAGAATGACACAGCCTTTGTCTTCCCAAGGCTGCGTGACTTCACCCTTGCCATGGCTGC 
CCGGAAAGCTTCCCGAGTCCGGGTGCCCTTCCCCTGGGTAGGCACAGGGCAGCTGGTATATGG 
TGGCTTTCTTTATTTTGCTCGGAGGCCTCCTGGAAGACCTGGTGGAGGTGGTGAGATGGAGAA 
CACTTTGCAGCTAATCAAATTCCACCTGGCAAACCGAACAGTGGTGGACAGCTCAGTATTCCC 
AGCAGAGGGGCTGATCCCCCCCTACGGCTTGACAGCAGACACCTACATCGACCTGGTAGCTGA 
TGAGGAAGGTCTTTGGGCTGTCTATGCCACCCGGGAGGATGACAGGCACTTGTGTCTGGCCAA 
GTTAGATCCACAGACACTGGACACAGAGCAGCAGTGGGACACACCATGTCCCAGAGAGAATGC 
TGAGGCTGCCTTTGTCATCTGTGGGACCCTCTATGTCGTCTATAACACCCGTCCTGCCAGTCG 
GGCCCGCATCCAGTGCTCCTTTGATGCCAGCGGCACCCTGACCCCTGAACGGGCAGCACTCCC 
TTATTTTCCCCGCAGATATGGTGCCCATGCCAGCCTCCGCTATAACCCCCGAGAACGCCAGCT 
CTATGCCTGGGATGATGGCTACCAGATTGTCTATAAGCTGGAGATGAGGAAGAAAGAGGAGGA 
GGTTTGAGGAGCTAGCCTTGTTTTTTGCATCTTTCTCACTCCCATACATTTATATTATATCCC
CACTAAATTTCTTGTTCCTCATTCTTCAAATGTGGGCCAGTTGTGGCTCAAATCCTCTATATT 
TTTAGCCAATGGCAATCAAATTCTTTCAGCTCCTTTGTTTCATACGGAACTCCAGATCCTGAG
TAATCCTTTTAGAGCCCGAAGAGTCAAAACCCTCAATGTTCCCTCCTGCTCTCCTGCCCCATG 
TCAACAAATTTCAGGCTAAGGATGCCCCAGACCCAGGGCTCTAACCTTGTATGCGGGCAGGCC 
CAGGGAGCAGGCAGCAGTGTTCTTCCCCTCAGAGTGACTTGGGGAGGGAGAAATAGGAGGAGA 
CGTCCAGCTCTGTCCTCTCTTCCTCACTCCTCCCTTCAGTGTCCTGAGGAACAGGACTTTCTC 
CACATTGTTTTGTATTGCAACATTTTGCATTAAAAGGAAAATCCACAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 502 

MGPSTPLLILFLLSWSGPLQGQQHHLVEYMERRLAALEERLAQCQDQSSRHAAELRDFKNKML 
PLLEVAEKEREALRTEADTISGRVDRLEREVDYLETQNPALPCVEFDEKVTGGPGTKGKGRRN 
EKYDMVTDCGYTISQVRSMKILKRFGGPAGLWTKDPLGQTEKIYVLDGTQNDTAFVFPRLRDF 
TLAMAARKASRVRVPFPWVGTGQLVYGGFLYFARRPPGRPGGGGEMENTLQLIKFHLANRTVV 
DSSVFPAEGLIPPYGLTADTYIDLVADEEGLWAVYATREDDRHLCLAKLDPQTLDTEQQWDTP 
CPRENAEAAFVICGTLYVVYNTRPASRARIQCSFDASGTLTPERAALPYFPRRYGAHASLRYN
PRERQLYAWDDGYQIVYKLEMRKKEEEV 

Important features:
Signal peptide: 

amino acids 1-21 



N-glycosylation sites. 

amino acids 177-180, 248-251
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FIGURE 503 



TGCGGCGCAGTGTAGACCTGGGAGGATGGGCGGCCTGCTGCTGGCTGCTTTTCTGGCTTTGGT 
CTCGGTGCCCAGGGCCCAGGCCGTGTGGTTGGGAAGACTGGACCCTGAGCAGCTTCTTGGGCC 
CTGGTACGTGCTTGCGGTGGCCTCCCGGGAAAAGGGCTTTGCCATGGAGAAGGACATGAAGAA 
CGTCGTGGGGGTGGTGGTGACCCTCACTCCAGAAAACAACCTGCGGACGCTGTCCTCTCAGCA 
CGGGCTGGGAGGGTGTGACCAGAGTGTCATGGACCTGATAAAGCGAAACTCCGGATGGGTGTT 
TGAGAATCCCTCAATAGGCGTGCTGGAGCTCTGGGTGCTGGCCACCAACTTCAGAGACTATGC 
CATCATCTTCACTCAGCTGGAGTTCGGGGACGAGCCCTTCAACACCGTGGAGCTGTACAGTCT 
GACGGAGACAGCCAGCCAGGAGGCCATGGGGCTCTTCACCAAGTGGAGCAGGAGCCTGGGCTT 
CCTGTCACAGS^AGGCCCAGCTGCAGAAGGACCTCACCTGTGCTCACAAGATCCTTCTGTG 
AGTGCTGCGTCCCCAGTAGGGATGGCGCCCACAGGGTCCTGTGACCTCGGCCAGTGTCCACCC 
ACCTCGCTCAGCGGCTCCCGGGGCCCAGCACCAGCTCAGAATAAAGCGATTCCACAGCA 
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FIGURE 504 



MGGLLLAAFLALVSVPRAQAVWLGRLDPEQLLGPWYVLAVASREKGFAMEKDMKNVVGVVVTL
TPENNLRTLSSQHGLGGCDQSVMDLIKRNSGWVFENPSIGVLELWVLATNFRDYAIIFTQLEF 
GDEPFNTVELYSLTETASQEAMGLFTKWSRSLGFLSQ 

Important features:
Signal peptide: 
amino acids 1-20 
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FIGURE 505 

GTTCCGCAGATGCAGAGGTTGAGGTGGCTGCGGGACTGGAAGTCATCGGGCAGAGGTCTCACA 
GCAGCCAAGGAACCTGGGGCCCGCTCCTCCCCCCTCCAGGCCATGAGGATTCTGCAGTTAATC 
CTGCTTGCTCTGGCAACAGGGCTTGTAGGGGGAGAGACCAGGATCATCAAGGGGTTCGAGTGC 
AAGCCTCACTCCCAGCCCTGGCAGGCAGCCCTGTTCGAGAAGACGCGGCTACTCTGTGGGGCG 
ACGCTCATCGCCCCCAGATGGCTCCTGACAGCAGCCCACTGCCTCAAGCCCCGCTACATAGTT 
CACCTGGGGCAGCACAACCTCCAGAAGGAGGAGGGCTGTGAGCAGACCCGGACAGCCACTGAG 
TCCTTCCCCCACCCCGGCTTCAACAACAGCCTCCCCAACAAAGACCACCGCAATGACATCATG 
CTGGTGAAGATGGCATCGCCAGTCTCCATCACCTGGGCTGTGCGACCCCTCACCCTCTCCTCA 
CGCTGTGTCACTGCTGGCACCAGCTGCCTCATTTCCGGCTGGGGCAGCACGTCCAGCCCCCAG 
TTACGCCTGCCTCACACCTTGCGATGCGCCAACATCACCATCATTGAGCACCAGAAGTGTGAG 
AACGCCTACCCCGGCAACATCACAGACACCATGGTGTGTGCCAGCGTGCAGGAAGGGGGCAAG 
GACTCCTGCCAGGGTGACTCCGGGGGCCCTCTGGTCTGTAACCAGTCTCTTCAAGGCATTATC 
TCCTGGGGCCAGGATCCGTGTGCGATCACCCGAAAGCCTGGTGTCTACACGAAAGTCTGCAAA
TATGTGGACTGGATCCAGGAGACGATGAAGAACAATTAGACTGGACCCACCCACCACAGCCCA 
TCACCCTCCATTTCCACTTGGTGTTTGGTTCCTGTTCACTCTGTTAATAAGAAACCCTAAGCC 
AAGACCCTCTACGAACATTCTTTGGGCCTCCTGGACTACAGGAGATGCTGTCACTTAATAATC 
AACCTGGGGTTCGAAATCAGTGAGACCTGGATTCAAATTCTGCCTTGAAATATTGTGACTCTG 
GGAATGACAACACCTGGTTTGTTCTCTGTTGTATCCCCAGCCCCAAAGACAGCTCCTGGCCAT 
ATATCAAGGTTTCAATAAATATTTGCTAAATGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAA 
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FIGURE 506 

MRILQLILLALATGLVGGETRIIKGFECKPHSQPWQAALFEKTRLLCGATLIAPRWLLTAAHC 
LKPRYIVHLGQHNLQKEEGCEQTRTATESFPHPGFNNSLPNKDHRNDIMLVKMASPVSITWAV 
RPLTLSSRCVTAGTSCLISGWGSTSSPQLRLPHTLRCANITIIEHQKCENAYPGNITDTMVCA 
SVQEGGKDSCQGDSGGPLVCNQSLQGIISWGQDPCAITRKPGVYTKVCKYVDWIQETMKNN 

Important features:
Signal peptide: 
amino acids 1-18 

Serine proteases, trypsin family, histidine active site.

amino acids 58-63 

N-glycosylation sites. 

amino acids 99-102, 165-168, 181-184, 210-213 

Glycosaminoglycan attachment site.

amino acids 145-148 

Kringle domain proteins. 

amino acids 197-209, 47-64 

Serine proteases, trypsin family, histidine protein

amino acids 199-209, 47-63, 220-243 



Apple domain proteins 

amino acids 222-249, 189-222 
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FIGURE 507 



CTGGGATCAGCCACTGCAGCTCCCTGAGCACTCTCTACAGAGACGCGGACCCCAGACATGAGG
AGGCTCCTCCTGGTCACCAGCCTGGTGGTTGTGCTGCTGTGGGAGGCAGGTGCAGTCCCAGCA 
CCCAAGGTCCCTATCAAGATGCAAGTCAAACACTGGCCCTCAGAGCAGGACCCAGAGAAGGCC 
TGGGGCGCCCGTGTGGTGGAGCCTCCGGAGAAGGACGACCAGCTGGTGGTGCTGTTCCCTGTC 
CAGAAGCCGAAACTCTTGACCACCGAGGAGAAGCCACGAGGTCAGGGCAGGGGCCCCATCCTT 
CCAGGCACCAAGGCCTGGATGGAGACCGAGGACACCCTGGGCCGTGTCCTGAGTCCCGAGCCC 
GACCATGACAGCCTGTACCACCCTCCGCCTGAGGAGGACCAGGGCGAGGAGAGGCCCCGGTTG 
TGGGTGATGCCAAATCACCAGGTGCTCCTGGGACCGGAGGAAGACCAAGACCACATCTACCAC 
CCCCAGT^GGCTCCAGGGGCCATCACTGCCCCCGCCCTGTCCCAAGGCCCAGGCTGTTGGGA 

CTGGGACCCTCCCTACCCTGCCCCAGCTAGACAAATAAACCCCAGCAGGCAAAAAAAAAAAAA 
AAAAAA 
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FIGURE 508 

MRRLLLVTSLVVVLLWEAGAVPAPKVPIKMQVKHWPSEQDPEKAWGARVVEPPEKDDQLVVLF
PVQKPKLLTTEEKPRGQGRGPILPGTKAWMETEDTLGRVLSPEPDHDSLYHPPPEEDQGEERP 
RLWVMPNHQVLLGPEEDQDHIYHPQ 







FIGURE 509 



GCGGAGCCGGCGCCGGCTGCGCAGAGGAGCCGCTCTCGCCGCCGCCACCTCGGCTGGGAGCCC 
ACGAGGCTGCCGCATCCTGCCCTCGGAACAATGGGACTCGGCGCGCGAGGTGCTTGGGCCGCG 
CTGCTCCTGGGGACGCTGCAGGTGCTAGCGCTGCTGGGGGCCGCCCATGAAAGCGCAGCCATG 
GCGGCATCTGCAAACATAGAGAATTCTGGGCTTCCACACAACTCCAGTGCTAACTCAACAGAG 
ACTCTCCAACATGTGCCTTCTGACCATACAAATGAAACTTCCAACAGTACTGTGAAACCACCA 
ACTTCAGTTGCCTCAGACTCCAGTAATACAACGGTCACCACCATGAAACCTACAGCGGCATCT 
AATACAACAACACCAGGGATGGTCTCAACAAATATGACTTCTACCACCTTAAAGTCTACACCC 
AAAACAACAAGTGTTTCACAGAACACATCTCAGATATCAACATCCACAATGACCGTAACCCAC 
AATAGTTCAGTGACATCTGCTGCTTCATCAGTAACAATCACAACAACTATGCATTCTGAAGCA 
AAG7U^GGATCAA7\ATTTGATACTGGGAGCTTTGTTGGTGGTATTGTATTAACGCTGGGAGTT 
TTATCTATTCTTTACATTGGATGCAAAATGTATTACTCAAGAAGAGGCATTCGGTATCGAACC 
ATAGATGAACATGATGCCATCATTTAAGGAAATCCATGGACCAAGGATGGAATACAGATTGAT 
GCTGCCCTATCAATTAATTTTGGTTTATTAATAGTTTAAAACAATATTCTCTTTTTGAAT^TA 
GTATAAACAGGCCATGCATATAATGTACAGTGTATTACGTAAATATGTAAAGATTCTTCAAGG 
TAACAAGGGTTTGGGTTTTGAAATAAACATCTGGATCTTATAGACCGTTCATACAATGGTTTT 
AGCAAGTTCATAGTAAGACAAACAAGTCCTATCTTTTTTTTTTGGCTGGGGTGGGGGCATTGG 
TCACATATGACCAGTAATTGAAAGACGTCATCACTGAAAGACAGAATGCCATCTGGGCATACA
AATAAGAAGTTTGTCACAGCACTCAGGATTTTGGGTATCTTTTGTAGCTCACATAAAGAACTT 
CAGTGCTTTTCAGAGCTGGATATATCTTAATTACTAATGCCACACAGAAATTATACAATCAAA
CTAGATCTGAAGCATAATTTAAGAAAAACATCAACATTTTTTGTGCTTTAAACTGTAGTAGTT 
GGTCTAGAAACAAAATACTCC 
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FIGURE 510 

MGLGARGAWAALLLGTLQVLALLGAAHESAAMAASANIENSGLPHNSSANSTETLQHVPSDHT
NETSNSTVKPPTSVASDSSNTTVTTMKPTAASNTTTPGMVSTNMTSTTLKSTPKTTSVSQNTS
QISTSTMTVTHNSSVTSAASSVTITTTMHSEAKKGSKFDTGSFVGGIVLTLGVLSILYIGCKM
YYSRRGIRYRTIDEHDAII
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FIGURE 511 



GACTTTGCTTGT^TGTTTACATTTTCTGCTCGCTGTCCTACATATCACAATATAGTGTTCACGTTTTGTTA/yy^C 
TTTGGGGTGTCAGGAGTTGAGCTTGCTCAGCAAGCCAGCATGGCTAGGATGAGCTTTGTTATAGCAGCTTGCCAA 
TTGGTGCTGGGCCTACTAATGACTTCATTAACCGAGTCTTCCATACAGAATAGTGAGTGTCCACAACTTTGCGTA 
TGTGAAATTCGTCCCTGGTTTACCCCACAGTCAACTTACAGAGAAGCCACCACTGTTGATTGCAATGACCTCCGC
TTAACAAGGATTCCCAGTAACCTCTCTAGTGACACACAAGTGCTTCTCTTACAGAGCAATAACATCGCGAAGACT 
GTGGATGAGCTGCAGCAGCTTTTCAACTTGACTGAACTAGATTTCTCCCAAAACAACTTTACTAACATTAAGGAG
GTCGGGCTGGCAAACCTAACCCAGCTCACAACGCTGCATTTGGAGGAAAATCAGATTACCGAGATGACTGATTAC 
TGTCTACAAGACCTCAGCAACCTTCAAGAACTCTACATCAACCACAACCAAATTAGCACTATTTCTGCTCATGCT 
TTTGCAGGCTTAAAAAATCTATTAAGGCTCCACCTGAACTCCAACAAATTGAAAGTTATTGATAGTCGCTGGTTT 
GATTCTACACCCAACCTGGAAATTCTCATGATCGGAGAAAACCCTGTGATTGGAATTCTGGATATGAACTTCAAA 
CCCCTCGCAAATTTGAGAAGCTTAGTTTTGGCAGGAATGTATCTCACTGATATTCCTGGAAATGCTTTGGTGGGT 
CTGGATAGCCTTGAGAGCCTGTCTTTTTATGATAACAAACTGGTTAAAGTCCCTCAACTTGCCCTGCAAAAAGTT
CCAAATTTGAAATTCTTAGACCTCAACAAAAACCCCATTCACAAAATCCAAGAAGGGGACTTCAAAAATATGCTT 
CGGTTAAAAGAACTGGGAATCAACAATATGGGCGAGCTCGTTTCTGTCGACCGCTATGCCCTGGATAACTTGCCT 
GAACTCACAAAGCTGGAAGCCACCAATAACCCTAAACTCTCTTACATCCACCGCTTGGCTTTCCGAAGTGTCCCT 
GCTCTGGAAAGCTTGATGCTGAACAACAATGCCTTGAATGCCATTTACCAAAAGACAGTCGAATCCCTCCCCAAT 
CTGCGTGAGATCAGTATCCATAGCAATCCCCTCAGGTGTGACTGTGTGATCCACTGGATTAACTCCAACAAAACC 
AACATCCGCTTCATGGAGCCCCTGTCCATGTTCTGTGCCATGCCGCCCGAATATAAAGGGCACCAGGTGAAGGAA 
GTTTTAATCCAGGATTCGAGTGAACAGTGCCTCCCAATGATATCTCACGACAGCTTCCCAAATCGTTTAAACGTG 
GATATCGGCACGACGGTTTTCCTAGACTGTCGAGCCATGGCTGAGCCAGAACCTGAAATTTACTGGGTCACTCCC 
ATTGGAAATAAGATAACTGTGGAAACCCTTTCAGATAAATACAAGCTAAGTAGCGAAGGTACCTTGGAAATATCT 
AACATACAAATTGAAGACTCAGGAAGATACACATGTGTTGCCCAGAATGTCCAAGGGGCAGACACTCGGGTGGCA 
ACAATTAAGGTTAACGGGACCCTTCTGGATGGTACCCAGGTGCTAAAAATATACGTCAAGCAGACAGAATCCCAT 
TCCATCTTAGTGTCCTGGAAAGTTAATTCCAATGTCATGACGTCAAACTTAAAATGGTCGTCTGCCACCATGAAG 
ATTGATAACCCTCACATAACATATACTGCCAGGGTCCCAGTCGATGTCCATGAATACAACCTAACGCATCTGCAG 
CCTTCCACAGATTATGAAGTGTGTCTCACAGTGTCCAATATTCATCAGCAGACTCAAAAGTCATGCGTAAATGTC 
ACAACCAAAAATGCCGCCTTCGCAGTGGACATCTCTGATCAAGAAACCAGTACAGCCCTTGCTGCAGTAATGGGG 
TCTATGTTTGCCGTCATTAGCCTTGCGTCCATTGCTGTGTACTTTGCCAAAAGATTTAAGAGAAAAAACTACCAC 
CACTCATTAAAAAAGTATATGCAAAAAACCTCTTCAATCCCACTAAATGAGCTGTACCCACCACTCATTAACCTC 
TGGGAAGGTGACAGCGAGAAAGACAAAGATGGTTCTGCAGACACCAAGCCAACCCAGGTCGACACATCCAGAAGC 
TATTACATGTGGTAACTCAGAGGATATTTTGCTTCTGGTAGTAAGGAGCACAAAGACGTTTTTGCTTTATTCTGC 
AAAAGTGAACAAGTTGAAGACTTTTGTATTTTTGACTTTGCTAGTTTGTGGCAGAGTGGAGAGGACGGGTGGATA 
TTTCAAATTTTTTTAGTATAGCGTATCGCAAGGGTTTGACACGGCTGCCAGCGACTCTAGGCTTCCAGTCTGTGT 
TTGGTTTTTATTCTTATCATTATTATGATTGTTATTATATTATTATTTTATTTTAGTTGTTGTGCTAAACTCAAT
AATGCTGTTCTAACTACAGTGCTCAATAAAATGATTAATGACAGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAATVAAAAAAAAAAAAAT^AAAAAAAAAA 
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FIGURE 512 

MARMSFVIAACQLVLGLLMTSLTESSIQNSECPQLCVCEIRPWFTPQSTYREATTVDCNDLRL 
TRIPSNLSSDTQVLLLQSNNIAKTVDELQQLFNLTELDFSQNNFTNIKEVGLANLTQLTTLHL 
EENQITEMTDYCLQDLSNLQELYINHNQISTISAHAFAGLKNLLRLHLNSNKLKVIDSRWFDS 
TPNLEILMIGENPVIGILDMNFKPLANLRSLVLAGMYLTDIPGNALVGLDSLESLSFYDNKLV 
KVPQLALQKVPNLKFLDLNKNPIHKIQEGDFKNMLRLKELGINNMGELVSVDRYALDNLPELT 
KLEATNNPKLSYIHRLAFRSVPALESLMLNNNALNAIYQKTVESLPNLREISIHSNPLRCDCV 
IHWINSNKTNIRFMEPLSMFCAMPPEYKGHQVKEVLIQDSSEQCLPMISHDSFPNRLNVDIGT
TVFLDCRAMAEPEPEIYWVTPIGNKITVETLSDKYKLSSEGTLEISNIQIEDSGRYTCVAQNV
QGADTRVATIKVNGTLLDGTQVLKIYVKQTESHSILVSWKVNSNVMTSNLKWSSATMKIDNPH 
ITYTARVPVDVHEYNLTHLQPSTDYEVCLTVSNIHQQTQKSCVNVTTKNAAFAVDISDQETST 
ALAAVMGSMFAVISLASIAVYFAKRFKRKNYHHSLKKYMQKTSSIPLNELYPPLINLWEGDSE 
KDKDGSADTKPTQVDTSRSYYMW

Important features:
Signal peptide: 

Amino acids 1-25 

Transmembrane domain: 

Amino acids 508-530 

N-glycosylation sites:

Amino acids 69-73;96-100;106-110;117-121;385-389;517-521; 
582-586; 611-615 

Tyrosine kinase phosphorylation site: 

Amino acids 573-582 



N-myristoylation sites: 

Amino acids 16-22;224-230;464-470;637-643;698-704 
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FIGURE 513 



GGGAGAGAGGATAAATAGCAGCGTGGCTTCCCTGGCTCCTCTCTGCATCCTTCCCGACCTTCC 
CAGCAATATGCATCTTGCACGTCTGGTCGGCTCCTGCTCCCTCCTTCTGCTACTGGGGGCCCT 
GTCTGGATGGGCGGCCAGCGATGACCCCATTGAGAAGGTCATTGAAGGGATCAACCGAGGGCT 
GAGCAATGCAGAGAGAGAGGTGGGCAAGGCCCTGGATGGCATCAACAGTGGAATCACGCATGC 
CGGAAGGGAAGTGGAGAAGGTTTTCAACGGACTTAGCAACATGGGGAGCCACACCGGCAAGGA 
GTTGGACAAAGGCGTCCAGGGGCTCAACCACGGCATGGACAAGGTTGCCCATGAGATCAACCA 
TGGTATTGGACAAGCAGGAAAGGAAGCAGAGAAGCTTGGCCATGGGGTCAACAACGCTGCTGG 
ACAGGCCGGGAAGGAAGCAGACAAAGCGGTCCAAGGGTTCCACACTGGGGTCCACCAGGCTGG 
GAAGGAAGCAGAGAAACTTGGCCAAGGGGTCAACCATGCTGCTGACCAGGCTGGAAAGGAAGT 
GGAGAAGCTTGGCCAAGGTGCCCACCATGCTGCTGGCCAGGCCGGGAAGGAGCTGCAGAATGC 
TCATAATGGGGTCAACCAAGCCAGCAAGGAGGCCAACCAGCTGCTGAATGGCAACCATCAAAG 
CGGATCTTCCAGCCATCAAGGAGGGGCCACAACCACGCCGTTAGCCTCTGGGGCCTCAGTCAA 
CACGCCTTTCATCAACCTTCCCGCCCTGTGGAGGAGCGTCGCCAACATCATGCCC TAHA CTGG 
CATCCGGCCTTGCTGGGAGAATAATGTCGCCGTTGTCACATCAGCTGACATGACCTGGAGGGG 
TTGGGGGTGGGGGACAGGTTTCTGAAATCCCTGAAGGGGGTTGTACTGGGATTTGTGAATAAA 
CTTGATACACCA 
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FIGURE 514 

MHLARLVGSCSLLLLLGALSGWAASDDPIEKVIEGINRGLSNAEREVGKALDGINSGITHAGR 
EVEKVFNGLSNMGSHTGKELDKGVQGLNHGMDKVAHEINHGIGQAGKEAEKLGHGVNNAAGQA 
GKEADKAVQGFHTGVHQAGKEAEKLGQGVNHAADQAGKEVEKLGQGAHHAAGQAGKELQNAHN 
GVNQASKEANQLLNGNHQSGSSSHQGGATTTPLASGASVNTPFINLPALWRSVANIMP 



Important features: 
Signal peptide: 

amino acids 1-25 

Homologous region to circumsporozoite (CS) repeats: 

amino acids 35-225 







FIGURE 515 



CCCACGCGTCCGCCCACGCGTCCGGGTGCCACTCGCGCGCCGGCCGCGCTCCGGGCTTCTCTT 

TTCCCTCCGACGCGCCACGGCTGCCCAGACATTCCGGCTGCCGGGTCTGGAGAGCTCCCCGAA 

CCCCTCCGCGGAGAGGAGCGAGGCGGCGCCAGGGTGGCCCCCGGGGCGCGCTTGGTCTCGGAG 

AAGCGGGGACGAGGCCGGAGGATGAGCGACTGAGGGCGACGCGGGCACTGACGCGAGTTGGGG 

CCGCGACTACCGGCAGCTGACAGCGCGATGAGCGACTCCCCAGAGACGCCCTAGCCCGGTGTG 

CGCGCCAGGCGGAGCGCGCAGGTGGGGCTGGGCTGTTAGTGGTCCGCCCCACGCGGGTCGCCG 

GCCGGCCCAGGATGGGCGCTGGCAACCCGGGCCCGCGCCCGCCGCTGCTACCCCTGCGCCCGC 

TGCGAGCCCGGCGTCCGGCCCGCGCCCTGCGCTCATGGACGGCGGCTCCCGGCTGGCGGCGGC 

GCGCCCCCGGGCTGTGAATGCGACTCGCCCCTCGGCCGCGCTCCCCGCCCGCCCGCCCGCCGG 

GACGTGGTAGGGGATGCCCAGCTCCACTGCGATGGCAGTTGGCGCGCTCTCCAGTTCCCTCCT 

GGTCACCTGCTGCCTGATGGTGGCTCTGTGCAGTCCGAGCATCCCGCTGGAGAAGCTGGCCCA 

GGCACCAGAGCAGCCGGGCCAGGAGAAGCGTGAGCACGCCACTCGGGACGGCCCGGGGCGGGT 

GAACGAGGTCGGGCGCCCGGCGAGGGACGAGGGCGGCAGCGGCCGGGACTGGAAGAGCAAGAG 

CGGCCGTGGGCTCGCCGGCCGTGAGCCGTGGAGCAAGCTGAAGCAGGCCTGGGTCTCCCAGGG

CGGGGGCGCCAAGGCCGGGGATCTGCAGGTCCGGCCCCGCGGGGACACCCCGCAGGCGGAAGC 

CCTGGCCGCAGCCGCCCAGGACGCGATTGGCCCGGAACTCGCGCCCACGCCCGAGCCACCCGA 

GGAGTACGTGTACCCGGACTACCGTGGCAAGGGCTGCGTGGACGAGAGCGGCTTCGTGTACGC 

GATCGGGGAGAAGTTCGCGCCGGGCCCCTCGGCCTGCCCGTGCCTGTGCACCGAGGAGGGGCC 

GCTGTGCGCGCAGCCCGAGTGCCCGAGGCTGCACCCGCGCTGCATCCACGTCGACACGAGCCA 

GTGCTGCCCGCAGTGCAAGGAGAGGAAGAACTACTGCGAGTTCCGGGGCAAGACCTATCAGAC 

TTTGGAGGAGTTCGTGGTGTCTCCATGCGAGAGGTGTCGCTGTGAAGCCAACGGTGAGGTGCT 

ATGCACAGTGTCAGCGTGTCCCCAGACGGAGTGTGTGGACCCTGTGTACGAGCCTGATCAGTG 

CTGTCCCATCTGCAAAAATGGTCCAAACTGCTTTGCAGAAACCGCGGTGATCCCTGCTGGCAG 

AGAAGTGAAGAGTGACGAGTGCACCATATGCCACTGTACTTATGAGGAAGGCACATGGAGAAT 

CGAGCGGCAGGCCATGTGCACGAGACATGAATGCAGGCAAATGTAGACGCTTCCCAGAACACA 

AACTCTGACTTTTTCTAGAACATTTTACTGATGTGAACATTCTAGATGACTCTGGGAACTATC 

AGTCAAAGAAGACTTTTGATGAGGT^TAATGGAAAATTGTTGGTACTTTTCCTTTTCTTGATA 

ACAGTTACTACAAGAGAAGGAAATGGATATATTTCAAAACATCAACAAGAACTTTGGGCATAA 

AATCCTTCTCTAAATAAATGTGCTATTTTCACAGTAAGTACACAAAAGTACACTATTATATAT 

CAAATGTATTTCTATAATCCCTCCATTAGAGAGCTTATATAAGTGTTTTCTATAGATGCAGAT

TAAAAATGCTGTGTTGTCAACCGTCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 516 

MPSSTAMAVGALSSSLLVTCCLMVALCSPSIPLEKLAQAPEQPGQEKREHATRDGPGRVNELG 
RPARDEGGSGRDWKSKSGRGLAGREPWSKLKQAWVSQGGGAKAGDLQVRPRGDTPQAEALAAA 
AQDAIGPELAPTPEPPEEYVYPDYRGKGCVDESGFVYAIGEKFAPGPSACPCLCTEEGPLCAQ 
PECPRLHPRCIHVDTSQCCPQCKERKNYCEFRGKTYQTLEEFVVSPCERCRCEANGEVLCTVS
ACPQTECVDPVYEPDQCCPICKNGPNCFAETAVIPAGREVKTDECTICHCTYEEGTWRIERQA 
MCTRHECRQM 

Important features:
Signal peptide:

amino acids 1-27. 

Transmembrane domain: 

amino acids 11-30 

Glycosaminoglycan attachment site.

amino acids 80-83 

N-myristoylation sites. 

amino acids 10-15, 102-107, 103-108 

Cell attachment sequence. 

amino acids 114-117 

EGF-like domain cysteine pattern signature.

amino acids 176-187 
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FIGURE 517 



GGACAACCGTTGCTGGGTGTCCCAGGGCCTGAGGCAGGACGGTACTCCGCTGACACCTTCCCT 
TTCGGCCTTGAGGTTCCCAGCCTGGTGGCCCCAGGACGTTCCGGTCGCATGGCAGAGTGCTAC 
GGACGACGCCTATGAAGCCCTTAGTCCTTCTAGTTGCGCTTTTGCTATGGCCTTCGTCTGTGC 
CGGCTTATCCGAGCATAACTGTGACACCTGATGAAGAGCAAAACTTGAATCATTATATACAAG 
TTTTAGAGAACCTAGTACGAAGTGTTCCCTCTGGGGAGCCAGGTCGTGAGAAAAAATCTAACT 
CTCCAAAACATGTTTATTCTATAGCATCAAAGGGATCAAAATTTAAGGAGCTAGTTACACATG 
GAGACGCTTCAACTGAGAATGATGTTTTAACCAATCCTATCAGTGAAGAAACTACAACTTTCC 
CTACAGGAGGCTTCACACCGGAAATAGGAAAGAAAAAACACACGGAAAGTACCCCATTCTGGT 
CGATCAAACCAAACAATGTTTCCATTGTTTTGCATGCAGAGGAACCTTATATTGAAAATGAAG
AGCCAGAGCCAGAGCCGGAGCCAGCTGCAAAACAAACTGAGGCACCAAGAATGTTGCCAGTTG 
TTACTGAATCATCTACAAGTCCATATGTTACCTCATACAAGTCACCTGTCACCACTTTAGATA 
AGAGCACTGGCATTGAGATCTCTACAGAATCAGAAGATGTTCCTCAGCTCTCAGGTGAAACTG
CGATAGAAAAACCCGAAGAGTTTGGAAAGCACCCAGAGAGTTGGAATAATGATGACATTTTGA 
AAAAAATTTTAGATATTAATTCACAAGTGCAACAGGCACTTCTTAGTGACACCAGCAACCCAG 
CATATAGAGT^GATATTGAAGCCTCTAAAGATCACCTAAAACGAAGCCTTGCTCTAGCAGCAG 
CAGCAGAACATAAATTAAAAACAATGTATAAGTCCCAGTTATTGCCAGTAGGACGAACAAGTA 
ATAAAATTGATGACATCGAAACTGTTATTAACATGCTGTGTAATTCTAGATCTAAACTCTATG 
AATATTTAGATATTAAATGTGTTCCACCAGAGATGAGAGAAAAAGCTGCTACAGTATTCAATA 
CATTAAAAAATATGTGTAGATCAAGGAGAGTCACAGCCTTATTAAAAGTTTATTAAACAATAA
TATAAAT^TTTTAAACCTACTTGATATTCCATTU^CAAAGCTGATTTAAGCAAACTGCATTTTT 
TCACAGGAGAAATAATCATATTCGTAATTTCAAAAGTTGTATAAAAATATTTTCTATTGTAGT 
TCAAATGTGCCAACATCTTTATGTGTCATGTGTTATGAACAATTTTCATATGCACTAAAAACC 
TAATTTAAAATAAAATTTTGGTTCAGGAAAAAA 
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FIGURE 518 

MKPLVLLVALLLWPSSVPAYPSITVTPDEEQNLNHYIQVLENLVRSVPSGEPGREKKSNSPKH 
VYSIASKGSKFKELVTHGDASTENDVLTNPISEETTTFPTGGFTPEIGKKKHTESTPFWSIKP 
NNVSIVLHAEEPYIENEEPEPEPEPAAKQTEAPRMLPVVTESSTSPYVTSYKSPVTTLDKSTG
IEISTESEDVPQLSGETAIEKPEEFGKHPESWNNDDILKKILDINSQVQQALLSDTSNPAYRE
DIEASKDHLKRSLALAAAAEHKLKTMYKSQLLPVGRTSNKIDDIETVINMLCNSRSKLYEYLD 
IKCVPPEMREKAATVFNTLKNMCRSRRVTALLKVY 



Important features: 
Signal peptide: 

amino acids 1-19 
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FIGURE 519 



CGGCTCGAGTGCAGCTGTGGGGAGATTTCAGTGCATTGCCTCCCCTGGGTGCTCTTCATCTTG 
GATTTGAAAGTTGAGAGCAGCATGTTTTGCCCACTGAAACTCATCCTGCTGCCAGTGTTACTG 
GATTATTCCTTGGGCCTGAATGACTTGAATGTTTCCCCGCCTGAGCTAACAGTCCATGTGGGT 
GATTCAGCTCTGATGGGATGTGTTTTCCAGAGCACAGAAGACAAATGTATATTCAAGATAGAC 
TGGACTCTGTCACCAGGAGAGCACGCCAAGGACGAATATGTGCTATACTATTACTCCAATCTC
AGTGTGCCTATTGGGCGCTTCCAGAACCGCGTACACTTGATGGGGGACATCTTATGCAATGAT 
GGCTCTCTCCTGCTCCAAGATGTGCAAGAGGCTGACCAGGGAACCTATATCTGTGAAATCCGC 
CTCAAAGGGGAGAGCCAGGTGTTCAAGAAGGCGGTGGTACTGCATGTGCTTCCAGAGGAGCCC 
AAAGAGCTCATGGTCCATGTGGGTGGATTGATTCAGATGGGATGTGTTTTCCAGAGCACAGAA 
GTGAAACACGTGACCAAGGTAGAATGGATATTTTCAGGACGGCGCGCAAAGGAGGAGATTGTA 
TTTCGTTACTACCACAAACTCAGGATGTCTGTGGAGTACTCCCAGAGCTGGGGCCACTTCCAG 
AATCGTGTGAACCTGGTGGGGGACATTTTCCGCAATGACGGTTCCATCATGCTTCAAGGAGTG 
AGGGAGTCAGATGGAGGAAACTACACCTGCAGTATCCACCTAGGGAACCTGGTGTTCAAGAAA 
ACCATTGTGCTGCATGTCAGCCCGGAAGAGCCTCGAACACTGGTGACCCCGGCAGCCCTGAGG 
CCTCTGGTCTTGGGTGGTAATCAGTTGGTGATCATTGTGGGAATTGTCTGTGCCACAATCCTG 
CTGCTCCCTGTTCTGATATTGATCGTGAAGAAGACCTGTGGAAATAAGAGTTCAGTGAATTCT 
ACAGTCTTGGTGAAGAACACGAAGAAGACTAATCCAGAGATAAAAGAAAAACCCTGCCATTTT 
GAAAGATGTGAAGGGGAGAAACACATTTACTCCCCAATAATTGTACGGGAGGTGATCGAGGAA 
GAAGAACCAAGTGAAAAATCAGAGGCCACCTACATGACCATGCACCCAGTTTGGCCTTCTCTG 
AGGTCAGATCGGAACAACTCACTTGAAAAAAAGTCAGGTGGGGGAATGCCAAAAACACAGCAA 
GCCTTTTGAGAAGAATGGAGAGTCCCTTCATCTCAGCAGCGGTGGAGACTCTCTCCTGTGTGT 
GTCCTGGGCCACTCTACCAGTGATTTCAGACTCCCGCTCTCCCAGCTGTCCTCCTGTCTCATT 
GTTTGGTCAATACACTGAAGATGGAGAATTTGGAGCCTGGCAGAGAGACTGGACAGCTCTGGA 
GGAACAGGCCTGCTGAGGGGAGGGGAGCATGGACTTGGCCTCTGGAGTGGGACACTGGCCCTG 
GGAACCAGGCTGAGCTGAGTGGCCTCAAACCCCCCGTTGGATCAGACCCTCCTGTGGGCAGGG 
TTCTTAGTGGATGAGTTACTGGGAAGAATCAGAGATAAAAACCAACCCAAATCAA 
FIGURE 520

MFCPLKLILLPVLLDYSLGLNDLNVSPPELTVHVGDSALMGCVFQSTEDKCIFKIDWTLSPGE 

HAKDEYVLYYYSNLSVPIGRFQNRVHLMGDILCNDGSLLLQDVQEADQGTYICEIRLKGESQV 

FKKAVVLHVLPEEPKELMVHVGGLIQMGCVFQSTEVKHVTKVEWIFSGRRAKEEIVFRYYHKL

RMSVEYSQSWGHFQNRVNLVGDIFRNDGSIMLQGVRESDGGNYTCSIHLGNLVFKKTIVLHVS 

PEEPRTLVTPAALRPLVLGGNQLVIIVGIVCATILLLPVLILIVKKTCGNKSSVNSTVLVKNT 

KKTNPEIKEKPCHFERCEGEKHIYSPIIVREVIEEEEPSEKSEATYMTMHPVWPSLRSDRNNS 
LEKKSGGGMPKTQQAF 
FIGURE 521

CTATGAAGAAGCTTCCTGGAAAACAATAAGCAAAGGAAAACAAATGTGTCCCATCTCACATGG 
TTCTACCCTACTAAAGACAGGAAGATCATAAACTGACAGATACTGAAATTGTAAGAGTTGGAA 
ACTACATTTTGCAAAGTCATTGAACTCTGAGCTCAGTTGCAGTACTCGGGAAGCCATGCAGGA 
TGAAGATGGATACATCACCTTAAATATTAAAACTCGGAAACCAGCTCTCGTCTCCGTTGGCCC 
TGCATCCTCCTCCTGGTGGCGTGTGATGGCTTTGATTCTGCTGATCCTGTGCGTGGGGATGGT 
TGTCGGGCTGGTGGCTCTGGGGATTTGGTCTGTCATGCAGCGCAATTACCTACAAGATGAGAA 
TGAAAATCGCACAGGAACTCTGCAACAATTAGCAAAGCGCTTCTGTCAATATGTGGTAAAACA
ATCAGAACTAAAGGGCACTTTCAAAGGTCATAAATGCAGCCCCTGTGACACAAACTGGAGATA 
TTATGGAGATAGCTGCTATGGGTTCTTCAGGCACAACTTAACATGGGAAGAGAGTAAGCAGTA 
CTGCACTGACATGAATGCTACTCTCCTGAAGATTGACAACCGGAACATTGTGGAGTACATCAA 
AGCCAGGACTCATTTAATTCGTTGGGTCGGATTATCTCGCCAGAAGTCGAATGAGGTCTGGAA 
GTGGGAGGATGGCTCGGTTATCTCAGAAAATATGTTTGAGTTTTTGGAAGATGGAAAAGGAAA 
TATGAATTGTGCTTATTTTCATAATGGGAAAATGCACCCTACCTTCTGTGAGAACAAACATTA 
TTTAATGTGTGAGAGGAAGGCTGGCATGACCAAGGTGGACCAACTACCTTAATGCAAAGAGGT 
GGACAGGATAACACAGATAAGGGCTTTATTGTACAATAAAAGATATGTATGAATGCATCAGTA 
GCTGAAAAAAAAAAAAAA 
FIGURE 522

MQDEDGYITLNIKTRKPALVSVGPASSSWWRVMALILLILCVGMVVGLVALGIWSVMQRNYLQ 
DENENRTGTLQQLAKRFCQYVVKQSELKGTFKGHKCSPCDTNWRYYGDSCYGFFRHNLTWEES 
KQYCTDMNATLLKIDNRNIVEYIKARTHLIRWVGLSRQKSNEVWKWEDGSVISENMFEFLEDG 
KGNMNCAYFHNGKMHPTFCENKHYLMCERKAGMTKVDQLP 
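
The amino acid sequence of Figure 522 can be cross-checked against the nucleotide sequence of Figure 521 by translating the open reading frame. The sketch below is an illustration under stated assumptions: it uses the standard genetic code, the function name `translate` is hypothetical, and the 60-nucleotide fragment is the stretch of Figure 521 beginning at the ATG that corresponds to the first residue of Figure 522.

```python
from itertools import product

# Standard genetic code, written in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA in frame 0 until the first stop codon.

    Codons containing anything other than A, C, G, T are reported as 'X'.
    Hypothetical helper for illustration; not part of the source document.
    """
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nucleotides of the Figure 521 open reading frame.
FIG521_ORF_START = "ATGCAGGATGAAGATGGATACATCACCTTAAATATTAAAACTCGGAAACCAGCTCTCGTC"

print(translate(FIG521_ORF_START))  # MQDEDGYITLNIKTRKPALV
```

The printed residues agree with the first 20 residues of Figure 522. Reporting unreadable codons as `X` rather than guessing is a deliberate choice, since scanned sequence listings can contain characters that are not valid nucleotides.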
FIGURE 523



CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACCATGGC 
AAAGAATCCTCCAGAGAATTGTGAAGACTGTCACATTCTAAATGCAGAAGCTTTTAAATCCAA 
GAAAATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAAT 
TGTCCTGTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGA 
GCACACTTTCTACAGCAATGGAGAGAAGAAGAAGATTTACATGGAAATTGATCCTGTGACCAG 
AACTGAAATATTCAGAAGCGGAAATGGCACTGATGAAACATTGGAAGTGCACGACTTTAAAAA 
CGGATACACTGGCATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGT 
GATTCCTGAATTTTCTGAACCAGAAGAGGAAATAGATGAGAATGAAGAAATTACCACAACTTT
CTTTGAACAGTCAGTGATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAA 
AAATTCCAAAATTCTGGAGATTTGTGATAACGTGACCATGTATTGGATCAATCCCACTCTAAT
ATCAGTTTCTGAGTTACAAGACTTTGAGGAGGAGGGAGAAGATCTTCACTTTCCTGCCAACGA 
AAAAAAAGGGATTGAACAAAATGAACAGTGGGTGGTCCCTCAAGTGAAAGTAGAGAAGACCCG 
TCACGCCAGACAAGCAAGTGAGGAAGAACTTCCAATAAATGACTATACTGAAAATGGAATAGA
ATTTGATCCCATGCTGGATGAGAGAGGTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTA 
TTGCCGCCGCGTCTGTGAACCTTTACTAGGCTACTACCCATATCCATACTGCTACCAAGGAGG 
ACGAGTCATCTGTCGTGTCATCATGCCTTGTAACTGGTGGGTGGCCCGCATGCTGGGGAGGGT 
CJCAATAGGAGGTTTGAGCTCAAATGCTTAAACTGCTGGCAACATATAATAAATGCATGCTATT 
CAATGAATTTCTGCCTATGAGGCATCTGGCCCCTGGTAGCCAGCTCTCCAGAATTACTTGTAG 
GTAATTCCTCTCTTCATGTTCTAATAAACTTCTACATTATCACCAAAAAAAAT^AAAAAAAAAA 
FIGURE 524

MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYD 
MEHTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQI 
KVIPEFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPT 
LISVSELQDFEEEGEDLHFPANEKKGIEQNEQWVVPQVKVEKTRHARQASEEELPINDYTENG 
IEFDPMLDERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV

Important features: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites.

amino acids 94-97, 180-183

Glycosaminoglycan attachment sites.

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239-242

N-myristoylation sites.

amino acids 33-38, 95-100, 116-121, 215-220, 272-277

Microbodies C-terminal targeting signal.

amino acids 315-317

Cytochrome c family heme-binding site signature.

amino acids 9-14 




FIGURE 525



AGTGACAATCTCAGAGCAGCTTCTACACCACAGCCATTTCCAGCATGAAGATCACTGGGGGTC 
TCCTTCTGCTCTGTACAGTGGTCTATTTCTGTAGCAGCTCAGAAGCTGCTAGTCTGTCTCCAA 
AAAAAGTGGACTGCAGCATTTACAAGAAGTATCCAGTGGTGGCCATCCCCTGCCCCATCACAT
ACCTACCAGTTTGTGGTTCTGACTACATCACCTATGGGAATGAATGTCACTTGTGTACCGAGA 
GCTTGAAAAGTAATGGAAGAGTTCAGTTTCTTCACGATGGAAGTTGCTAAATTCTCCATGGAC
ATAGAGAGAAAGGAATGATATTCTCATCATCATCTTCATCATCCCAGGCTCTGACTGAGTTTC 
TTTCAGTTTTACTGATGTTCTGGGTGGGGGACAGAGCCAGATTCAGAGTAATCTTGACTGAAT 
GGAGAAAGTTTCTGTGCTACCCCTACAAACCCATGCCTCACTGACAGACCAGCATTTTTTTTT 
TAACACGTCAATAAAAAAATAATCTCCCAGA 
FIGURE 526

MKITGGLLLLCTVVYFCSSSEAASLSPKKVDCSIYKKYPVVAIPCPITYLPVCGSDYITYGNE
CHLCTESLKSNGRVQFLHDGSC 



Important features: 
Signal peptide: 

amino acids 1-19 




FIGURE 527



CGACGATGCTACGCGCGCCCGGCTGCCTCCTCCGGACCTCCGTAGCGCCTGCCGCGGCCCTGG 

CTGCGGCGCTGCTCTCGTCGCTTGCGCGCTGCTCTCTTCTAGAGCCGAGGGACCCGGTGGCCT 

CGTCGCTCAGCCCCTATTTCGGCACCAAGACTCGCTACGAGGATGTCAACCCCGTGCTATTGT 

CGGGCCCCGAGGCTCCGTGGCGGGACCCTGAGCTGCTGGAGGGGACCTGCACCCCGGTGCAGC 

TGGTCGCCCTCATTCGCCACGGCACCCGCTACCCCACGGTCAAACAGATCCGCAAGCTGAGGC 

AGCTGCACGGGTTGCTGCAGGCCCGCGGGTCCAGGGATGGCGGGGCTAGTAGTACCGGCAGCC 

GCGACCTGGGTGCAGCGCTGGCCGACTGGCCTTTGTGGTACGCGGACTGGATGGACGGGCAGC 

TAGTAGAGAAGGGACGGCAGGATATGCGACAGCTGGCGCTGCGTCTGGCCTCGCTCTTCCCGG 

CCCTTTTCAGCCGTGAGAACTACGGCCGCCTGCGGCTCATCACCAGTTCCAAGCACCGCTGCA 

TGGATAGCAGCGCCGCCTTCCTGCAGGGGCTGTGGCAGCACTACCACCCTGGCTTGCCGCCGC 

CGGACGTCGCAGATATGGAGTTTGGACCTCCAACAGTTAATGATAAACTAATGAGATTTTTTG 

ATCACTGTGAGAAGTTTTTAACTGAAGTAGAAAAAAATGCTACAGCTCTTTATCACGTGGAAG 

CCTTCAAAACTGGACCAGAAATGCAGAACATTTTAAAAAAAGTTGCAGCTACTTTGCAAGTGC 

CAGTAAATGATTTAAATGCAGATTTAATTCAAGTAGCCTTTTTCACCTGTTCATTTGACCTGG 

CAATTAAAGGTGTTAAATCTCCTTGGTGTGATGTTTTTGACATAGATGATGCAAAGGTATTAG 

AATATTTAAATGATCTGAAACAATATTGGAAAAGAGGATATGGGTATACTATTAACAGTCGAT 

CCAGCTGCACCTTGTTTCAGGATATCTTTCAGCACTTGGACAAAGCAGTTGAACAGAAACAAA 

GGTCTCAGCCAATTTCTTCTCCAGTCATCCTCCAGTTTGGTCATGCAGAGACTCTTCTTCCAC 

TGCTTTCTCTCATGGGCTACTTCAAAGACAAGGAACCCCTAACAGCGTACAATTACAAAAAAC 

AAATGCATCGGAAGTTCCGAAGTGGTCTCATTGTACCTTATGCCTCGAACCTGATATTTGTGC 

TTTACCACTGTGAAAATGCTAAGACTCCTAAAGAACAATTCCGAGTGCAGATGTTATTAAATG

AAAAGGTGTTACCTTTGGCTTACTCACAAGAAACTGTTTCATTTTATGAAGATCTGAAGAACC 

ACTACAAGGACATCCTTCAGAGTTGTCAAACCAGTGAAGAATGTGAATTAGCAAGGGCTAACA 

GTACATCTGATGAACTATGAGTAACTGAAGAACATTTTTAATTCTTTAGGAATCTGCAATGAG

TGATTACATGCTTGTAATAGGTAGGCAATTCCTTGATTACAGGAAGCTTTTATATTACTTGAG 

TATTTCTGTCTTTTCACAGAAAAACATTGGGTTTCTCTCTGGGTTTGGACATGAAATGTAAGA 

AAAGATTTTTCACTGGAGCAGCTCTCTTAAGGAGAAACAAATCTATTTAGAGAAACAGCTGGC 

CCTGCAAATGTTTACAGAAATGAAATTCTTCCTACTTATATAAGAAATCTCACACTGAGATAG 

AATTGTGATTTCATAATAACACTTGAAAAGTGCTGGAGT/yVCAAAATATCTCAGTTGGACCAT 

CCTTAACTTGATTGAACTGTCTAGGAACTTTACAGATTGTTCTGCAGTTCTCTCTTCTTTTCC 

TCAGGTAGGACAGCTCTAGCATTTTCTTAATCAGGAATATTGTGGTAAGCTGGGAGTATCACT 

CTGGAAGAAAGTAACATCTCCAGATGAGAATTTGAAACAAGAAACAGAGTGTTGTAAAAGGAC 

ACCTTCACTGAAGCAAGTCGGAAAGTACAATGAAAATAAATATTTTTGGTATTTATTTATGAA 

ATATTTGAACATTTTTTCAATAATTCCTTTTTACTTCTAGGAAGTCTCAAAAGACCATCTTAA 

ATTATTATATGTTTGGACAATTAGCAACAAGTCAGATAGTTAGAATCGAAGTTTTTCAAATCC 

ATTGCTTAGCTAACTTTTTCATTCTGTCACTTGGCTTCGATTTTTATATTTTCCTATTATATG 

AAATGTATCTTTT6GTTGTTTGATTTTTCTTTCTTTCTTTGTAAATAGTTCTGAGTTCTGTCA 

AATGCCGTGAAAGTATTTGCTATAATAAAGAAAATTCTTGTGACTTTAAAAAAAAA 
FIGURE 528



MLRAPGCLLRTSVAPAAALAAALLSSLARCSLLEPRDPVASSLSPYFGTKTRYEDVNPVLLSG 
PEAPWRDPELLEGTCTPVQLVALIRHGTRYPTVKQIRKLRQLHGLLQARGSRDGGASSTGSRD 
LGAALADWPLWYADWMDGQLVEKGRQDMRQLALRLASLFPALFSRENYGRLRLITSSKHRCMD 
SSAAFLQGLWQHYHPGLPPPDVADMEFGPPTVNDKLMRFFDHCEKFLTEVEKNATALYHVEAF 
KTGPEMQNILKKVAATLQVPVNDLNADLIQVAFFTCSFDLAIKGVKSPWCDVFDIDDAKVLEY 
LNDLKQYWKRGYGYTINSRSSCTLFQDIFQHLDKAVEQKQRSQPISSPVILQFGHAETLLPLL 
SLMGYFKDKEPLTAYNYKKQMHRKFRSGLIVPYASNLIFVLYHCENAKTPKEQFRVQMLLNEK 
VLPLAYSQETVSFYEDLKNHYKDILQSCQTSEECELARANSTSDEL 

Important features: 
Signal sequence 

amino acids 1-30 

N-glycosylation sites.

amino acids 242-246, 481-485

N-myristoylation sites.

amino acids 107-113, 113-119, 117-123, 118-124, 128-134



Endoplasmic reticulum targeting sequence. 

amino acids 484-489 
FIGURE 529



GGAGAGCCGCGGCTGGGACCGGAGTGGGGAGCGCGGCGTGGAGGTGCCACCCGGCGCGGGTGG 
CGGAGAGATCAGAAGCCTCTTCCCCAAGCCGAGCCAACCTCAGCGGGGACCCGGGCTCAGGGA 
CGCGGCGGCGGCGGCGGCGACTGCAGTGGCTGGACGATGGCAGCGTCCGCCGGAGCCGGGGCG 
GTGATTGCAGCCCCAGACAGCCGGCGCTGGCTGTGGTCGGTGCTGGCGGCGGCGCTTGGGCTC 
TTGACAGCTGGAGTATCAGCCTTGGAAGTATATACGCCAAAAGAAATCTTCGTGGCAAATGGT 
ACACAAGGGAAGCTGACCTGCAAGTTCAAGTCTACTAGTACGACTGGCGGGTTGACCTCAGTC 
TCCTGGAGCTTCCAGCCAGAGGGGGCCGACACTACTGTGTCGTTTTTCCACTACTCCCAAGGG 
CAAGTGTACCTTGGGAATTATCCACCATTTAAAGACAGAATCAGCTGGGCTGGAGACCTTGAC 
AAGAAAGATGCATCAATCAACATAGAAAATATGCAGTTTATACACAATGGCACCTATATCTGT 
GATGTCAAAAACCCTCCTGACATCGTTGTCCAGCCTGGACACATTAGGCTCTATGTCGTAGAA 
AAAGAGAATTTGCCTGTGTTTCCAGTTTGGGTAGTGGTGGGCATAGTTACTGCTGTGGTCCTA 
GGTCTCACTCTGCTCATCAGCATGATTCTGGCTGTCCTCTATAGAAGGAAAAACTCTAAACGG 
GATTACACTGGCTGCAGTACATCAGAGAGTTTGTCACCAGTTAAGCAGGCTCCTCGGAAGTCC 
CCCTCCGACACTGAGGGTCTTGTAAAGAGTCTGCCTTCTGGATCTCACCAGGGCCCAGTCATA 
TATGCACAGTTAGACCACTCCGGCGGACATCACAGTGACAAGATTAACAAGTCAGAGTCTGTG 
GTGTATGCGGATATCCGAAAGAATTAAGAGAATACCTAGAACATATCCTCAGCAAGAAACAAA 
ACCAAACTGGACTCTCGTGCAGAAAATGTAGCCCATTACCACATGTAGCCTTGGAGACCCAGG 
CAAGGACAAGTACACGTGTACTCACAGAGGGAGAGAAAGATGTGTACAAAGGATATGTATAAA 
TATTCTATTTAGTCATCCTGATATGAGGAGCCAGTGTTGCATGATGAAAAGATGGTATGATTC 
TACATATGTACCCATTGTCTTGCTGTTTTTGTACTTTCTTTTCAGGTCATTTACAATTGGGAG 
ATTTCAGAAACATTCCTTTCACCATCATTTAG7yU\TGGTTTGCCTTAATGGAGACAATAGCAG 
ATCCTGTAGTATTTCCAGTAGACATGGCCTTTTAATCTAAGGGCTTAAGACTGATTAGTCTTA 
GCATTTACTGTAGTTGGAGGATGGAGATGCTATGATGGAAGCATACCCAGGGTGGCCTTTAGC 
ACAGTATCAGTACCATTTATTTGTCTGCCGCTTTTAAAAAATACCCATTGGCTATGCCACTTG 
AAAACAATTTGAGAAGTTTTTTTGAAGTTTTTCTCACTAAAATATGGGGCAATTGTTAGCCTT 
ACATGTTGTGTAGACTTACTTTAAGTTTGCACCCTTGAAATGTGTCATATC/U^TTTCTGGATT 
CATAATAGCAAGATTAGCAAAGGATAAATGCCGAAGGTCACTTCATTCTGGACACAGTTGGAT 
CAATACTGATTAAGTAGAAAATCCAAGCTTTGCTTGAGAACTTTTGTAACGTGGAGAGTAAAA 
AGTATCGGTTTTA 
FIGURE 530

MAASAGAGAVIAAPDSRRWLWSVLAAALGLLTAGVSALEVYTPKEIFVANGTQGKLTCKFKST 

STTGGLTSVSWSFQPEGADTTVSFFHYSQGQVYLGNYPPFKDRISWAGDLDKKDASINIENMQ 

FIHNGTYICDVKNPPDIVVQPGHIRLYVVEKENLPVFPVWVVVGIVTAVVLGLTLLISMILAV

LYRRKNSKRDYTGCSTSESLSPVKQAPRKSPSDTEGLVKSLPSGSHQGPVIYAQLDHSGGHHS 
DKINKSESVVYADIRKN

Important features: 
Signal peptide: 

amino acids 1-37 



Transmembrane domain: 

amino acids 161-183 




FIGURE 531



GTGACACTATAGAAGAGCTATGACGTCGCATGCACGCGTACGTAAGCTCGGAATTCGGCTCGA 

GGCTGGTGGGAAGAAGCCGAGATGGCGGCAGCCAGCGCTGGGGCAACCCGGCTGCTCCTGCTC 

TTGCTGATGGCGGTAGCAGCGCCCAGTCGAGCCCGGGGCAGCGGCTGCCGGGCCGGGACTGGT 

GCGCGAGGGGCTGGGGCGGAAGGTCGAGAGGGCGAGGCCTGTGGCACGGTGGGGCTGCTGCTG

GAGCACTCATTTGAGATCGATGACAGTGCCAACTTCCGGAAGCGGGGCTCACTGCTCTGGAAC 

CAGCAGGATGGTACCTTGTCCCTGTCACAGCGGCAGCTCAGCGAGGAGGAGCGGGGCCGACTC 

CGGGATGTGGCAGCCCTGAATGGCCTGTACCGGGTCCGGATCCCAAGGCGACCCGGGGCCCTG 

GATGGCCTGGAAGCTGGTGGCTATGTCTCCTCCTTTGTCCCTGCGTGCTCCCTGGTGGAGTCG 

CACCTGTCGGACCAGCTGACCCTGCACGTGGATGTGGCCGGCAACGTGGTGGGCGTGTCGGTG 

GTGACGCACCCCGGGGGCTGCCGGGGCCATGAGGTGGAGGACGTGGACCTGGAGCTGTTCAAC 

ACCTCGGTGCAGCTGCAGCCGCCCACCACAGCCCCAGGCCCTGAGACGGCGGCCTTCATTGAG 

CGCCTGGAGATGGAACAGGCCCAGAAGGCCAAGAACCCCCAGGAGCAGAAGTCCTTCTTCGCC 

AAATACTGGATGTACATCATTCCCGTCGTCCTGTTCCTCATGATGTCAGGAGCGCCAGACACC 

GGGGGCCAGGGTGGGGGTGGGGGTGGGGGTGGTGGTGGGGGTAGTGGCCTTTGCTGTGTGCCA 

CCCTCCCTGTAAGTCTATTTAAAAACATCGACGATACATTGAAATGTGTGAACGTTTTGAAAA

GCTACAGCTTCCAGCAGCCAAAAGC7\ACTGTTGTTTTGGCAAGACGGTCCTGATGTACAAGCT 

TGATTGAAATTCACTGCTCACTTGATACGTTATTCAGAAACCCAAGGAATGGCTGTCCCCATC 

CTCATGTGGCTGTGTGGAGCTCAGCTGTGTTGTGTGGCAGTTTATTAAACTGTCCCCCAGATC 

GACACGCAAAAAAAAA 
FIGURE 532

MAAASAGATRLLLLLLMAVAAPSRARGSGCRAGTGARGAGAEGREGEACGTVGLLLEHSFEID 
DSANFRKRGSLLWNQQDGTLSLSQRQLSEEERGRLRDVAALNGLYRVRIPRRPGALDGLEAGG 
YVSSFVPACSLVESHLSDQLTLHVDVAGNVVGVSVVTHPGGCRGHEVEDVDLELFNTSVQLQP
PTTAPGPETAAFIERLEMEQAQKAKNPQEQKSFFAKYWMYIIPVVLFLMMSGAPDTGGQGGGG
GGGGGGGSGLCCVPPSL 

Important features:
Signal peptide: 

amino acids 1-24 



Transmembrane domain: 

amino acids 226-243 
FIGURE 533



TCTGCCTCCACTGCTCTGTGCTGGGATCATGGAACTTGCACTGCTGTGTGGGCTGGTGGTGAT 
GGCTGGTGTGATTCCAATCCAGGGCGGGATCCTGAACCTGAACAAGATGGTCAAGCAAGTGAC 
TGGGAAAATGCCCATCCTCTCCTACTGGCCCTACGGCTGTCACTGCGGACTAGGTGGCAGAGG 
CCAACCCAAAGATGCCACGGACTGGTGCTGCCAGACCCATGACTGCTGCTATGACCACCTGAA 
GACCCAGGGGTGCGGCATCTACAAGGACAACAACAAAAGCAGCATACATTGTATGGATTTATC 
TCAACGCTATTGTTTAATGGCTGTGTTTAATGTGATCTATCTGGAAAATGAGGACTCCGAATA 
AAAAGCTATTACTAWTTNAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATVAAAAAAAAAAAA 
FIGURE 534



MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQPKDATDWC 
CQTHDCCYDHLKTQGCGIYKDNNKSSIHCMDLSQRYCLMAVFNVIYLENEDSE 

Important features: 
Signal peptide:

amino acids 1-17 

Transmembrane domain: 

amino acids 1-24 

N-glycosylation site. 

amino acids 86-89 

N-myristoylation sites. 

amino acids 20-25, 45-50



Phospholipase A2 histidine active site. 

amino acids 63-70 
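
Site annotations of this kind can be reproduced by scanning the sequence for short consensus patterns. The sketch below is illustrative only. It assumes the PROSITE-style consensus patterns commonly quoted for N-glycosylation (N-{P}-[ST]-{P}) and N-myristoylation (G-{EDRKHPFYW}-x(2)-[STAGCN]-{P}); the document itself does not state which patterns or software produced the feature list, and the function name `find_sites` is hypothetical.

```python
import re

# Figure 534 amino acid sequence, copied from the figure (116 residues).
FIG534 = (
    "MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQPKDATDWC"
    "CQTHDCCYDHLKTQGCGIYKDNNKSSIHCMDLSQRYCLMAVFNVIYLENEDSE"
)

# Assumed consensus patterns, written as regular expressions.
# The lookahead wrapper lets overlapping sites be reported as well.
N_GLYCOSYLATION = re.compile(r"(?=(N[^P][ST][^P]))")
N_MYRISTOYLATION = re.compile(r"(?=(G[^EDRKHPFYW]..[STAGCN][^P]))")


def find_sites(seq, pattern):
    """Return (first, last, residues) for each match, 1-based and inclusive.

    Hypothetical helper for illustration; not part of the source document.
    """
    return [
        (m.start() + 1, m.start() + len(m.group(1)), m.group(1))
        for m in pattern.finditer(seq)
    ]


print(find_sites(FIG534, N_GLYCOSYLATION))   # [(86, 89, 'NKSS')]
print(find_sites(FIG534, N_MYRISTOYLATION))  # [(20, 25, 'GGILNL'), (45, 50, 'GCHCGL')]
```

Under these assumed patterns the scan returns the same residue ranges as the feature list above for the N-glycosylation and N-myristoylation sites of this sequence.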
FIGURE 535



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTG 
AACTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCC 
CCAAATTGCCTGGAAGAATACATCATGTTTTTCGATAAGAAGAAATTGTAGGATCCAGTTTTT 
TTTTTAACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCAT 
GAAGATCCTATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTT 
GTTCTTGGAGTGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAG 
GGGTCCAATTTTTCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGG 
CTGTCATGCAACTGGCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCTTTGAACAATACAA 

aggatgggtttcaatgtaattaggctactgagcggatcagctgtagcactggttatagccccc 
actgtcttactgacaatgctttcttctgccgaacgaggatgccctaagggctgtaggtgtgaa 
ggcaaaatggtatattgtgaatctcagaaattacaggagataccctcaagtatatctgctggt 
tgcttaggtttgtcccttcgctataacagccttcaaaaacttaagtataatcaatttaaaggg 
ctcaaccagctcacctggctataccttgaccataaccatatcagcaatattgacgaaaatgct 
tttaatggaatacgcagactcaaagagctgattcttagttccaatagaatctcctattttctt 
aacaataccttcagacctgtgacaaatttacggaacttggatctgtcctataatcagctgcat 
tctctgggatctgaacagtttcggggcttgcggaagctgctgagtttacatttacggtctaac 
tccctgagaaccatccctgtgcgaatattccaagactgccgcaacctggaacttttggacctg 
ggatataaccggatccgaagtttagccaggaatgtctttgctggcatgatcagactcaaagaa 
cttcacctggagcacaatcaattttccaagctcaacctggccctttttccaaggttggtcagc
cttcagaacctttacttgcagtggaataaaatcagtgtcataggacagaccatgtcctggacc 
tggagctccttacaaaggcttgatttatcaggcaatgagatcgaagctttcagtggacccagt 
gttttccagtgtgtcccgaatctgcagcgcctcaacctggattccaacaagctcacatttatt 
ggtcaagagattttggattcttggatatccctcaatgacatcagtcttgctgggaatatatgg
gaatgcagcagaaatatttgctcccttgtaaactggctgaaaagttttaaaggtctaagggag 
aatacaattatctgtgccagtcccaaagagctgcaaggagtaaatgtgatcgatgcagtgaag 
aactacagcatctgtggcaaaagtactacagagaggtttgatctggccagggctctcccaaag 
ccgacgtttaagcccaagctccccaggccgaagcatgagagcaaaccccctttgcccccgacg 
gtgggagccacagagcccggcccagagaccgatgctgacgccgagcacatctctttccataaa 
atcatcgcgggcagcgtggcgcttttcctgtccgtgctcgtcatcctgctggttatctacgtg 
tcatggaagcggtaccctgcgagcatgaagcagctgcagcagcgctccctcatgcgaaggcac 
aggaaaaagaaaagacagtccctaaagcaaatgactcccagcacccaggaattttatgtagat 
tataaacccaccaacacggagaccagcgagatgctgctgaatgggacgggaccctgcacctat 

AACAAATCGGGCTCCAGGGAGTGTGAGGTAT^ACCATTGTGATAAAAAGAGCTCTTAAAAGC 

tgggaaataagtggtgctttattgaactctggtgactatcaagggaacgcgatgccccccctc 
cccttccctctccctctcactttggtggcaagatccttccttgtccgttttagtgcattcata 
atactggtcattttcctctcatacataatcaacccattgaaatttaaataccacaatcaatgt 
gaagcttgaactccggtttaatataatacctattgtataagaccctttactgattccattaat 

GTCGCATTTGTTTTAAGATAAAACTTCTTTCATAGGTAAAAAAAAT^ 
FIGURE 536



mgfnvirllsgsavalviaptvlltmlssaergcpkgcrcegkmvycesqklqeipssisagc 

lglslrynslqklkynqfkglnqltwlyldhnhisnidenafngirrlkelilssnrisyfln 

ntfrpvtnlrnldlsynqlhslgseqfrglrkllslhlrsnslrtipvrifqdcrnlelldlg 

ynrirslarnvfagmirlkelhlehnqfsklnlalfprlvslqnlylqwnkisvigqtmswtw 

sslqrldlsgneieafsgpsvfqcvpnlqrlnldsnkltfigqeildswislndislagniwe 

csrnicslvnwlksfkglrentiicaspkelqgvnvidavknysicgkstterfdlaralpkp 

tfkpklprpkheskpplpptvgatepgpetdadaehisfhkiiagsvalflsvlvillviyvs 

wkrypasmkqlqqrslmrrhrkkkrqslkqmtpstqefyvdykptntetsemllngtgpctyn
ksgsrecev 



Important features: 
Signal peptide: 

amino acids 1-33 



Transmembrane domain: 

amino acids 420-442 



N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 



Tyrosine kinase phosphorylation site.

amino acids 136-142 



N-myristoylation sites.

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
FIGURE 537



GGGACTACAAGCCGCGCCGCGCTGCCGCTGGCCCCTCAGCAACCCTCGACATGGCGCTGAGGCGGCCACCGCGAC 
TCCGGCTCTGCGCTCGGCTGCCTGACTTCTTCCTGCTGCTGCTTTTCAGGGGCTGCCTGATAGGGGCTGTAAATC 
TCAAATCCAGCAATCGAACCCCAGTGGTACAGGAATTTGAAAGTGTGGAACTGTCTTGCATCATTACGGATTCGC 
AGACAAGTGACCCCAGGATCGAGTGGAAGAAAATTCAAGATGAACAAACCACATATGTGTTTTTTGACAACAAAA 
TTCAGGGAGACTTGGCGGGTCGTGCAGAAATACTGGGGAAGACATCCCTGAAGATCTGGAATGTGACACGGAGAG 
ACTCAGCCCTTTATCGCTGTGAGGTCGTTGCTCGAAATGACCGCAAGGAAATTGATGAGATTGTGATCGAGTTAA
CTGTGCAAGTGAAGCCAGTGACCCCTGTCTGTAGAGTGCCGAAGGCTGTACCAGTAGGCAAGATGGCAACACTGC 
ACTGCCAGGAGAGTGAGGGCCACCCCCGGCCTCACTACAGCTGGTATCGCAATGATGTACCACTGCCCACGGATT 
CCAGAGCCAATCCCAGATTTCGCAATTCTTCTTTCCACTTAAACTCTGAAACAGGCACTTTGGTGTTCACTGCTG 
TTCACAAGGACGACTCTGGGCAGTACTACTGCATTGCTTCCAATGACGCAGGCTCAGCCAGGTGTGAGGAGCAGG 
AGATGGAAGTCTATGACCTGAACATTGGCGGAATTATTGGGGGGGTTCTGGTTGTCCTTGCTGTACTGGCCCTGA 
TCACGTTGGGCATCTGCTGTGCATACAGACGTGGCTACTTCATCAACAATAAACAGGATGGAGAAAGTTACAAGA 
ACCCAGGGAAACCAGATGGAGTTAACTACATCCGCACTGACGAGGAGGGCGACTTCAGACACAAGTCATCGTTTG 
TGATCTGAGACCCGCGGTGTGGCTGAGAGCGCACAGAGCGCACGTGCACATACCTCTGCTAGAAACTCCTGTCAA 
GGCAGCGAGAGCTGATGCACTCGGACAGAGCTAGACACTCATTCAGAAGCTTTTCGTTTTGGCCAAAGTTGACCA 
CTACTCTTCTTACTCTAACAAGCCACATGAATAGAAGAATTTTCCTCAAGATGGACCCGGTAAATATAACCACAA 
GGAAGCGAAACTGGGTGCGTTCACTGAGTTGGGTTCCTAATCTGTTTCTGGCCTGATTCCCGCATGAGTATTAGG 
GTGATCTTAAAGAGTTTGCTCACGTAAACGCCCGTGCTGGGCCCTGTGAAGCCAGCATGTTCACCACTGGTCGTT 
CAGCAGCCACGACAGCACCATGTGAGATGGCGAGGTGGCTGGACAGCACCAGCAGCGCATCCCGGCGGGAACCCA 
GAAAAGGCTTCTTACACAGCAGCCTTACTTCATCGGCCCACAGACACCACCGCAGTTTCTTCTTAAAGGCTCTGC 
TGATCGGTGTTGCAGTGTCCATTGTGGAGAAGCTTTTTGGATCAGCATTTTGTAAAAACAACCAAAATCAGGAAG 
GTAAATTGGTTGCTGGAAGAGGGATCTTGCCTGAGGAACCCTGCTTGTCCAACAGGGTGTCAGGATTTAAGGAAA 
ACCTTCGTCTTAGGCTAAGTCTGAAATGGTACTGAAATATGCTTTTCTATGGGTCTTGTTTATTTTATAAAATTT 
TACATCTAAATTTTTGCTAAGGATGTATTTTGATTATTGAAAAGAAAATTTCTATTTAAACTGTAAATATATTGT 
CATACAATGTTAAATAACCTATTTTTTTAAAAAAGTTCAACTTAAGGTAGAAGTTCCAAGCTACTAGTGTTAAAT 
TGGAAAATATCAATAATTAAGAGTATTTTACCCAAGGAATCCTCTCATGGAAGTTTACTGTGATGTTCCTTTTCT 
CACACAAGTTTTAGCCTTTTTCACAAGGGAACTCATACTGTCTACACATCAGACCATAGTTGCTTAGGAAACCTT 
TAAAAATTCCAGTTAAGCAATGTTGAAATCAGTTTGCATCTCTTCAAAAGAAACCTCTCAGGTTAGCTTTGAACT 
GCCTCTTCCTGAGATGACTAGGACAGTCTGTACCCAGAGGCCACCCAGAAGCCCTCAGATGTACATACACAGATG 
CCAGTCAGCTCCTGGGGTTGCGCCAGGCGCCCCCGCTCTAGCTCACTGTTGCCTCGCTGTCTGCCAGGAGGCCCT 
GCCATCCTTGGGCCCTGGCAGTGGCTGTGTCCCAGTGAGCTTTACTCACGTGGCCCTTGCTTCATCCAGCACAGC 
TCTCAGGTGGGCACTGCAGGGACACTGGTGTCTTCCATGTAGCGTCCCAGCTTTGGGCTCCTGTAACAGACCTCT 
TTTTGGTTATGGATGGCTCACAAAATAGGGCCCCCAATGCTATTTTTTTTTTTTAAGTTTGTTTAATTATTTGTT 
AAGATTGTCTAAGGCCAAAGGCAATTGCGAAATCAAGTCTGTCAAGTACAATAACATTTTTAAAAGAAAATGGAT 
CCCACTGTTCCTCTTTGCCACAGAGAAAGCACCCAGACGCCACAGGCTCTGTCGCATTTCAAAACAAACCATGAT 
GGAGTGGCGGCCAGTCCAGCCTTTTAAAGAACGTCAGGTGGAGCAGCCAGGTGAAAGGCCTGGCGGGGAGGAAAG 
TGAAACGCCTGAATCAAAAGCAGTTTTCTAATTTTGACTTTAAATTTTTCATCCGCCGGAGACACTGCTCCCATT 
TGTGGGGGGACATTAGCAACATCACTCAGAAGCCTGTGTTCTTCAAGAGCAGGTGTTCTCAGCCTCACATGCCCT 
GCCGTGCTGGACTCAGGACTGAAGTGCTGTAAAGCAAGGAGCTGCTGAGAAGGAGCACTCCACTGTGTGCCTGGA 
GAATGGCTCTCACTACTCACCTTGTCTTTCAGCTTCCAGTGTCTTGGGTTTTTTATACTTTGACAGCTTTTTTTT 
AATTGCATACATGAGACTGTGTTGACTTTTTTTAGTTATGTGAAACACTTTGCCGCAGGCCGCCTGGCAGAGGCA 
GGAAATGCTCCAGCAGTGGCTCAGTGCTCCCTGGTGTCTGCTGCATGGCATCCTGGATGCTTAGCATGCAAGTTC 
CCTCCATCATTGCCACCTTGGTAGAGAGGGATGGCTCCCCACCCTCAGCGTTGGGGATTCACGCTCCAGCCTCCT 
TCTTGGTTGTCATAGTGATAGGGTAGCCTTATTGCCCCCTCTTCTTATACCCTAAAACCTTCTACACTAGTGCCA 
TGGGAACCAGGTCTGAAAAAGTAGAGAGAAGTGAAAGTAGAGTCTGGGAAGTAGCTGCCTATAACTGAGACTAGA 
CGGAAAAGGAATACTCGTGTATTTTAAGATATGAATGTGACTCAAGACTCGAGGCCGATACGAGGCTGTGATTCT 
GCCTTTGGATGGATGTTGCTGTACACAGATGCTACAGACTTGTACTAACACACCGTAATTTGGCATTTGTTTAAC 
CTCATTTATA/^GCTTCAAAAAAACCCA 
FIGURE 538

MALRRPPRLRLCARLPDFFLLLLFRGCLIGAVNLKSSNRTPVVQEFESVELSCIITDSQTSDP 
RIEWKKIQDEQTTYVFFDNKIQGDLAGRAEILGKTSLKIWNVTRRDSALYRCEVVARNDRKEI
DEIVIELTVQVKPVTPVCRVPKAVPVGKMATLHCQESEGHPRPHYSWYRNDVPLPTDSRANPR 
FRNSSFHLNSETGTLVFTAVHKDDSGQYYCIASNDAGSARCEEQEMEVYDLNIGGIIGGVLVV
LAVLALITLGICCAYRRGYFINNKQDGESYKNPGKPDGVNYIRTDEEGDFRHKSSFVI 

Important features:
Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 243-263

N-glycosylation sites. 

amino acids 104-107, 192-195 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 107-110 

Casein kinase II phosphorylation site. 

amino acids 106-109, 296-299 

Tyrosine kinase phosphorylation site. 

amino acids 69-77 

N-myristoylation sites.

amino acids 26-31, 215-220, 226-231, 243-248, 244-249, 262-267 
CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGC
GCAACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAA 
CTTCAGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGG 
CACAGGCGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGG 
GCTCGGGCGGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGG 
CTGCGCCCTGGGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGA 
TGAGCCGCGTGGTCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCC 
GCCGCGTGGTCAGCGGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGG 
CCTACTTCCATGAACTGTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTG 
AGGGAGGAGTCCTCCTCAGCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGC 
AAAACCTGACAAAACCCGGGACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGA 
ATGGAGATGGGCAAACATCTGGTGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATT 
CCCAGTACCGAAACTGGTACACAGATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGT 
ATCACCAACCAACTGCCAATCCTGGCCTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACA 
GGTGTAACATGAAGCACAATTATATTTGCAAGTATGAACCAGAGATTAATCCAACAGCCCCTG 
TAGAAAAGCCTTATCTTACAAATCAACCAGGAGACACCCATCAGAATGTGGTTGTTACTGAAG 
CAGGTATAATTCCCAATCTAATTTATGTTGTTATACCAACAATACCCCTGCTCTTACTGATAC
TGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCTGCATAAAAGTAAAGGAAGAACAAAAACTA 
GTCCAAACCAGTCTACACTGTGGATTTCAAAGAGTACCAGAAAAGAAAGTGGCATGGAAGTAT 
AATAACTCATTGACTTGGTTCCAGAATTTTGTAATTCTGGATCTGTATAAGGAATGGCATCAG 
AACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGCAAGATGAACTGTAAGCTCCCCCT 
TGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCATTTAAAGAATATGCTGTGCTA 
ATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAAACTTCAAACTTCAAGCAAA 
TGAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTATGTGTGTTAG/^GCAAT 
TCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAATGTATATTGTATTGA 
AATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAAATGAACTGTTCTA 
ATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAGAAACTTATTAC 
TGTTGTCAACTGAATTCACACACACACAAATATAGTACCATAGAAAAAGTTTGTTTTCTCGAA 
ATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGAAATAAGA 
AGCTATTTCATTAAGTGTGATATAAACCTCCTCAAACATTTTACTTAGAGGCAAGGATTGTCT 
AATTTCAATTGTGCAAGACATGTGCCTTATAATTATTTTTAGCTTA/yy^TTAAACAGATTTTG 
TAATAATGTAACTTTGTTAATAGGTGCATAAACACTAATGCAGTCAATTTGAACAAAAGAAGT 
GACATACACAATATAAATCATATGTCTTCACACGTTGCCTATATAATGAGAAGCAGCTCTCTG 
AGGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTT 
GGGATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGC 
CTCTGACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAG 
TGTGGAGACAAGCACAGCACACAGACATTTTAGGAAGGAAAGGAACTACGAAATCGTGTGAAA 
ATGGGTTGGAACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATG 
GTGGCTCCTTTCTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGA 
AAGTTGTAACTCTCTGGTCTTCATATGTCCCTGTGCTCCTTTTAACC/yyVTAAAGAGTTCTTG 
TTTCTGGGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
MSRVVSLLLGAALLCGHGAFCRRVVSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACES

EGGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSN 

SQYRNWYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAP 

VEKPYLTNQPGDTHQNVVVTEAGIIPNLIYVVIPTIPLLLLILVAFGTCCFQMLHKSKGRTKT
SPNQSTLWISKSTRKESGMEV 

Important features: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites. 

amino acids 86-89 and 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 266-269 



N-myristoylation sites.

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140-145 
and 212-217 
GGAGAATGGAGAGAGCAGTGAGAGTGGAGTCCGGGGTCCTGGTCGGGGTGGTCTGTCTGCTCCTGGCATGCCCTG
CCACAGCCACTGGGCCCGAAGTTGCTCAGCCTGAAGTAGACACCACCCTGGGTCGTGTGCGAGGCCGGCAGGTGG 
GCGTGAAGGGCACAGACCGCCTTGTGAATGTCTTTCTGGGCATTCCATTTGCCCAGCCGCCACTGGGCCCTGACC 
GGTTCTCAGCCCCACACCCAGCACAGCCCTGGGAGGGTGTGCGGGATGCCAGCACTGCGCCCCCAATGTGCCTAC 
AAGACGTGGAGAGCATGAACAGCAGCAGATTTGTCCTCAACGGAAAACAGCAGATCTTCTCCGTTTCAGAGGACT 
GCCTGGTCCTCAACGTCTATAGCCCAGCTGAGGTCCCCGCAGGGTCCGGTAGGCCGGTCATGGTATGGGTCCATG 
GAGGCGCTCTGATAACTGGCGCTGCCACCTCCTACGATGGATCAGCTCTGGCTGCCTATGGGGATGTGGTCGTGG 
TTACAGTCCAGTACCGCCTTGGGGTCCTTGGCTTCTTCAGCACTGGAGATGAGCATGCACCTGGCAACCAGGGCT 
TCCTAGATGTGGTAGCTGCTTTGCGCTGGGTGCAAGAAAACATCGCCCCCTTCGGGGGTGACCTCAACTGTGTCA 
CTGTCTTTGGTGGATCTGCCGGTGGGAGCATCATCTCTGGCCTGGTCCTGTCCCCAGTGGCTGCAGGGCTGTTCC 
ACAGAGCCATCACACAGAGTGGGGTCATCACCACCCCAGGGATCATCGACTCTCACCCTTGGCCCCTAGCTCAGA 
AAATCGCAAACACCTTGGCCTGCAGCTCCAGCTCCCCGGCTGAGATGGTGCAGTGCCTTCAGCAGAAAGAAGGAG 
AAGAGCTGGTCCTTAGCAAGAAGCTGAAAAATACTATCTATCCTCTCACCGTTGATGGCACTGTCTTCCCCAAAA 
GCCCCAAGGAACTCCTGAAGGAGAAGCCCTTCCACTCTGTGCCCTTCCTCATGGGTGTCAACAACCATGAGTTCA
GCTGGCTCATCCCCAGGGGCTGGGGTCTCCTGGATACAATGGAGCAGATGAGCCGGGAGGACATGCTGGCCATCT 
CAACACCCGTCTTGACCAGTCTGGATGTGCCCCCTGAGATGATGCCCACCGTCATAGATGAATACCTAGGAAGCA 
ACTCGGACGCACAAGCCAAATGCCAGGCGTTCCAGGAATTCATGGGTGACGTATTCATCAATGTTCCCACCGTCA 
GTTTTTCAAGATACCTTCGAGATTCTGGAAGCCCTGTCTTTTTCTATGAGTTCCAGCATCGACCCAGTTCTTTTG 
CGAAGATCAAACCTGCCTGGGTGAAGGCTGATCATGGGGCCGAGGGTGCTTTTGTGTTCGGAGGTCCCTTCCTCA 
TGGACGAGAGCTCCCGCCTGGCCTTTCCAGAGGCCACAGAGGAGGAGAAGCAGCTAAGCCTCACCATGATGGCCC 
AGTGGACCCACTTTGCCCGGACAGGGGACCCCAATAGCAAGGCTCTGCCTCCTTGGCCCCAATTCAACCAGGCGG 
AACAATATCTGGAGATCAACCCAGTGCCACGGGCCGGACAGAAGTTCAGGGAGGCCTGGATGCAGTTCTGGTCAG 
AGACGCTCCCCAGCAAGATACAACAGTGGCACCAGAAGCAGAAGAACAGGAAGGCCCAGGAGGACCTCTGAGGCC 
AGGCCTGAACCTTCTTGGCTGGGGCAAACCACTCTTCAAGTGGTGGCAGAGTCCCAGCACGGCAGCCCGCCTCTC 
CCCCTGCTGAGACTTTAATCTCCACCAGCCCTTAAAGTGTCGGCCGCTCTGTGACTGGAGTTATGCTCTTTTGAA 
ATGTCACAAGGCCGCCTCCCACCTCTGGGGCATTGTACAAGTTCTTCCCTCTCCCTGAAGTGCCTTTCCTGCTTT 
CTTCGTGGTAGGTTCTAGCACATTCCTCTAGCTTCCTGGAGGACTCACTCCCCAGGAAGCCTTCCCTGCCTTCTC 
TGGGCTGTGCGGCCCCGAGTCTGCGTCCATTAGAGCACAGTCCACCCGAGGCTAGCACCGTGTCTGTGTCTGTCT 
CCCCCTCAGAGGAGCTCTCTCAAAATGGGGATTAGCCTAACCCCACTCTGTCACCCACACCAGGATCGGGTGGGA 
CCTGGAGCTAGGGGGTGTTTGCTGAGTGAGTGAGTGAAACACAGAATATGGGAATGGCAGCTGCTGAACTTGAAC 
CCAGAGCCTTCAGGTGCCAAAGCCATACTCAGGCCCCCACCGACATTGTCCACCCTGGCCAGAAGGGTGCATGCC 
AATGGCAGAGACCTGGGATGGGAGAAGTCCTGGGGCGCCAGGGGATCCAGCCTAGAGCAGACCTTAGCCCCTGAC 
TAAGGCCTCAGACTAGGGCGGGAGGGGTCTCCTCCTCTCTGCTGCCCAGTCCTGGCCCCTGCACAAGACAACAGA 
ATCCATCAGGGCCATGAGTGTCACCCAGACCTGACCCTCACCAATTCCAGCCCCTGACCCTCAGGACGCTGGATG 
CCAGCTCCCAGCCCCAGTGCCGGGTCCTCCCTCCCTTCCTGGCTTGGGGAGACCAGTTTCTGGGGAGCTTCCAAG 
AGCACCCACCAAGACACAGCAGGACAGGCCAGGGGAGGGCATCTGGACCAGGGCATCCGTCGGGCTATTGTCACA 
GAGAAAAGAAGAGACCCACCCACTCGGGCTGCAAAAGGTGAAAAGCACCAAGAGGTTTTCAGATGGAAGTGAGAG 
GTGACAGTGTGCTGGCAGCCCTCACAGCCCTCGCTTGCTCTCCCTGCCGCCTCTGCCTGGGCTCCCACTTTGGCA 
GCACTTGAGGAGCCCTTCAACCCGCCGCTGCACTGTAGGAGCCCCTTTCTGGGCTGGCCAAGGCCGGAGCCAGCT 
CCCTCAGCTTGCGGGGAGGTGCGGAGGGAGAGGGGCGGGCAGGAACCGGGGCTGCGCGCAGCGCTTGCGGGCCAG 
AGTGAGTTCCGGGTGGGCGTGGGCTCGGCGGGGCCCCACTCAGAGCAGCTGGCCGGCCCCAGGCAGTGAGGGCCT 
TAGCACCTGGGCCAGCAGCTGCTGTGCTCGATTTCTCGCTGGGCCTTAGCTGCCTCCCCGCGGGGCAGGGCTCGG 
GACCTGCAGCCCTCCATGCCTGACCCTCCCCCCACCCCCCGTGGGCTCCTGTGCGGCCGGAGCCTCCCCAAGGAG 
CGCCGCCCCCTGCTCCACAGCGCCCAGTCCCATCGACCACCCAAGGGCTGAGGAGTGCGGGTGCACAGCGCGGGA 
CTGGCAGGCAGCTCCACCTGCTGCCCCAGTGCTGGATCCACTGGGTGAAGCCAGCTGGGCTCCTGAGTCTGGTGG 
GGACTTGGAGAACCTTTATGTCTAGCTAAGGGATTGTAAATACACCGATGGGCACTCTGTATCTAGCTCAAGGTT 
TGTAAACACACCAATCAGCACCCTGTGTCTAGCTCAGTGTTTGTGAATGCACCAATCCACACTCTGTATCTGGCT 
ACTCTGGTGGGGACTTGGAGAACCTTTGTGTCCACACTCTGTATCTAGCTAATCTAGTGGGGATGTGGAGAACCT 
TTGTGTCTAGCTCAGGGATCGTAAACGCACCAATCAGCACCCTGTCAAAACAGACCACTTGACTCTCTGTAAAAT 
GGACCAATCAGCAGGATGTGGGTGGGGCGAGACAAGAGAATAAAAGCAGGCTGCCTGAGCCAGCAGTGACAACCC 
CCCTCGGGTCCCCTCCCACGCCGTGGAAGCTTTGTTCTTTCGCTCTTTGCAATAAATCTTGCTACTGCCCAAAA 
MERAVRVESGVLVGVVCLLLACPATATGPEVAQPEVDTTLGRVRGRQVGVKGTDRLVNVFLGI

PFAQPPLGPDRFSAPHPAQPWEGVRDASTAPPMCLQDVESMNSSRFVLNGKQQIFSVSEDCLV 

LNVYSPAEVPAGSGRPVMVWVHGGALITGAATSYDGSALAAYGDVVVVTVQYRLGVLGFFSTG

DEHAPGNQGFLDVVAALRWVQENIAPFGGDLNCVTVFGGSAGGSIISGLVLSPVAAGLFHRAI

TQSGVITTPGIIDSHPWPLAQKIANTLACSSSSPAEMVQCLQQKEGEELVLSKKLKNTIYPLT 

VDGTVFPKSPKELLKEKPFHSVPFLMGVNNHEFSWLIPRGWGLLDTMEQMSREDMLAISTPVL 

TSLDVPPEMMPTVIDEYLGSNSDAQAKCQAFQEFMGDVFINVPTVSFSRYLRDSGSPVFFYEF 

QHRPSSFAKIKPAWVKADHGAEGAFVFGGPFLMDESSRLAFPEATEEEKQLSLTMMAQWTHFA 

RTGDPNSKALPPWPQFNQAEQYLEINPVPRAGQKFREAWMQFWSETLPSKIQQWHQKQKNRKA
QEDL 

Important features:
Signal peptide: 

amino acids 1-27 

Transmembrane domain: 

amino acids 226-245 

N-glycosylation site.

amino acids 105-109 

N-myristoylation sites. 

amino acids 10-16, 49-55, 62-68, 86-92, 150-156, 155-161,
162-168, 217-223, 227-233, 228-234, 232-238, 262-268, 357-363,
461-467

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 12-23 

Carboxylesterases type-B serine active site. 

amino acids 216-232 



TGTCGCCTGGCCCTCGCCATGCAGACCCCGCGAGCGTCCCCTCCCCGCCCGGCCCTCCTGCTTCTGCTGCTGCTA
CTGGGGGGCGCCCACGGCCTCTTTCCTGAGGAGCCGCCGCCGCTTAGCGTGGCCCCCAGGGACTACCTGAACCAC 
TATCCCGTGTTTGTGGGCAGCGGGCCCGGACGCCTGACCCCCGCAGAAGGTGCTGACGACCTCAACATCCAGCGA 
GTCCTGCGGGTCAACAGGACGCTGTTCATTGGGGACAGGGACAACCTCTACCGCGTAGAGCTGGAGCCCCCCACG 
TCCACGGAGCTGCGGTACCAGAGGAAGCTGACCTGGAGATCTAACCCCAGCGACATAAACGTGTGTCGGATGAAG 
GGCAAACAGGAGGGCGAGTGTCGAAACTTCGTAAAGGTGCTGCTCCTTCGGGACGAGTCCACGCTCTTTGTGTGC 
GGTTCCAACGCCTTCAACCCGGTGTGCGCCAACTACAGCATA6ACACCCTGCAGCCCGTCGGAGACAACATCAGC 
GGTATGGCCCGCTGCCCGTACGACCCCAAGCACGCCAATGTTGCCCTCTTCTCTGACGGGATGCTCTTCACAGCT 
ACTGTTACCGACTTCCTAGCCATTGATGCTGTCATCTACCGCAGCCTCGGGGACAGGCCCACCCTGCGCACCGTG 
AAACATGACTCCAAGTGGTTCAAAGAGCCTTACTTTGTCCATGCGGTGGAGTGGGGCAGCCATGTCTACTTCTTC 
TTCCGGGAGATTGCGATGGAGTTTAACTACCTGGAGAAGGTGGTGGTGTCCCGCGTGGCCCGAGTGTGCAAGAAC 
GACGTGGGAGGCTCCCCCCGCGTGCTGGAGAAGCAGTGGACGTCCTTCCTGAAGGCGCGGCTCAACTGCTCTGTA 
CCCGGAGACTCCCATTTCTACTTCAACGTGCTGCAGGCTGTCACGGGCGTGGTCAGCCTCGGGGGCCGGCCCGTG 
GTCCTGGCCGTTTTTTCCACGCCCAGCAACAGCATCCCTGGCTCGGCTGTCTGCGCCTTTGACCTGACACAGGTG 
GCAGCTGTGTTTGAAGGCCGCTTCCGAGAGCAGAAGTCCCCCGAGTCCATCTGGACGCCGGTGCCGGAGGATCAG 
GTGCCTCGACCCCGGCCCGGGTGCTGCGCAGCCCCCGGGATGCAGTACAATGCCTCCAGCGCCTTGCCGGATGAC 
ATCCTCAACTTTGTCAAGACCCACCCTCTGATGGACGAGGCGGTGCCCTCGCTGGGCCATGCGCCCTGGATCCTG 
CGGACCCTGATGAGGCACCAGCTGACTCGAGTGGCTGTGGACGTGGGAGCCGGCCCCTGGGGCAACCAGACCGTT 
GTCTTCCTGGGTTCTGAGGCGGGGACGGTCCTCAAGTTCCTCGTCCGGCCCAATGCCAGCACCTCAGGGACGTCT 
GGGCTCAGTGTCTTCCTGGAGGAGTTTGAGACCTACCGGCCGGACAGGTGTGGACGGCCCGGCGGTGGCGAGACA 
GGGCAGCGGCTGCTGAGCTTGGAGCTGGACGCAGCTTCGGGGGGCCTGCTGGCTGCCTTCCCCCGCTGCGTGGTC 
CGAGTGCCTGTGGCTCGCTGCCAGCAGTACTCGGGGTGTATGAAGAACTGTATCGGCAGTCAGGACCCCTACTGC 
GGGTGGGCCCCCGACGGCTCCTGCATCTTCCTCAGCCCGGGCACCAGAGCCGCCTTTGAGCAGGACGTGTCCGGG 
GCCAGCACCTCAGGCTTAGGGGACTGCACAGGACTCCTGCGGGCCAGCCTCTCCGAGGACCGCGCGGGGCTGGTG 
TCGGTGAACCTGCTGGTAACGTCGTCGGTGGCGGCCTTCGTGGTGGGAGCCGTGGTGTCCGGCTTCAGCGTGGGC 
TGGTTCGTGGGCCTCCGTGAGCGGCGGGAGCTGGCCCGGCGCAAGGACAAGGAGGCCATCCTGGCGCACGGGGCG 
GGCGAGGCGGTGCTGAGCGTCAGCCGCCTGGGCGAGCGCAGGGCGCAGGGTCCCGGGGGCCGGGGCGGAGGCGGT 
GGCGGTGGCGCCGGGGTTCCCCCGGAGGCCCTGCTGGCGCCCCTGATGCAGAACGGCTGGGCCAAGGCCACGCTG 
CTGCAGGGCGGGCCCCACGACCTGGACTCGGGGCTGCTGCCCACGCCCGAGCAGACGCCGCTGCCGCAGAAGCGC 
CTGCCCACTCCGCACCCGCACCCCCACGCCCTGGGCCCCCGCGCCTGGGACCACGGCCACCCCCTGCTCCCGGCC 
TCCGCTTCATCCTCCCTCCTGCTGCTGGCGCCCGCCCGGGCCCCCGAGCAGCCCCCCGCGCCTGGGGAGCCGACC 
CCCGACGGCCGCCTCTATGCTGCCCGGCCCGGCCGCGCCTCCCACGGCGACTTCCCGCTCACCCCCCACGCCAGC 
CCGGACCGCCGGCGGGTGGTGTCCGCGCCCACGGGCCCCTTGGACCCAGCCTCAGCCGCCGATGGCCTCCCGCGG 
CCCTGGAGCCCGCCCCCGACGGGCAGCCTGAGGAGGCCACTGGGCCCCCACGCCCCTCCGGCCGCCACCCTGCGC 
CGCACCCACACGTTCAACAGCGGCGAGGCCCGGCCTGGGGACCGCCACCGCGGCTGCCACGCCCGGCCGGGCACA 
GACTTGGCCCACCTCCTCCCCTATGGGGGGGCGGACAGGACTGCGCCCCCCGTGCCCTAGGCCGGGGGCCCCCCG 
ATGCCTTGGCAGTGCCAGCCACGGGAACCAGGAGCGAGAGACGGTGCCAGAACGCCGGGGCCCGGGGCAACTCCG 
AGTGGGTGCTCAAGTCCCCCCCGCGACCCACCCGCGGAGTGGGGGGCCCCCTCCGCCACAAGGAAGCACAACCAG 
CTCGCCCTCCCCCTACCCGGGGCCGCAGGACGCTGAGACGGTTTGGGGGTGGGTGGGCGGGAGGACTTTGCTATG 
GATTTGAGGTTGACCTTATGCGCGTAGGTTTTGGTTTTTTTTTGCAGTTTTGGTTTCTTTTGCGGTTTTCTAACC 
AATTGCACAACTCCGTTCTCGGGGTGGCGGCAGGCAGGGGAGGCTTGGACGCCGGTGGGGAATGGGGGGCCACAG 
CTGCAGACCTAAGCCCTCCCCCACCCCTGGAAAGGTCCCTCCCCAACCCAGGCCCCTGGCGTGTGTGGGTGTGCG 
TGCGTGTGCGTGCCGTGTTCGTGTGCAAGGGGCCGGGGAGGTGGGCGTGTGTGTGCGTGCCAGCGAAGGCTGCTG 
TGGGCGTGTGTGTCAAGTGGGCCACGCGTGCAGGGTGTGTGTCCACGAGCGACGATCGTGGTGGCCCCAGCGGCC 
TGGGCGTTGGCTGAGCCGACGCTGGGGCTTCCAGAAGGCCCGGGGGTCTCCGAGGTGCCGGTTAGGAGTTTGAAC
CCCCCCCACTCTGCAGAGGGAAGCGGGGACAATGCCGGGGTTTCAGGCAGGAGACACGAGGAGGGCCTGCCCGGA 
AGTCACATCGGCAGCAGCTGTCTAAAGGGCTTGGGGGCCTGGGGGGCGGCGAAGGTGGGTGGGGCCCCTCTGTAA 
ATACGGCCCCAGGGTGGTGAGAGAGTCCCATGCCACCCGTCCCCTTGTGACCTCCCCCCTATGACCTCCAGCTGA 
CCATGCATGCCACGTGGCTGGCTGGGTCCTCTGCCCTCTTTGGAGTTTGCCTCCCCCAGCCCCCTCCCCATCAAT 
AAAACTCTGTTTACAACCAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MQTPRASPPRPALLLLLLLLGGAHGLFPEEPPPLSVAPRDYLNHYPVFVGSGPGRLTPAEGAD
DLNIQRVLRVNRTLFIGDRDNLYRVELEPPTSTELRYQRKLTWRSNPSDINVCRMKGKQEGEC 
RNFVKVLLLRDESTLFVCGSNAFNPVCANYSIDTLQPVGDNISGMARCPYDPKHANVALFSDG 
MLFTATVTDFLAIDAVIYRSLGDRPTLRTVKHDSKWFKEPYFVHAVEWGSHVYFFFREIAMEF
NYLEKVVVSRVARVCKNDVGGSPRVLEKQWTSFLKARLNCSVPGDSHFYFNVLQAVTGVVSLG
GRPVVLAVFSTPSNSIPGSAVCAFDLTQVAAVFEGRFREQKSPESIWTPVPEDQVPRPRPGCC
AAPGMQYNASSALPDDILNFVKTHPLMDEAVPSLGHAPWILRTLMRHQLTRVAVDVGAGPWGN
QTVVFLGSEAGTVLKFLVRPNASTSGTSGLSVFLEEFETYRPDRCGRPGGGETGQRLLSLELD
AASGGLLAAFPRCVVRVPVARCQQYSGCMKNCIGSQDPYCGWAPDGSCIFLSPGTRAAFEQDV
SGASTSGLGDCTGLLRASLSEDRAGLVSVNLLVTSSVAAFVVGAVVSGFSVGWFVGLRERREL
ARRKDKEAILAHGAGEAVLSVSRLGERRAQGPGGRGGGGGGGAGVPPEALLAPLMQNGWAKAT
LLQGGPHDLDSGLLPTPEQTPLPQKRLPTPHPHPHALGPRAWDHGHPLLPASASSSLLLLAPA
RAPEQPPAPGEPTPDGRLYAARPGRASHGDFPLTPHASPDRRRVVSAPTGPLDPASAADGLPR
PWSPPPTGSLRRPLGPHAPPAATLRRTHTFNSGEARPGDRHRGCHARPGTDLAHLLPYGGADR
TAPPVP

Important features:
Signal peptide:
amino acids 1-25
Transmembrane domains:
amino acids 318-339, 598-617
N-glycosylation sites.

amino acids 74-78, 155-159, 167-171, 291-295, 386-390, 441-445,
462-466

Glycosaminoglycan attachment sites.

amino acids 51-55, 573-577

cAMP- and cGMP-dependent protein kinase phosphorylation site.
amino acids 102-106
N-myristoylation sites.

amino acids 21-27, 50-56, 189-195, 333-339, 382-388, 448-454,
490-496, 491-497, 508-514, 509-515, 531-537, 558-564, 569-575,
574-580, 580-586, 610-616, 643-649, 663-669, 666-672, 667-673,
668-674, 669-675, 670-676, 868-874, 879-885
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GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGG 

GACCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAA 

ACAAGTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTT 

CCAGGCCTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACG 

ACCTGTGCCACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAG

GCTTGGCAGTTTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGT 

CTTGCCTTCAGCCTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAG 

ACACTCAATTTGGGAAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCT 

GAGATACGGGGCAGTGTGCAAGCCAAAGATGGAAACATTGACATCAGAATCTTAAGGAGGACT 

GAGTCTTTGCAAGACACAAAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTC 

TATCTGGACAGGGTATTTAAAAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGC 

AGCCTCGCCAATTCCTTTCTTACCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACA 

TGCCATTGTGGGGAGGAAGCAATGAAGAAATACAGCCAGATTCTGAGTCACTTTGAAAAGCTG 

GAACCTCAGGCAGCAGTTGTGAAGGCTTTGGGGGAACTAGACATTCTTCTGCAATGGATGGAG 

GAGACAGAATAGGAGGAAAGTGATGCTGCTGCTAAGAATATTCGAGGTCAAGAGCTCCAGTCT

TCAATACCTGCAGAGGAGGCATGACCCCAAACCACCATCTCTTTACTGTACTAGTCTTGTGCT 

GGTCACAGTGTATCTTATTTATGCATTACTTGCTTCCTTGCATGATTGTCTTTATGCATCCCC 

AATCTTAATTGAGACCATACTTGTATAAGATTTTTGTAATATCTTTCTGCTATTGGATATATT 

TATTAGTTAATATATTTATTTATTTTTTGCTATTTAATGTATTTATTTTTTTACTTGGACATG 

AAACTTTAAAAAAATTCACAGATTATATTTATAACCTGACTAGAGCAGGTGATGTATTTTTAT 

ACAGTAAAAAAAAAAAACCTTGTAAATTCTAGAAGAGTGGCTAGGGGGGTTATTCATTTGTAT 

TCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGATATTTGAAATTGAACCAATGAC 

TACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATTGCACATCTACCTTACAATTACTG 

ACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAGCCAGGAATCCTACACGG 

CCAGCATGTATTTCTACAAATAAAGTTTTCTTTGCATACCAAAAAAAAAAAAAAAAAAA 
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MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPE 

IFSSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQE 

IRNGFSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHY 

TLRKISSLANSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAVVKALGELDI
LLQWMEETE 

Important features:
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites.

amino acids 192-195, 225-228 



N-myristoylation sites.

amino acids 42-47, 46-51, 136-141 
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AGCAACTCAAGTTCATCATTGTCCTGAGAGAGAGGAGCAGCGCGGTTCTCGGCCGGGACAGCA 
GAACGCCAGGGGACCCTCACCTGGGCGCGCCGGGGCACGGGCTTTGATTGTCCTGGGGTCGCG 
GAGACCCGCGCGCCTGCCCTGCACGCCGGGCGGCAACCTTTGCAGTCGCGTTGGCTGCTGCGA 
TCGGCCGGCGGGTCCCTGCCGAAGGCTCGGCTGCTTCTGTCCACCTCTTACACTTCTTCATTT 
ATCGGTGGATCATTTCGAGAGTCCGTCTTGTAAATGTTTGGCACTTTGCTACTTTATTGCTTC 
TTTCTGGCGACAGTTCCAGCACTCGCCGAGACCGGCGGAGAAAGGCAGCTGAGCCCGGAGAAG 
AGCGAAATATGGGGACCCGGGCTAAAAGCAGACGTCGTCCTTCCCGCCCGCTATTTCTATATT 
CAGGCAGTGGATACATCAGGGAATAAATTCACATCTTCTCCAGGCGAAAAGGTCTTCCAGGTG 
AAAGTCTCAGCACCAGAGGAGCAATTCACTAGAGTTGGAGTCCAGGTTTTAGACCGAAAAGAT 
GGGTCCTTCATAGTAAGATACAGAATGTATGCAAGCTACAAAAATCTGAAGGTGGAAATTAAA
TTCCAAGGGCAACATGTGGCCAAATCCCCATATATTTTAAAAGGGCCGGTTTACCATGAGAAC 
TGTGACTGTCCTCTGCAAGATAGTGCAGCCTGGCTACGGGAGATGAACTGCCCTGAAACCATT 
GCTCAGATTCAGAGAGATCTGGCACATTTCCCTGCTGTGGATCCAGAAAAGATTGCAGTAGAA 
ATCCCAAAAAGATTTGGACAGAGGCAGAGCCTATGTCACTACACCTTAAAGGATAACAAGGTT 
TATATCAAGACTCATGGTGAACATGTAGGTTTTAGAATTTTCATGGATGCCATACTACTTTCT 
TTGACTAGAAAGGTGAAGATGCCAGATGTGGAGCTCTTTGTTAATTTGGGAGACTGGCCTTTG 
GAAAAAAAGAAATCCAATTCAAACATCCATCCGATCTTTTCCTGGTGTGGCTCCACAGATTCC
AAGGATATCGTGATGCCTACGTACGATTTGACTGATTCTGTTCTGGAAACCATGGGCCGGGTA 
AGTCTGGATATGATGTCCGTGCAAGCTAACACGGGTCCTCCCTGGGAAAGCAAAAATTCCACT 
GCCGTCTGGAGAGGGCGAGACAGCCGCAAAGAGAGACTCGAGCTGGTTAAACTCAGTAGAAAA 
CACCCAGAACTCATAGACGCTGCTTTCACCAACTTTTTCTTCTTTAAACACGATGAAAACCTG 
TATGGTCCCATTGTGAAACATATTTCATTTTTTGATTTCTTCAAGCATAAGTATCAAATAAAT 
ATCGATGGCACTGTAGCAGCTTATCGCCTGCCATATTTGCTAGTTGGTGACAGTGTTGTGCTG 
AAGCAGGATTCCATCTACTATGAACATTTTTACAATGAGCTGCAGCCCTGGAAACACTACATT 
CCAGTTAAGAGCAACCTGAGCGATCTGCTAGAAAAACTTAAATGGGCGAAAGATCACGATGAA 
GAGGCCAAAAAGATAGCAAAAGCAGGACAAGAATTTGCAAGAAATAATCTCATGGGCGATGAC 
ATATTCTGTTATTATTTCAAACTTTTCCAGGAATATGCCAATTTACAAGTGAGTGAGCCCCAA 
ATCCGAGAGGGCATGAAAAGGGTAGAACCACAGACTGAGGACGACCTCTTCCCTTGTACTTGC 
CATAGGAAAAAGACCAAAGATGAACTCTGATATGCAAAATAACTTCTATTAGAATAATGGTGC 
TCTGAAGACTCTTCTTAACTAAAAAGAAGAATTTTTTTAAGTATTAATTCCATGGACAATATA 
AAATCTGTGTGATTGTTTGCAGTATGAAGACACATTTCTACTTATGCAGTATTCTCATGACTG 
TACTTTAAAGTACATTTTTAGAATTTTATAATAAAACCACCTTTATTTTAAAGGAAAAAAA 
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MFGTLLLYCFFLATVPALAETGGERQLSPEKSEIWGPGLKADVVLPARYFYIQAVDTSGNKFT 
SSPGEKVFQVKVSAPEEQFTRVGVQVLDRKDGSFIVRYRMYASYKNLKVEIKFQGQHVAKSPY
ILKGPVYHENCDCPLQDSAAWLREMNCPETIAQIQRDLAHFPAVDPEKIAVEIPKRFGQRQSL 
CHYTLKDNKVYIKTHGEHVGFRIFMDAILLSLTRKVKMPDVELFVNLGDWPLEKKKSNSNIHP 
IFSWCGSTDSKDIVMPTYDLTDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDSRKE 
RLELVKLSRKHPELIDAAFTNFFFFKHDENLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLP 
YLLVGDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLKWAKDHDEEAKKIAKAGQE
FARNNLMGDDIFCYYFKLFQEYANLQVSEPQIREGMKRVEPQTEDDLFPCTCHRKKTKDEL 

Important features:
Signal peptide: 

amino acids 1-17 

N-glycosylation sites. 

amino acids 302-306, 414-418 

cAMP- and cGMP-dependent protein kinase phosphorylation sites.

amino acids 243-247, 495-499 

Tyrosine kinase phosphorylation site.

amino acids 341-348 

N-myristoylation sites.

amino acids 59-65, 118-124, 184-190, 258-264, 370-376, 439-445 



Endoplasmic reticulum targeting sequence. 

amino acids 499-504 
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FIGURE 549 



GGGTGATTGAACTAAACCTTCGCCGCACCGAGTTTGCAGTACGGCCGTCACCCGCACCGCTGC
CTGCTTGCGGTTGGAGAAATCAAGGCCCTACCGGGCCTCCGTAGTCACCTCTCTATAGTGGGC 
GTGGCCGAGGCCGGGGTGACCCTGCCGGAGCCTCCGCTGCCAGCGACATGTTCAAGGTAATTC
AGAGGTCCGTGGGGCCAGCCAGCCTGAGCTTGCTCACCTTCAAAGTCTATGCAGCACCAAAAA 
AGGACTCACCTCCCAAAAATTCCGTGAAGGTTGATGAGCTTTCACTCTACTCAGTTCCTGAGG 
GTCAATCGAAGTATGTGGAGGAGGCAAGGAGCCAGCTTGAAGAAAGCATCTCACAGCTCCGAC 
ACTATTGCGAGCCATACACAACCTGGTGTCAGGAAACGTACTCCCAAACTAAGCCCAAGATGC 
AAAGTTTGGTTCAATGGGGGTTAGACAGCTATGACTATCTCCAAAATGCACCTCCTGGATTTT 
TTCCGAGACTTGGTGTTATTGGTTTTGCTGGCCTTATTGGACTCCTTTTGGCTAGAGGTTCAA 
AAATAAAGAAGCTAGTGTATCCGCCTGGTTTCATGGGATTAGCTGCCTCCCTCTATTATCCAC 
AACAAGCCATCGTGTTTGCCCAGGTCAGTGGGGAGAGATTATATGACTGGGGTTTACGAGGAT 
ATATAGTCATAGAAGATTTGTGGAAGGAGAACTTTCAAAAGCCAGGAAATGTGAAGAATTCAC 
CTGGAACTAAGTAfiAAAACTCCATGCTCTGCCATCTTAATCAGTTATAGGTAAACATTGGAAA 
CTCCATAGAATAAATCAGTATTTCTACAGAAAAATGGCATAGAAGTCAGTATTGAATGTATTA 
AATTGGCTTTCTTCTTCAGGAAAAACTAGACCAGACCTCTGTTATCTTCTGTGAAATCATCCT 
ACAAGCAAACTAACCTGGAATCCCTTCACCTAGAGATAATGTACAAGCCTTAGAACTCCTCAT 
TCTCATGTTGCTATTTATGTACCTAATTAAAACCCAAGTTTAAAAAATAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAA 





WO 01/40466

PCT/US00/32678

550/550

FIGURE 550

MFKVIQRSVGPASLSLLTFKVYAAPKKDSPPKNSVKVDELSLYSVPEGQSKYVEEARSQLEES 

ISQLRHYCEPYTTWCQETYSQTKPKMQSLVQWGLDSYDYLQNAPPGFFPRLGVIGFAGLIGLL 

LARGSKIKKLVYPPGFMGLAASLYYPQQAIVFAQVSGERLYDWGLRGYIVIEDLWKENFQKPG 
NVKNSPGTK 

Important features:
Signal peptide: 

Amino acids 1-23

Transmembrane domain: 

Amino acids 111-130 

cAMP- and cGMP-dependent protein kinase phosphorylation site:
Amino acids 26-30 

Tyrosine kinase phosphorylation site:

Amino acids 36-44 

N-myristoylation sites:

Amino acids 124-130; 144-150; 189-195 
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